Gene Lcho_0098 details


Gene Information

Locus tag: Lcho_0098
Symbol:
ID: 6161766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 101859
End bp: 103268
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 71%
IMG OID: 641662842
Product: UDP-N-acetylglucosamine pyrophosphorylase
Protein accession: YP_001789138
Protein GI: 171056789
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
TIGRFAM ID: [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase
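
The coordinates and lengths in this record are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are both inclusive and that the CDS ends with a single stop codon. The helper names `span_length` and `coding_codons` are hypothetical and exist only for this illustration.

```python
# Values copied from the record above.
START_BP = 101859
END_BP = 103268
GENE_LENGTH_BP = 1410
PROTEIN_LENGTH_AA = 469


def span_length(start: int, end: int) -> int:
    """Length of an interval whose start and end coordinates are both inclusive."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the CDS ends with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 103268 - 101859 + 1 gives 1410 bp, and 1410 / 3 - 1 gives 469 aa, matching the listed values.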


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.00133426
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGGCTTG ATGTCGTGAT CATGGCTGCC GGCAAGGGCA CACGGATGAA ATCGGCCCGA 
CCCAAGGTGC TGCACCCGCT GGCGGGGCGA GCCTTATTGC AGCATGTCCT GGAAATGGGC
GCCGGGCTGG GGGCGGATCG CCTCATCACC ATCACCGGCC ACGGCGCCGA GTCGGTCGAA
GCCGCCATGC GCGCCGCGCT GCCGGCCGCG CCGCTGGCCT TCGTGCGCCA GGAGCCGCAG
CTCGGCACCG GCCACGCCGT GCAGCAGGCG GTGCCCGCAT TGGGTGACGA GGGCACCACG
CTGATCCTCA ACGGCGACGT GCCGCTGGTG CGCCCCGAGA CCGCTCGCGC GCTGATCGCC
GCCTGCGGGG GCGAGAAGCT GGCACTGCTG ACCGTCGAAC TGGCCGACCC GACCGGCTAC
GGCCGCATCG TGCGTGACGC CGCCGACCGC GAGCGCGTGC TCGCCATCGT CGAGCACAAG
GACGCCACGC CCGAGCAGCG CGCCATCACC GAGGGCTACA CCGGCATGAT GGCCGTGCCG
ACCCGCCACC TCAAGCGCTG GCTGGCCGCG CTGCGCAACG ACAACGCGCA GAAGGAGTAC
TACCTGACCG ACATCGTCGC GATGGCCGAG GCCGACGGCG TGCCGGTGGT CGCCACGCTG
GCGGGCAACG AGACCGAGGT GCTGGGCGTC AACAGCCCGC TGCAGCTGGC CGAACTCGAG
CGGCGCTTCC AGCGCGTGCA GGCCGAATCG CTGATGGAAG CCGGCGTGCG CCTGATGGAC
CCGGCGCGCT TCGACCTGCG CGGCGTGCTG CGCTGCGGCC GCGACGTCGC GATCGACGTC
AACTGCGTCT TCGAGGGCGA GGTCGAGCTC GGCGACGAGG TGCAGATCGG CGCCAACTGC
GTGATCCGCA ACGCCCGCAT CGCCGCCGGC GCGGTGATCC ACCCCTTCAC CCACATCGAC
GGCGAAGCCG CCGGCGTCGA GGTCGGCGAG GGCGCGCTGA TCGGCCCGTT CGCGCGCCTG
CGCCCGGGCG CCAGGCTGGG CCGTGCAGTC CACATCGGCA ACTTCGTCGA GGTCAAGAAC
TCCACGCTGG CCGACGGCGC CAAGGCCAAT CACCTGGCCT ACCTGGGCGA CGCCACGGTG
GGTGAACGTG TCAACTACGG CGCCGGCAGC ATCACCGCCA ACTACGACGG CGCCAACAAG
CACCGCACCG TGATCGAGGC CGACGTGCAC ATCGGCAGCA ACTGCGTGCT GGTGGCGCCG
GTGACGATCG GCGCCGGTGC CACGGTGGGC GGCGGCTCGA CCATCACCAA GGACGTGGCG
CCCGGCCAGC TCGGCGTGGC GCGTGGCAAG CAGGTGGTGC TCGACGGCTG GGTGCGCCCG
AGCAAGAACA AGCCCGCCAA GCCGGCCTGA
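
The gene sequence above is laid out in blocks of ten bases separated by spaces, so the whitespace has to be stripped before computing anything from it. A minimal sketch of a GC-content calculation on text in this layout follows. The function names `clean_sequence` and `gc_fraction` are hypothetical, and how the database itself derives its GC content figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, ignoring layout whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence above.
    print(gc_fraction("ATGAGGCTTG ATGTCGTGAT"))
```

Applied to the first two blocks only, this gives 9 G/C bases out of 20. Pasting in the full 1410 bp sequence would give the gene-wide value to compare against the listed GC content.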
 
Protein sequence
MRLDVVIMAA GKGTRMKSAR PKVLHPLAGR ALLQHVLEMG AGLGADRLIT ITGHGAESVE 
AAMRAALPAA PLAFVRQEPQ LGTGHAVQQA VPALGDEGTT LILNGDVPLV RPETARALIA
ACGGEKLALL TVELADPTGY GRIVRDAADR ERVLAIVEHK DATPEQRAIT EGYTGMMAVP
TRHLKRWLAA LRNDNAQKEY YLTDIVAMAE ADGVPVVATL AGNETEVLGV NSPLQLAELE
RRFQRVQAES LMEAGVRLMD PARFDLRGVL RCGRDVAIDV NCVFEGEVEL GDEVQIGANC
VIRNARIAAG AVIHPFTHID GEAAGVEVGE GALIGPFARL RPGARLGRAV HIGNFVEVKN
STLADGAKAN HLAYLGDATV GERVNYGAGS ITANYDGANK HRTVIEADVH IGSNCVLVAP
VTIGAGATVG GGSTITKDVA PGQLGVARGK QVVLDGWVRP SKNKPAKPA
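
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI-style amino-acid string; translation table 11 uses the same codon assignments as the standard code and differs only in its set of permitted start codons. The function name `translate_cds` is hypothetical. The sketch assumes the sequence is already in the coding orientation and does not handle alternative start codons, which is not needed here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a block-formatted coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("ATGAGGCTTG ATGTCGTGAT CATGGCTGCC"))
    # Last 30 bases, ending in the TGA stop codon.
    print(translate_cds("AGCAAGAACA AGCCCGCCAA GCCGGCCTGA"))
```

The first call yields MRLDVVIMAA and the second yields SKNKPAKPA, matching the first and last residues of the protein sequence listed above.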