Gene Lcho_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_2072 
Symbol 
ID6163531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp2254923 
End bp2258657 
Gene Length3735 bp 
Protein Length1244 aa 
Translation table11 
GC content75% 
IMG OID641664841 
Productcellulose synthase domain-containing protein 
Protein accessionYP_001791104 
Protein GI171058755 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4235] Cytochrome c biogenesis factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00026145 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGACGCA CCGCCTGGAT CCTCGCGCTG ATCCTGCTGC CGACAGGCAG CCCCGCCGCG 
CCCGGCGACG CCCAGCAGCG CGGCGTCAAC GAGCTGATCG ACAGCGCCCA GCTCTGGCAG
ACCCGTCGCC GCCCCGACCT GGCGCGCAGC CTGCTGCAGA AGGTGCTGGC GGTGCGGGCC
GACGAACCGC GCGCGCTGCT GCTGCTCGGC GAGATCGACC TGCGCAGCGG CCGCAAGGAT
GCCGCGCTGC CGCAGCTGGC CATCCTGCAG GCGCGCCACC CGGACAGCGC CGAACTGCGC
GAACTGCAGC AGCTCGCCTC GCTCTACAAC AACGACCCGC TGCGCCTGAG CCGCCTGCAG
CAGCTGCGCG AAGCCGGCCG CAACGCCGAG GCGGCCAGCC TGGCGCGCCA GATCTTCCCC
AGCGCCAGGC CGCCGGGCAT GCTGGGCGCC GATCTCGCCG GCCTGATCGC CAGCACGCCG
GGCGCCTGGG AGGTGAGCCG CCGGCAAGCC GAGGAACGGG TCGCCGCCAG CAATTCGGCC
ACGAACCGGC TGGCGCTGGC CGAGGTGCTG GCGCTGCGAC CGGCCACCCG CGCGCGTGCC
TGGGAAACCT ACAACCTGCT GGCCCGCGAA GGCCAACTGC CGAGCGACAC GGTGGCCTCG
TCCTGGCGCC GCTCGATCCG CATGCAGCCG CCGGGCACGC CGGTCGGGCC GCTCTGGGCC
TCGCTGAACA AGGCCATCGA CGGCCCCGGC GCTGCGCTGA CCACCGCCGC CGGGCCCCGC
TCGGCGGGGG CGAACGCAGC CGTCGTGACC GCCGCGATCG CTGCGCTGAA CGGCGAGGTC
GACGGTGCGG CCGGCAGGCC GGGCAACGGC GGTAATGGCG GTAATGGCCG GAACGGCAAC
GCGACCTCGG CCAACGGCAA CGGCAACGGC GCAACGGCCG GCGCTGCGCC GGCGCCGATC
AGCGCGCGGG CCGTGGCCAT CCAGCAGGCG CTGGCCGACG CCCAGAAACA GCTTGCGGCC
GGCGCCAACG AACAGGCTGC CAGGCTGCTC GAAGCCAGCC TGGTGCGCAA CGGCGACGAC
GGGCCGACCT GGGGTCTGCT GGGCCTGTCG CGCATGCGCG AGGGCCAGCA CGCCGAGGCC
GAGCCGGCGT TCGCCCGCGC CTTGCGCCTC GATCCGGGCG ATGCCGACCG CTGGCGCTCG
CTCGGCGTGG CGGCGCACTA CTGGGCCACG GTCAGCCTGG CGCGCATCGA GGCCGACGTC
GGCCGCACCG AGGTCGCCGC CGATCTGCTG CGTCCGGTGG CCGACACCCA GCCCACCGCG
ATCGAGGCCA AGTTGCTGCT GGCCCGGCTC GAGGGTGAAC TCGGCCAGGA CGAGATCGCC
CTCGAGCACT TCGAGCAGGT GCTCGCTGTC GAGCCGAAAG AAGAGCGCGC CTGGCGCGGC
CGCTTCGTCG TGCGCCTGAA GACCGACCCC GAAGCCGCGC TGACCGAACT GGCCGCCTTC
GGCCCGGGCG CGGCCGGGAT CGTCGACGGC GGCGCGGTGC GCGATCTGGC CGATGGCCGC
ATCGCCCAGG GGCGCACCAG CGCGGCCTTG CGGCTGCTCG AGCAGGCCAT CGACGCGGCG
CCGGGCGACC CCTGGCTGCG TCACGACGCG GCCCGCATCT ACCTGCGGCT GGGCCTGCCC
GACAGTGCCC GCTCGCTGAT GGCCGAAGGC CGTGCCGCGG CCGACCCCGA CAACGCCGTC
TCGATGCTGC ACGCCAGCGC GCTGGTGGCG CTGGCCGCCG ACGAGAACAA CCTGGCGGTC
GAGCTGGTCT CGGCCCTGCC CGAGGCCGAA GCCGACGGCC CGCTGCGCGA TCTGGTGCGG
CGCGCCCGGC TCGAGCAGGC CTTTGCCAAC GCCCGGCAAG CGATCGACGC CCAGCGCCCG
ACGATGGCGC ACGACTGGCT CGACAAGGCC GACCGCCTGA TCGACAGCAG CGCCGGGCAA
GGCATGCGGC TGGCCCGGCT GCGACTGGGC GCCGGCCAGC GGCAGCAGGC GCTGGCGCTG
CTGCGACGCA TCGACCCGGT CGAGCTGGAC GCCGCCGACA CGCGGGCCTG GGCCGGCATG
GTGGCCGATG CCGGCGCCGC CGAGCTGGCG CTCGACCGCC TGCAGGCGCG CATCGACGCC
AACCTCGGCG CGCGCAACCT CATGCTCGCC AGCCCGGCCG AGCCAGGCGC GGTCGACGGC
AACGAGCCCC CGCCCCTGCC GGTCTGGAAC ACCGACGACC TGACGCAGGC ACGGCTGGCG
CACGCCGATC TCGCCATCGA GGCCGACCTG CCGCAGACCG CCCGCCAGGA TCTGCAGCGG
CTGGCCGCCG ACCTGCCCGC CGATGCGCTC GACGATCGCC TGCGGCTGCT GCAGCTGCAG
CGGCGGGTGG GCGACACGTC GGCCGCGCGG GCCACGCTGG CCGACCTGAT GGCGCGCTCG
GGCGACGATC CGGCCGTGCG CATCGAGGCC GCCCGGCAGG CCCGCATCGA CGACGACGAC
ACCGCCGCTC GCGCCCACCT GCAGCTGGCG CGCGAGCGCA GCGCACCGGG CAGCGAGACC
GCCGCCGAGG CCAGCCGCGC GATCGACGTG ATCGACGCCG AGCGCCAGCC GCTGTTCGAG
ATGGCGATGC ACGACGCCTC GCTCGACGGC AGCGACGGCC GCGCCCGCCT GCACAGCAAC
GAGGCCACCG CCCGCCTGAG CTGGCCGGGC GTGGCCAACG GCACCTTCTT CACGCAGGTC
GACCTGATCC GGCTCGATGC CGGCACGCTG GCAGCGGACA TCGACCAGTC CGAGGCCTTC
GGCCGCGTGC TGGCGAGCAG CCCGGCCGGG CTGGCCGCCG CGGCACCGCA GAACGACCGC
GGCGCCGCCC TCGGCGTGGG CTGGAGCAGC CGCGACGACA GCGTCGACAT CGGCGTGGTC
GGCCTGCGGC TGCGCAACTG GGTGGGCGGC TGGTCGCACA CCCGCCGGCG TGACGACGGC
GGATGGGGCG TCGAGATCTC GCGCCGCGTC CTGACGGGCA GCCTGATGTC GTGGGCCGGC
GTGCAGGATC CGGTCGACGG GGCGGTCTGG GGCGGCGTGA CGCTCAATGC GCTGACGCTG
CGCACCGAGC ATGTGCTGAG CCCGGCCGAT TCGGTCTCCG CCAGCCTGCG CCTGGGCCTG
CTGCAGGGCC GCAACGTGGC GAGCAACCGG ATGCAGCAGC TGCGCGCGGC GTACGACCAC
GACCTGCAGC GCAGCGACGA TCACCGCCTG CGCATCGGCC TGAACGCCAA CGTCTGGCTC
TATGCGCGCA ACCTGAGTTT TCACACCTTC GGCCAGGGCG GCTACTACAG CCCGCAGCGT
TACGTGTCGA TCGGCGTGCC GCTCGAAGCC GCCGGCATGA ACGGCCGGCT GAGCTACCAG
GTGCGCGCCA GCGTGTCGCA GTCCTGGACC CGCGAGGACG ACACGCCCTA CTACCCGACC
GACCCGGCGG CGCAGGCCGC GGCCGGCAAT CCGATCCACA CCGGTGGCCC GGGCGGCGGG
ACGGGCGCCT CGCTGCGGGC GGCGCTGGAG TGGCGGGTCG CCCCGCAATG GGCGATCGGC
ACGGCATTGG CGCTCGAACG CTCGACCGAC TACACGCCCG GGCGCGCCAG CGTCTACCTG
CGCCACTGGC TCGGCCGCGT GCCGGCGCCG CTGGACTGGC CGCCGCAGCC GCTGACGCCC
TATCTGCGCA ACTGA
 
Protein sequence
MRRTAWILAL ILLPTGSPAA PGDAQQRGVN ELIDSAQLWQ TRRRPDLARS LLQKVLAVRA 
DEPRALLLLG EIDLRSGRKD AALPQLAILQ ARHPDSAELR ELQQLASLYN NDPLRLSRLQ
QLREAGRNAE AASLARQIFP SARPPGMLGA DLAGLIASTP GAWEVSRRQA EERVAASNSA
TNRLALAEVL ALRPATRARA WETYNLLARE GQLPSDTVAS SWRRSIRMQP PGTPVGPLWA
SLNKAIDGPG AALTTAAGPR SAGANAAVVT AAIAALNGEV DGAAGRPGNG GNGGNGRNGN
ATSANGNGNG ATAGAAPAPI SARAVAIQQA LADAQKQLAA GANEQAARLL EASLVRNGDD
GPTWGLLGLS RMREGQHAEA EPAFARALRL DPGDADRWRS LGVAAHYWAT VSLARIEADV
GRTEVAADLL RPVADTQPTA IEAKLLLARL EGELGQDEIA LEHFEQVLAV EPKEERAWRG
RFVVRLKTDP EAALTELAAF GPGAAGIVDG GAVRDLADGR IAQGRTSAAL RLLEQAIDAA
PGDPWLRHDA ARIYLRLGLP DSARSLMAEG RAAADPDNAV SMLHASALVA LAADENNLAV
ELVSALPEAE ADGPLRDLVR RARLEQAFAN ARQAIDAQRP TMAHDWLDKA DRLIDSSAGQ
GMRLARLRLG AGQRQQALAL LRRIDPVELD AADTRAWAGM VADAGAAELA LDRLQARIDA
NLGARNLMLA SPAEPGAVDG NEPPPLPVWN TDDLTQARLA HADLAIEADL PQTARQDLQR
LAADLPADAL DDRLRLLQLQ RRVGDTSAAR ATLADLMARS GDDPAVRIEA ARQARIDDDD
TAARAHLQLA RERSAPGSET AAEASRAIDV IDAERQPLFE MAMHDASLDG SDGRARLHSN
EATARLSWPG VANGTFFTQV DLIRLDAGTL AADIDQSEAF GRVLASSPAG LAAAAPQNDR
GAALGVGWSS RDDSVDIGVV GLRLRNWVGG WSHTRRRDDG GWGVEISRRV LTGSLMSWAG
VQDPVDGAVW GGVTLNALTL RTEHVLSPAD SVSASLRLGL LQGRNVASNR MQQLRAAYDH
DLQRSDDHRL RIGLNANVWL YARNLSFHTF GQGGYYSPQR YVSIGVPLEA AGMNGRLSYQ
VRASVSQSWT REDDTPYYPT DPAAQAAAGN PIHTGGPGGG TGASLRAALE WRVAPQWAIG
TALALERSTD YTPGRASVYL RHWLGRVPAP LDWPPQPLTP YLRN