Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lcho_2072 |
Symbol | |
ID | 6163531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | + |
Start bp | 2254923 |
End bp | 2258657 |
Gene Length | 3735 bp |
Protein Length | 1244 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641664841 |
Product | cellulose synthase domain-containing protein |
Protein accession | YP_001791104 |
Protein GI | 171058755 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4235] Cytochrome c biogenesis factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00026145 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCGACGCA CCGCCTGGAT CCTCGCGCTG ATCCTGCTGC CGACAGGCAG CCCCGCCGCG CCCGGCGACG CCCAGCAGCG CGGCGTCAAC GAGCTGATCG ACAGCGCCCA GCTCTGGCAG ACCCGTCGCC GCCCCGACCT GGCGCGCAGC CTGCTGCAGA AGGTGCTGGC GGTGCGGGCC GACGAACCGC GCGCGCTGCT GCTGCTCGGC GAGATCGACC TGCGCAGCGG CCGCAAGGAT GCCGCGCTGC CGCAGCTGGC CATCCTGCAG GCGCGCCACC CGGACAGCGC CGAACTGCGC GAACTGCAGC AGCTCGCCTC GCTCTACAAC AACGACCCGC TGCGCCTGAG CCGCCTGCAG CAGCTGCGCG AAGCCGGCCG CAACGCCGAG GCGGCCAGCC TGGCGCGCCA GATCTTCCCC AGCGCCAGGC CGCCGGGCAT GCTGGGCGCC GATCTCGCCG GCCTGATCGC CAGCACGCCG GGCGCCTGGG AGGTGAGCCG CCGGCAAGCC GAGGAACGGG TCGCCGCCAG CAATTCGGCC ACGAACCGGC TGGCGCTGGC CGAGGTGCTG GCGCTGCGAC CGGCCACCCG CGCGCGTGCC TGGGAAACCT ACAACCTGCT GGCCCGCGAA GGCCAACTGC CGAGCGACAC GGTGGCCTCG TCCTGGCGCC GCTCGATCCG CATGCAGCCG CCGGGCACGC CGGTCGGGCC GCTCTGGGCC TCGCTGAACA AGGCCATCGA CGGCCCCGGC GCTGCGCTGA CCACCGCCGC CGGGCCCCGC TCGGCGGGGG CGAACGCAGC CGTCGTGACC GCCGCGATCG CTGCGCTGAA CGGCGAGGTC GACGGTGCGG CCGGCAGGCC GGGCAACGGC GGTAATGGCG GTAATGGCCG GAACGGCAAC GCGACCTCGG CCAACGGCAA CGGCAACGGC GCAACGGCCG GCGCTGCGCC GGCGCCGATC AGCGCGCGGG CCGTGGCCAT CCAGCAGGCG CTGGCCGACG CCCAGAAACA GCTTGCGGCC GGCGCCAACG AACAGGCTGC CAGGCTGCTC GAAGCCAGCC TGGTGCGCAA CGGCGACGAC GGGCCGACCT GGGGTCTGCT GGGCCTGTCG CGCATGCGCG AGGGCCAGCA CGCCGAGGCC GAGCCGGCGT TCGCCCGCGC CTTGCGCCTC GATCCGGGCG ATGCCGACCG CTGGCGCTCG CTCGGCGTGG CGGCGCACTA CTGGGCCACG GTCAGCCTGG CGCGCATCGA GGCCGACGTC GGCCGCACCG AGGTCGCCGC CGATCTGCTG CGTCCGGTGG CCGACACCCA GCCCACCGCG ATCGAGGCCA AGTTGCTGCT GGCCCGGCTC GAGGGTGAAC TCGGCCAGGA CGAGATCGCC CTCGAGCACT TCGAGCAGGT GCTCGCTGTC GAGCCGAAAG AAGAGCGCGC CTGGCGCGGC CGCTTCGTCG TGCGCCTGAA GACCGACCCC GAAGCCGCGC TGACCGAACT GGCCGCCTTC GGCCCGGGCG CGGCCGGGAT CGTCGACGGC GGCGCGGTGC GCGATCTGGC CGATGGCCGC ATCGCCCAGG GGCGCACCAG CGCGGCCTTG CGGCTGCTCG AGCAGGCCAT CGACGCGGCG CCGGGCGACC CCTGGCTGCG TCACGACGCG GCCCGCATCT ACCTGCGGCT GGGCCTGCCC GACAGTGCCC GCTCGCTGAT GGCCGAAGGC CGTGCCGCGG CCGACCCCGA CAACGCCGTC TCGATGCTGC ACGCCAGCGC GCTGGTGGCG CTGGCCGCCG ACGAGAACAA CCTGGCGGTC GAGCTGGTCT CGGCCCTGCC CGAGGCCGAA GCCGACGGCC CGCTGCGCGA TCTGGTGCGG CGCGCCCGGC TCGAGCAGGC CTTTGCCAAC GCCCGGCAAG CGATCGACGC CCAGCGCCCG ACGATGGCGC ACGACTGGCT CGACAAGGCC GACCGCCTGA TCGACAGCAG CGCCGGGCAA GGCATGCGGC TGGCCCGGCT GCGACTGGGC GCCGGCCAGC GGCAGCAGGC GCTGGCGCTG CTGCGACGCA TCGACCCGGT CGAGCTGGAC GCCGCCGACA CGCGGGCCTG GGCCGGCATG GTGGCCGATG CCGGCGCCGC CGAGCTGGCG CTCGACCGCC TGCAGGCGCG CATCGACGCC AACCTCGGCG CGCGCAACCT CATGCTCGCC AGCCCGGCCG AGCCAGGCGC GGTCGACGGC AACGAGCCCC CGCCCCTGCC GGTCTGGAAC ACCGACGACC TGACGCAGGC ACGGCTGGCG CACGCCGATC TCGCCATCGA GGCCGACCTG CCGCAGACCG CCCGCCAGGA TCTGCAGCGG CTGGCCGCCG ACCTGCCCGC CGATGCGCTC GACGATCGCC TGCGGCTGCT GCAGCTGCAG CGGCGGGTGG GCGACACGTC GGCCGCGCGG GCCACGCTGG CCGACCTGAT GGCGCGCTCG GGCGACGATC CGGCCGTGCG CATCGAGGCC GCCCGGCAGG CCCGCATCGA CGACGACGAC ACCGCCGCTC GCGCCCACCT GCAGCTGGCG CGCGAGCGCA GCGCACCGGG CAGCGAGACC GCCGCCGAGG CCAGCCGCGC GATCGACGTG ATCGACGCCG AGCGCCAGCC GCTGTTCGAG ATGGCGATGC ACGACGCCTC GCTCGACGGC AGCGACGGCC GCGCCCGCCT GCACAGCAAC GAGGCCACCG CCCGCCTGAG CTGGCCGGGC GTGGCCAACG GCACCTTCTT CACGCAGGTC GACCTGATCC GGCTCGATGC CGGCACGCTG GCAGCGGACA TCGACCAGTC CGAGGCCTTC GGCCGCGTGC TGGCGAGCAG CCCGGCCGGG CTGGCCGCCG CGGCACCGCA GAACGACCGC GGCGCCGCCC TCGGCGTGGG CTGGAGCAGC CGCGACGACA GCGTCGACAT CGGCGTGGTC GGCCTGCGGC TGCGCAACTG GGTGGGCGGC TGGTCGCACA CCCGCCGGCG TGACGACGGC GGATGGGGCG TCGAGATCTC GCGCCGCGTC CTGACGGGCA GCCTGATGTC GTGGGCCGGC GTGCAGGATC CGGTCGACGG GGCGGTCTGG GGCGGCGTGA CGCTCAATGC GCTGACGCTG CGCACCGAGC ATGTGCTGAG CCCGGCCGAT TCGGTCTCCG CCAGCCTGCG CCTGGGCCTG CTGCAGGGCC GCAACGTGGC GAGCAACCGG ATGCAGCAGC TGCGCGCGGC GTACGACCAC GACCTGCAGC GCAGCGACGA TCACCGCCTG CGCATCGGCC TGAACGCCAA CGTCTGGCTC TATGCGCGCA ACCTGAGTTT TCACACCTTC GGCCAGGGCG GCTACTACAG CCCGCAGCGT TACGTGTCGA TCGGCGTGCC GCTCGAAGCC GCCGGCATGA ACGGCCGGCT GAGCTACCAG GTGCGCGCCA GCGTGTCGCA GTCCTGGACC CGCGAGGACG ACACGCCCTA CTACCCGACC GACCCGGCGG CGCAGGCCGC GGCCGGCAAT CCGATCCACA CCGGTGGCCC GGGCGGCGGG ACGGGCGCCT CGCTGCGGGC GGCGCTGGAG TGGCGGGTCG CCCCGCAATG GGCGATCGGC ACGGCATTGG CGCTCGAACG CTCGACCGAC TACACGCCCG GGCGCGCCAG CGTCTACCTG CGCCACTGGC TCGGCCGCGT GCCGGCGCCG CTGGACTGGC CGCCGCAGCC GCTGACGCCC TATCTGCGCA ACTGA
|
Protein sequence | MRRTAWILAL ILLPTGSPAA PGDAQQRGVN ELIDSAQLWQ TRRRPDLARS LLQKVLAVRA DEPRALLLLG EIDLRSGRKD AALPQLAILQ ARHPDSAELR ELQQLASLYN NDPLRLSRLQ QLREAGRNAE AASLARQIFP SARPPGMLGA DLAGLIASTP GAWEVSRRQA EERVAASNSA TNRLALAEVL ALRPATRARA WETYNLLARE GQLPSDTVAS SWRRSIRMQP PGTPVGPLWA SLNKAIDGPG AALTTAAGPR SAGANAAVVT AAIAALNGEV DGAAGRPGNG GNGGNGRNGN ATSANGNGNG ATAGAAPAPI SARAVAIQQA LADAQKQLAA GANEQAARLL EASLVRNGDD GPTWGLLGLS RMREGQHAEA EPAFARALRL DPGDADRWRS LGVAAHYWAT VSLARIEADV GRTEVAADLL RPVADTQPTA IEAKLLLARL EGELGQDEIA LEHFEQVLAV EPKEERAWRG RFVVRLKTDP EAALTELAAF GPGAAGIVDG GAVRDLADGR IAQGRTSAAL RLLEQAIDAA PGDPWLRHDA ARIYLRLGLP DSARSLMAEG RAAADPDNAV SMLHASALVA LAADENNLAV ELVSALPEAE ADGPLRDLVR RARLEQAFAN ARQAIDAQRP TMAHDWLDKA DRLIDSSAGQ GMRLARLRLG AGQRQQALAL LRRIDPVELD AADTRAWAGM VADAGAAELA LDRLQARIDA NLGARNLMLA SPAEPGAVDG NEPPPLPVWN TDDLTQARLA HADLAIEADL PQTARQDLQR LAADLPADAL DDRLRLLQLQ RRVGDTSAAR ATLADLMARS GDDPAVRIEA ARQARIDDDD TAARAHLQLA RERSAPGSET AAEASRAIDV IDAERQPLFE MAMHDASLDG SDGRARLHSN EATARLSWPG VANGTFFTQV DLIRLDAGTL AADIDQSEAF GRVLASSPAG LAAAAPQNDR GAALGVGWSS RDDSVDIGVV GLRLRNWVGG WSHTRRRDDG GWGVEISRRV LTGSLMSWAG VQDPVDGAVW GGVTLNALTL RTEHVLSPAD SVSASLRLGL LQGRNVASNR MQQLRAAYDH DLQRSDDHRL RIGLNANVWL YARNLSFHTF GQGGYYSPQR YVSIGVPLEA AGMNGRLSYQ VRASVSQSWT REDDTPYYPT DPAAQAAAGN PIHTGGPGGG TGASLRAALE WRVAPQWAIG TALALERSTD YTPGRASVYL RHWLGRVPAP LDWPPQPLTP YLRN
|
| |