Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lcho_2070 |
Symbol | |
ID | 6163529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | + |
Start bp | 2251478 |
End bp | 2253766 |
Gene Length | 2289 bp |
Protein Length | 762 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641664839 |
Product | cellulose synthase regulator protein |
Protein accession | YP_001791102 |
Protein GI | 171058753 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.00029094 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCACAC CCTCCACCAT GCAGCATCGA AACGCCAGCA GTTTTCACGC GTTTGTCGCA GCTGTCCTGC TCGGCGGCAT CCCTCTGCAA GGCGGTTCCG CCCCGGCGCC GACCGATGCC GTCGCCGCGG TTGCGGCGGC GCCCGCCGAC ACGAGCACAC GCATCCAGAC CCTCAGTCTG CGAGACCTCG GCGCCCAGCG CCCGATCGAA CTGCGCGGCA TCGATCACAG CGTCTACCTG CCGCTGTCGG TGCGCCTCGA CGAAACCGTG ACGCGGGCGC GTCTGAAGCT CAACTACACC TTTTCTCCCG GCCTGCTGGC CGATCTGTCG CAGCTGAAGG TGTTCGTCAA CGACGAGGTG CTGGCCACGG TGCCGGCCGT CAAGGGCAAG CTCGGCAGCC CGCAGACGAT CGAGATCGAC CTGGAACCGC ACTTCTTCAC CGAGTACGCC CAGCTGCGGC TGCAGTTCAT CGGCCACTAC ACGCTCGATT GCGAATACCC GTTCCACAGC AGCCTGTGGG CCAACATCAG CAACCAGAGC AGCCTCGAGA TGACGACGCA GCGCCTGCCG CTGCGCAACG ACCTGGCGCT GCTGCCGGCG CCCTTCTTCG ACGTGCGCGA CGGCACCCGC CTGAAGCTGC CGTTCGTGAT CGCGCCGTCG CCCTCGCCGG AGGCCCTGCG CAGCGCCGGG GTGCTGGCGA GCTGGTTCGG CGCGCTCGCC AGCTATCGCG GCGCGCAGTT CCCGGTGTCC GACGCGCTGC CCGCAGGCCA TGCCGTGGTG ATGGCCACCA ACGACCAGCG CCCGGCCGGC CTCGACCTGC CCCAGGTCGA GGCACCGACG CTCAAGATCG TGCCGCACCC GTCGGATCCG GGCGCCAAGC TGCTGCTGGT GCTCGGCCGC GACGCCGCCC AGCTCAAGAC CGCCACCGAC GCGCTGGTGC TGGGCCAGGC GGCCCTCAGC GGCGACCAGG TGCAGGTCGA CAAGCTCAAC TACCCGGCGC GACTGGCCGC GCACGAGGCG CCCAACATCG TCAAGACCGG CGGCATCGTG CGCCTGGGCA GCCTGGTGCC GAACGCCGCC GCGCTGCAGG TGGCGGGCGC GCAGCTGCAG ACCATCCGCC TGCCGCTGCG CCTGCCGGCC GACACCTTCG CCTGGCAGTC CGAGGGCCTG CCGATCGAGC TGCGCTACCG CTACACGCCG CCGCACGAGC TGGGCGCGGC GCGGCTGGCG GTGCAGATCA ACGAGCAGCT GGTCGAGTCC TTCCTGCTGC GCCCGGCCGG CACCCAGGGC ACGCGCAGCC AGCGCATGGT GCTGCCCTTT CTCGAGATCG GGGGCACCTA CGACCGCCAG GACATCACCG TGCCGGCGTT CCAGCTGGGC AACAACAACG AACTGCAGTT CCGCTTCGAG ATCCCGCCGG TCGACACCTC GCGCTGCCGC GAGACGGTGC TGATCTCGCA GGCCGGCATC GACGCCGACT CCACCATCGA CCTGCGCAAC GTCGAGCACT ACGTCACCCT GCCGAACCTG GCGCTGTTCG CCAACAGCGG TTTCCCGTTC ACCAAGTTCG CCGACCTGGC CGAGACCGCG CTGATCCTGT CGGACGCACC GGCCGTGGCC GAGATCGAGG CCGCACTCAA CCTGCTCGGC CAGATGGGTG CGGCCACCGG CATCGCCGGC ACGCGGCTGC AGGTGCTGCC GGCCAGCCGG GTCAAGGAGG CGGCCGACCG CGACCTGCTG GTGATCGCCA GCGGCAGCGC ACCGGCGCCG CTGGCCAACT GGTCGCAGCA CCTGCCGGCG CGACTCGACG CCAGCCGGCG CAGCAACACC TCGCTGACCC GGTTGAGCGA CGCCGGCTCC GAATGGTTCT CGGGTGCCCT GCCGCGCAAC TTTCCCGACG ACGAGTGGGC CGAGCTGCAG GCCAAGGGGC CGCTGACGGC GCTGATGGGT TTCGAGTCGC CTCTCAACAG CGGCCGCAGC GTGGTCGCCC TGCAGGCGAC CGAAGGCGCG TCCCTGCAGC AGGCCGCCGC CACCCTGCTC GACCCCGGCA AGATCCGCCT GATCCAGGGC GACCTGGTGC TGATGCGCAA CGAGGCGTTC GAGGCCTTCC GCATCGGCGA GGTCTACCAG GTGGGCGAAC TGCGCTGGTG GCGCTGGGTC TGGTACCAGA TGCGCGGACA CCCGCTGCTG ATGGTGCTGC TGGTGGGCCT GGTGTCCTTG TTGCTGGCCC TGCCGCTGTA CCGCGCACTG CGCATGCGGG CCGAACGTCG CGTGCGCAGC GAGACCTGA
|
Protein sequence | MATPSTMQHR NASSFHAFVA AVLLGGIPLQ GGSAPAPTDA VAAVAAAPAD TSTRIQTLSL RDLGAQRPIE LRGIDHSVYL PLSVRLDETV TRARLKLNYT FSPGLLADLS QLKVFVNDEV LATVPAVKGK LGSPQTIEID LEPHFFTEYA QLRLQFIGHY TLDCEYPFHS SLWANISNQS SLEMTTQRLP LRNDLALLPA PFFDVRDGTR LKLPFVIAPS PSPEALRSAG VLASWFGALA SYRGAQFPVS DALPAGHAVV MATNDQRPAG LDLPQVEAPT LKIVPHPSDP GAKLLLVLGR DAAQLKTATD ALVLGQAALS GDQVQVDKLN YPARLAAHEA PNIVKTGGIV RLGSLVPNAA ALQVAGAQLQ TIRLPLRLPA DTFAWQSEGL PIELRYRYTP PHELGAARLA VQINEQLVES FLLRPAGTQG TRSQRMVLPF LEIGGTYDRQ DITVPAFQLG NNNELQFRFE IPPVDTSRCR ETVLISQAGI DADSTIDLRN VEHYVTLPNL ALFANSGFPF TKFADLAETA LILSDAPAVA EIEAALNLLG QMGAATGIAG TRLQVLPASR VKEAADRDLL VIASGSAPAP LANWSQHLPA RLDASRRSNT SLTRLSDAGS EWFSGALPRN FPDDEWAELQ AKGPLTALMG FESPLNSGRS VVALQATEGA SLQQAAATLL DPGKIRLIQG DLVLMRNEAF EAFRIGEVYQ VGELRWWRWV WYQMRGHPLL MVLLVGLVSL LLALPLYRAL RMRAERRVRS ET
|
| |