Gene Information |
Locus tag | Cthe_1579 |
Symbol | |
ID | 4809570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1908483 |
End bp | 1910045 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640106997 |
Product | ABC transporter related protein |
Protein accession | YP_001037998 |
Protein GI | 125974088 |
COG category | [R] General function prediction only |
COG ID | [COG3845] ABC-type uncharacterized transport systems, ATPase components |
TIGRFAM ID | |
|
|
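The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1908483
end_bp = 1910045

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")    # 1563 bp
print(protein_length, "aa")  # 520 aa
```

Both values agree with the Gene Length and Protein Length rows, and the gene length is a whole number of codons.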
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAAAAAA CAAGCCATAT TGATCCGAAC GTAGTTGCCG TTCAAATGAC TGACATCGTC AAAAAGTTCG GTAACTTCGT GGCAAACGAT CACATCAACC TAACCGTTTA CAAAGGCGAA GTTCACGCTA TTCTTGGCGA AAACGGCGCC GGAAAAAGTA CGCTGATGAA CATATTGTAC GGACTGTATA AACCGACATC CGGTAGCATC TCCATATTCG GTTCTCCTGT GCATATCGAG AGTCCGCGCC ATGCCATCGA ACTTGGCATC GGCATGGTGC ATCAGCACTT CATGCTGGTG GAACCCTTTA CCGTTACCGA AAACATTATT CTCGGCATGG AGCCCACCAA AGGTATGGTG GTGGACATCA AAAAGGCACG CCGGAAAGTG TTGGAACTGT CCGAGCGCTA TGGCATGCAT GTGGATCCCG ATGCCAAAAT CGAAGACATT TCGGTAGGAA TGCAGCAGCG TGTTGAAATC CTTAAAGTTC TGTACCGGGG CGCCAATATC CTTATTCTCG ACGAGCCCAC CGCCTCGCTG ACACCGCAGG AAATCGAGGA ACTGATCAAC ATCATCCACA ACCTCACAGC CGACGGCAAA ACCGTGTTGC TAATCACTCA TAAGCTGAAG GAAATCAAGG CTTCGGCTGA CAGGTGTACC ATTATCCGAC AGGGCAAATA TATCGGCACT GTCGATGTAG AAGCTGTCAG TGAACACGAT CTGGCTTCCA TGATGGTAGG CCGCGATGTT CAATTTGTGG TGGACAAGAA GCCCATTGAG CCAGGTGAAG TCATTATTGA CATACAGGAC CTGCACGCCA GGGATTACCG CGGCGTGGAA GTGCTTAAGG GTTTAAGCCT TAAAGTGCGC CGTGGTGAAA TCGTGGGTTT AGCCGGCGTA GACGGCAATG GTCAGACCGA GCTGGTGGAG ATTTTAACCG GTTTGCGGAA AGGCGAATCC GGAAAAGTCA TAATAGGCGG CAAAGATTTG TTCAACGCCG ATGCACGTAC ACTGTTTGAT AATGGCGTAT CCAGCATTCC GGCCGACCGG CAAAAGCACG GGCTGATACT GGATTATTCT GTAGCTTACA ATCTGGTTCT GCAAAATTAT GAACAACCCC CTTTCTCCAA ACGCGGTATT TTAAAGAAAG ATGCGATTTA CAAGCATGCG GCAGAGCTTG TCGAAAAATT TGACGTCCGG GGTGCTGACG GCGGCACCAA AGAAGCAGGC AAGCTGTCCG GCGGTAACCA GCAAAAGGTA ATCATTGCCC GTGAGGTTAC CAATGACAAG GATTTGCTTA TCGCCGTTAA TCCGACTCGC GGTTTGGACG TAGGCGCCAT CGAGTTTGTT CACCGTTATA TTGTGGAGCA GCGCAATAAA AATAAAGCTG TATTGCTGGT GTCCTTTGAA CTGGATGAAA TCATGAGCCT TTCCGACAGG ATTGAAGTCA TTTACAGCGG TAATATCGTA GGCAGTGTGC CAGGCCACGA GGCCGATGAA AAAGTCCTAG GACTGATGAT GGCAGGAGGA ACTAAAAATG AAACACAAAA ACAACCAGAA TAA
|
Protein sequence | MEKTSHIDPN VVAVQMTDIV KKFGNFVAND HINLTVYKGE VHAILGENGA GKSTLMNILY GLYKPTSGSI SIFGSPVHIE SPRHAIELGI GMVHQHFMLV EPFTVTENII LGMEPTKGMV VDIKKARRKV LELSERYGMH VDPDAKIEDI SVGMQQRVEI LKVLYRGANI LILDEPTASL TPQEIEELIN IIHNLTADGK TVLLITHKLK EIKASADRCT IIRQGKYIGT VDVEAVSEHD LASMMVGRDV QFVVDKKPIE PGEVIIDIQD LHARDYRGVE VLKGLSLKVR RGEIVGLAGV DGNGQTELVE ILTGLRKGES GKVIIGGKDL FNADARTLFD NGVSSIPADR QKHGLILDYS VAYNLVLQNY EQPPFSKRGI LKKDAIYKHA AELVEKFDVR GADGGTKEAG KLSGGNQQKV IIAREVTNDK DLLIAVNPTR GLDVGAIEFV HRYIVEQRNK NKAVLLVSFE LDEIMSLSDR IEVIYSGNIV GSVPGHEADE KVLGLMMAGG TKNETQKQPE
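The gene sequence begins with TTG while the protein sequence begins with M, which is consistent with TTG acting as an initiation codon under translation table 11. The sketch below illustrates this on the first and last few codons of the gene sequence. It is an illustration under stated assumptions, not the method used to produce this record: the function name `translate_cds`, its `has_start` parameter, and the `START_CODONS` set (a deliberately small subset of the alternative starts usually listed for table 11) are all hypothetical, and the amino acid assignments are assumed to match the standard code.

```python
from itertools import product

# Codon-to-amino-acid mapping with bases in TCAG order.
# Assumption: table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of initiation codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna, has_start=True):
    """Translate a coding sequence, ignoring the spacing used in the record.

    If has_start is true and the first codon is a recognised start codon,
    it is read as M. Translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 33 nucleotides of the gene sequence above.
head = "TTGGAAAAAA CAAGCCATAT TGATCCGAAC"
tail = "ACTAAAAATG AAACACAAAA ACAACCAGAA TAA"

print(translate_cds(head))                   # MEKTSHIDPN
print(translate_cds(tail, has_start=False))  # TKNETQKQPE
```

The two outputs match the first and last ten residues of the protein sequence, and the final TAA codon accounts for the one-codon difference between the gene length in codons and the protein length.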