Gene Information |
Locus tag | Cthe_1580 |
Symbol | |
ID | 4809571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1910020 |
End bp | 1911138 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640106998 |
Product | inner-membrane translocator |
Protein accession | YP_001037999 |
Protein GI | 125974089 |
COG category | [R] General function prediction only |
COG ID | [COG4603] ABC-type uncharacterized transport system, permease component |
TIGRFAM ID | |
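The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Consistency check for the coordinate and length fields of this record.
# Assumption: Start bp and End bp are 1-based, inclusive positions.

START_BP = 1910020
END_BP = 1911138


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1119 372
```

Both values agree with the Gene Length (1119 bp) and Protein Length (372 aa) fields listed above.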
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0212662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACACA AAAACAACCA GAATAACAAG CCCTCTCAGA ATATTTTGAA AATTAAAGCA CAAAGACTAA TTCATCAAAA TGCCCTATAT ACCATTATTG CTATTTTCTT CGGATTTCTG GTCGGCGGCA TTTTCCTGTG GGCAGCCGGC TTCAGTCCCA TTGAAGCATA TGCAAAGCTG TTCAGCAGCG TGTTCAGCCG TCCCAAATAT CTCATTTGGT CGGTGATCTA TGCAACTCCG CTGATATTTA CGGGGTTAAG CGTGGCATTT TCTTACAAAA TGGGTGTTTT CAACATCGGC GCCGAAGGGC AGTTTGTTGT AGGCTCACTG GCTGCTCTGT GCGTAGGCAT CCTGGTGGAC GCTCCTCCCG GGTTGCACGT GCTGCTATGT ATGCTGGCCG CTGTTGCCGC CGGCATGCTG TGGAGCTTTC TGGTGGCAGT GCTGCGCGTC CGGTTCGGTA TCAATGAAGT TTTGTCCTTC ATTATGTTCA ACTGGATCGC CTTTTATTTT TCAAATTATG TAGTCAATAC TGCAGCTATT CACAAGGTTG GCGGCGGAGA GGCTTCCAAG GATATTCGCG AATCTGCAAG AATTCTGCTG CCCCAATCGT TGCAGAATAT TTTTCAGAGT AATAAAGCCA ACTACGGTAT TTTCCTGGCA ATTATTGCGG CAATTGTTAT TTGGTTTATC CTGACGAAAA CCACTCTTGG CTATAAAGTA CAGGCCGTTG GTCTCAATCC TCACGCAGCC AAATACGGCG GTATTAATTC CAACAAGACG ATGTATATTG CCATGAGCCT GTCGGGTGCT CTGGCAGCTT TGGGCGGTGC CGTGCAACTG ATGGGTAACT CCATGCGCAT CAGTCAGTTT GCCGGACAGG AAGGCTTCGG CTTCCAGGGA ATCACCGTTG CGTTAATCGC AAGCTCTCAC CCTATTGGTT GTATTTTTTC AGGGCTGTTT TACGGTGCCA TGAAATACGG CGGCTCCAAG CTCAATCTAA TTGATGCTCC CACAGAAGTC GTTGACATCA TTATGGGCAC CATCGTGCTC TTTATCGCCA TATCACATGT ATTCCGCTAT TTAATTACAA GACGGCTTAA GAATAAGGAG GACAAATAA
|
Protein sequence | MKHKNNQNNK PSQNILKIKA QRLIHQNALY TIIAIFFGFL VGGIFLWAAG FSPIEAYAKL FSSVFSRPKY LIWSVIYATP LIFTGLSVAF SYKMGVFNIG AEGQFVVGSL AALCVGILVD APPGLHVLLC MLAAVAAGML WSFLVAVLRV RFGINEVLSF IMFNWIAFYF SNYVVNTAAI HKVGGGEASK DIRESARILL PQSLQNIFQS NKANYGIFLA IIAAIVIWFI LTKTTLGYKV QAVGLNPHAA KYGGINSNKT MYIAMSLSGA LAALGGAVQL MGNSMRISQF AGQEGFGFQG ITVALIASSH PIGCIFSGLF YGAMKYGGSK LNLIDAPTEV VDIIMGTIVL FIAISHVFRY LITRRLKNKE DK
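The gene and protein sequences above are related through translation table 11, which assigns the same amino acid to each codon as the standard genetic code and differs only in the set of permitted start codons. The sketch below is one way to check that relationship and to compute GC content from the space-delimited sequence blocks; the helper names (clean, gc_percent, translate) are hypothetical, and only the first three blocks and the last three codons of the gene sequence are used as input.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by translation tables 1 and 11,
# listed in TCAG order (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last three codons of the gene sequence in this record.
first_blocks = "ATGAAACACA AAAACAACCA GAATAACAAG"
last_codons = "GACAAATAA"

print(translate(first_blocks))  # MKHKNNQNNK
print(translate(last_codons))   # DK
```

The two outputs match the first block (MKHKNNQNNK) and the final two residues (DK) of the protein sequence above. Passing the full gene sequence to gc_percent gives a value that can be compared with the GC content field.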