Gene Information |
Locus tag | Cthe_2533 |
Symbol | |
ID | 4809289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3002477 |
End bp | 3003361 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107949 |
Product | sulfate ABC transporter, inner membrane subunit CysW |
Protein accession | YP_001038928 |
Protein GI | 125975018 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4208] ABC-type sulfate transport system, permease component |
TIGRFAM ID | [TIGR00969] sulfate ABC transporter, permease protein; [TIGR02140] sulfate ABC transporter, permease protein CysW |
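The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3002477
END_BP = 3003361
GENE_LENGTH_BP = 885
PROTEIN_LENGTH_AA = 294


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced, complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under those assumptions, 3003361 - 3002477 + 1 gives 885 bp, and 885 / 3 - 1 gives 294 aa, which agrees with the table.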
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00678647 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGGGA TTGTTCCTGT GAGAATAAAA ACGGCAAATA AAGTGGAATA TGCAAAAAAG CGGGCGGTAA AAGACTCAAA AATTGTTCAG GTGGTTTTAA CGACAATAGC GATACTATTT TTTATAATAA TGCTCATAGT TCCACTTGTT TCGGTGTTCG TAAAAGCTTT TGAACAGGGA GCGAATTTGT ATTTTGCTTC AATTACCGAT CCGATAGCAC TAAAGGCCAT AAAACTTACC CTCATAACCA TAGCAATCAC GGTTCCGATT AACACCATAT TTGGCCTTGC AGCTGCCTGG GCGATTGCAA AATTTAAGTT TAAAGGAAAA AACATATTGA TTACGATTAT TGATTTGCCT TTTTCCATAT CGCCGGTGGT GGCGGGATTG ATATTTGTTT TGCTCTTTAG CACCAGTCAC GGACTTTTGG GACCGCTGCT TAATGCTTTG GGAATAAAAA TTATCTTTGC CCCGCCGGGA ATTGTTATTG CAACATTGTT TGTTACTCTC CCATTTGTGG CAAGGGAGCT GATACCCTTG ATGGAAGCCC AGGGAACTGC AGAGGAAGAA GCTGCGCTGA CCCTGGGGGC AAGCGGCTGG AAAACCTTTT GGTACATTAC GCTCCCCAAT ATAAAATGGG CGCTGCTGTA CGGCGTTATG CTGACAACTG CAAGGGCGGC CGGAGAGTTT GGTGCGGTTT CCGTTGTATC AGGGCATATC AGAGGTCTTA CCAATACGGT TCCTCTTCAT GTTGAAATAT TGTACAACGA ATATAAATTT TCTGCCGCAT TTGCAGTGGC ATCGCTGCTG ACCCTCATTG CCTTGATAAA TTTGATTGTA AAGAACGTAG CCCATTGGAA GATACAGCAG CAAAGCAAAA TATAG
|
Protein sequence | MAGIVPVRIK TANKVEYAKK RAVKDSKIVQ VVLTTIAILF FIIMLIVPLV SVFVKAFEQG ANLYFASITD PIALKAIKLT LITIAITVPI NTIFGLAAAW AIAKFKFKGK NILITIIDLP FSISPVVAGL IFVLLFSTSH GLLGPLLNAL GIKIIFAPPG IVIATLFVTL PFVARELIPL MEAQGTAEEE AALTLGASGW KTFWYITLPN IKWALLYGVM LTTARAAGEF GAVSVVSGHI RGLTNTVPLH VEILYNEYKF SAAFAVASLL TLIALINLIV KNVAHWKIQQ QSKI
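The sequence fields are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It relies on the fact that translation table 11 uses the same amino acid assignments as the standard code, and it does not handle alternative start codons (not needed here, since this gene starts with ATG). All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order. Translation table 11 shares
# these amino acid assignments with the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces between ten-letter blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
prefix = clean_sequence("ATGGCGGGGA TTGTTCCTGT GAGAATAAAA")
print(translate(prefix))          # MAGIVPVRIK
print(gc_fraction("ATGGCGGGGA"))  # 0.7
```

Passing the full gene sequence through `clean_sequence` and then `translate` or `gc_fraction` allows the protein sequence and GC content fields in this record to be checked in the same way.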