Gene Cthe_1189 details

Gene Information

Locus tag: Cthe_1189
Symbol:
ID: 4810141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1418008
End bp: 1418976
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 40%
IMG OID: 640106611
Product: ABC transporter related protein
Protein accession: YP_001037614
Protein GI: 125973704
COG category: [R] General function prediction only
COG ID: [COG4586] ABC-type uncharacterized transport system, ATPase component
TIGRFAM ID:
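
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1418008
end_bp = 1418976
gene_length_bp = 969
protein_length_aa = 322

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon that does not contribute a residue.
codons, leftover = divmod(gene_length_bp, 3)
residues = codons - 1

print(span_bp)    # 969
print(leftover)   # 0
print(residues)   # 322
```

Under these assumptions the span matches the listed gene length and the codon count matches the listed protein length.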


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTATAA AAGTGGAGAA CCTTACAAAA ACCTATAAAT CTTACGAGCG GGGAAATACA 
TTCAGGGAAG CCGTTTACAG TCTCTTTGTC CGCAAAACCA AATATGTGGA AGCCTTAAAA
GGAATCTCCT TCACTATGGA AAAGGGAGAA CTGGTTGGCT TTCTTGGCCC CAACGGTGCG
GGCAAATCAA CGACATTGAA GATATTAACC GGAATACTCT TCCCTACCGG TGGAAAAGTT
GATATTATGG GTTATACTCC CTGGAAGGAC AGAAAAAAAT ATGTGGCCCA TATTGGTGCC
GTGTTTGGCC AAAAATCACA GCTTCTGTGG GACATACCAC CCATTGATGC TTTTTATCTG
AACAAAGCAA TATATTCCAT TCCCGATAAA ATCTTTAAAA AAAATTTGGA CAATATGGTA
GAGCTCCTTA ATGTCGGTGA TTTAATAAAA AAGCCTACAA GGCTTCTTTC CCTTGGTGAA
AGAATGAAAT GCGAATTTAT TATGGCAATG CTTCATAATC CTGAAATAGT GTTTCTTGAC
GAGCCTACCA TTGGACTTGA TGTCATTGCC AAGGACAAAA TTCGTGAATT CATTCTTGAA
ATGAACAAAC AGGGAGTGAC ATTCATCCTT ACCACCCACG ACCTTGATGA CGTGGAACGC
CTTGCACAAA GAGTTATAGT TATCAATCAC GGGCAAATTG TGTTTGACAA CTCCATCGAT
GCCCTAAGGA AGCACTTCGG GGAGAAAAAA GTGGTGTCTG TATCCACTCA CAATCCATTA
CCGTCTCTGG ACATGCCCGG AGTACTGGTA AAAAACAAAA TATCCGAGTA CAATGCGGAA
CTTGAACTTG ACGTCAGCAA ACTGGAGCTT AACAAATTTA TCGACTATAT AAATAAAAAC
AGCACAATAA ATGATTTGCT GGTTCAGTCG CTTCCTATTG AAGATGTAAT AAAGGATTTA
TATTCATAA
 
Protein sequence
MIIKVENLTK TYKSYERGNT FREAVYSLFV RKTKYVEALK GISFTMEKGE LVGFLGPNGA 
GKSTTLKILT GILFPTGGKV DIMGYTPWKD RKKYVAHIGA VFGQKSQLLW DIPPIDAFYL
NKAIYSIPDK IFKKNLDNMV ELLNVGDLIK KPTRLLSLGE RMKCEFIMAM LHNPEIVFLD
EPTIGLDVIA KDKIREFILE MNKQGVTFIL TTHDLDDVER LAQRVIVINH GQIVFDNSID
ALRKHFGEKK VVSVSTHNPL PSLDMPGVLV KNKISEYNAE LELDVSKLEL NKFIDYINKN
STINDLLVQS LPIEDVIKDL YS
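
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. The following is a minimal sketch, not the tool used to produce this record. It builds the codon-to-amino-acid mapping of the standard genetic code, which translation table 11 shares for codon assignments (table 11 differs in which start codons it permits). The `translate` helper and the two sample blocks (the first 60 bases and the final 9 bases of the gene sequence) are chosen for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases and last 9 bases of the gene sequence in this record.
first_block = "ATGATTATAA AAGTGGAGAA CCTTACAAAA ACCTATAAAT CTTACGAGCG GGGAAATACA"
last_block = "TATTCATAA"

print(translate(first_block))  # MIIKVENLTKTYKSYERGNT
print(translate(last_block))   # YS
```

The two outputs correspond to the first 20 residues and the last 2 residues of the protein sequence listed above. Passing the full gene sequence to `translate` would allow the same comparison over all 322 residues.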