Gene Cthe_2998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2998 
Symbol 
ID4811146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3519775 
End bp3521502 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content44% 
IMG OID640108419 
ProductABC transporter related protein 
Protein accessionYP_001039387 
Protein GI125975477 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000215819 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATT TTATCAGAAT ACTTAAACAG GGACGACAAT ACTGGAAATA TTTGATAGTC 
GCCGGTATAA GCCTGCTCGC GATTACGGCT TTCAACCTTG TTACCCCATG GAAAGTCAGA
GAACTGGTTG ATATATTGTC AAAAGAAGGC ACGAAGAATA TGGCAAGTAT ACGCAATATT
GCAATTATAC TTGCCGTTGC ATACATCGCA AGAGCGTTTT TTTCCTATCT GTATCGCTAT
TTCAGCCATG TTGCCGCATG GAGACTGGTT GCGGACATGA GGGTGATTGT GTACGAACAT
CTGCAGAAGC TGTCTTTGAG TTTCTATCAA GACAAGCAGA CCGGGCAGCT TATGTCAAGG
GCAATAAATG ACACGGCCAC TTTTGAGGTT TTAATAGCCC ATGCTGTTCC GGACCTTGTT
ACCAATGTGC TTATTCTCGT TGGCACCGGT ATACTGTTGT TTGTAATAAA CCCTTTGCTG
GCGGCGCTTA CCCTTATACC CATACCGTTT TTGGTACTGG GTTCGGGAAT ATTTACCAAA
AAAGTTCTTC CGAATTTCCG GGAGGCGCAG GGAAAACTTG GGGATCTTAA CGCAGTGGTG
CAGGACAATA TTTCGGGTAT CAAGGAAATT CAGGCTTTCA ACCAACAGTC AAGGGAGAAA
AAACGCGTTG AAAAAAGGGC AAGAAAATAC ACTTCCGCAA TACTTCATGC TTTAAGGGTA
AGTGCGATAT TTCATCCCGT TGTGGAAATG ATAAGCTCCC TTGGAACGGT TATTGTCGTG
GCCTTTGGCG GATGGTTTGC ATTAAAAGGT TATGTCAGCA CGGCTGATAT TGTAGGATTT
ATAATGTTTC TGGGACTTTT TTATCAGCCT ATCACCACAC TGGCAAGAGT AATAGAAGAT
TTACAGCAGG CTGCGGCGGG AGCGGAAAGA GTGTTTGAGC TTCTTGACAC CGAGCCGGAT
ATTGTCGACA GTGAAGGGGC GAAAACCATT ACAAGTTCCA AAGGTGAAAT AACCTTTAAA
AATGTAAGTT TCCACTATAT TCCTTCAAAT CCGGTTTTGA AAAATATAAG CTTTACCGCA
AAGCCCGGTC AGATGATAGC ATTGGTGGGG CCGACCGGAG TGGGAAAAAG CACTGTAATT
AGTTTAATAG CAAGATTTTA TGACCCGGTT TCGGGAGAAA TTCTTCTTGA CGGTATGAAT
ATTAAGGATA TAACAATATC TTCTCTAAGA AACCAGATAA GCATAGTGCT TCAGGATACA
TTCCTTTTCA ACGGCAGTGT GGCTGACAAT ATTGCTTACG GAAGCAGGGA CGCTTCTTTT
GAAGATATTG TTAGAGCGGC AAAAATCGCA AGGGCTCATG ATTTTATAAT GCAGCTTCCC
GAAGGCTATG ACACCGTGAT AGGTGAAAGG GGCGTAAAGC TTTCCGGAGG GCAGAAACAG
CGTTTGTCCA TCGCCAGAGC GGTTCTTAGA AACACTCCTA TATTGATATT GGACGAAGCA
ACCGCGTCAG TTGATGTCGA AACGGAAGCG GAAATCCAGA AAGCGATTGG TGAATTGGCA
GGAACCCGCA CTATTATTGT GATAGCCCAC AGGCTGTCCA CCGTAAAGCA GGCAGACAAC
ATCCTTGTCT TAAAAGACGG GGAAATTGTA GAGTCGGGCA CTCATGACGA GCTTATAAAG
CAAGACGGCT TGTACAAATA TCTTTGCGAA GTACAATTCG GAATATGA
 
Protein sequence
MKNFIRILKQ GRQYWKYLIV AGISLLAITA FNLVTPWKVR ELVDILSKEG TKNMASIRNI 
AIILAVAYIA RAFFSYLYRY FSHVAAWRLV ADMRVIVYEH LQKLSLSFYQ DKQTGQLMSR
AINDTATFEV LIAHAVPDLV TNVLILVGTG ILLFVINPLL AALTLIPIPF LVLGSGIFTK
KVLPNFREAQ GKLGDLNAVV QDNISGIKEI QAFNQQSREK KRVEKRARKY TSAILHALRV
SAIFHPVVEM ISSLGTVIVV AFGGWFALKG YVSTADIVGF IMFLGLFYQP ITTLARVIED
LQQAAAGAER VFELLDTEPD IVDSEGAKTI TSSKGEITFK NVSFHYIPSN PVLKNISFTA
KPGQMIALVG PTGVGKSTVI SLIARFYDPV SGEILLDGMN IKDITISSLR NQISIVLQDT
FLFNGSVADN IAYGSRDASF EDIVRAAKIA RAHDFIMQLP EGYDTVIGER GVKLSGGQKQ
RLSIARAVLR NTPILILDEA TASVDVETEA EIQKAIGELA GTRTIIVIAH RLSTVKQADN
ILVLKDGEIV ESGTHDELIK QDGLYKYLCE VQFGI