Gene Cthe_0396 details


Gene Information

Locus tag: Cthe_0396
Symbol:
ID: 4808399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 491956
End bp: 493680
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 42%
IMG OID: 640105810
Product: ABC transporter related protein
Protein accession: YP_001036827
Protein GI: 125972917
COG category: [V] Defense mechanisms
COG ID: [COG1132] ABC-type multidrug transport system, ATPase and permease components
TIGRFAM ID:
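
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the reported gene length is consistent with that) and that the CDS is complete and ends in a single stop codon. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
START_BP = 491956
END_BP = 493680
GENE_LENGTH_BP = 1725
PROTEIN_LENGTH_AA = 574


def span_length(start: int, end: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced, complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: 493680 - 491956 + 1 = 1725 bp, and 1725 / 3 - 1 = 574 aa.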


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000464799
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGT TGTCGGTTTA TTTAAAGGGA TATATAAAGG AAAGTATTCT TGGACCCTTA 
TTCAAACTTT TGGAAGCATC ATTTGAATTA CTGGTACCTA TTGTTATCAA ATCCATTGTT
GATACCGGCA TTGGGCGGGC TGACAAAGTA TATATCATTA AAATGTGCCT GTTGTTGATT
CTTTTGGGTG TTGTTGGAAT GGTATGTTCG GTAACGGCCC AGTATTTTGC CGCCAAAGCT
TCAGTGGGAT TTGTGACAAA GCTTCGCCGT GCACTGTTTA GACATATTGG CCAATTGTCT
TATACGGAAA TTGACACCCT TGGTACATCC AGTATGATTA CCCGTATGAC GAGTGATATA
AACCAGGTAC AGACCGGAAT GAACCTGACA TTGCGTCTGT TGCTGCGTTC ACCTTTTATT
GTGTTTGGTG CCATGATTAT GGCGTTTACT GTGGATACCA ATGCGGCTTT TACTTTTGTT
GTAGCCATCC CTGCGCTTTT TATAGTTGTT TTTGCAATTA TGCTGCTGTC CATTCCTCTT
TACAGGAAGG TACAGCAAAG GCTTGACAGA GTGTTAAAGT CAACAAGGGA AAATCTCACA
GGTGTCCGGG TGATTAGGGC TTTTCGCCTT GAAGAAAAAG AAATAGCTGA ATTTGACAAA
CGCAATGAAG AATTGACCTC AACACAGATA TTTGTAGGCA GAATTTCAGC TCTGATGAAT
CCTTTGACCT ATGTTATTAT TAATCTTGCC ATTATCTGGC TTATTCATAT CGGTGCAATC
CGCGTATCAA AGGGACTGCT TACGCAGGGA GCTGTTTTGG CACTCTATAA TTATATGTCC
CAAATTCTTA CTGAGCTTAT AAAATTCGCA AACCTAATCA TTAGCATTAC AAAAGCTGTT
GCCAGCGCCA ACCGTATCAG TGCGGTGCTG GATGTAGAAT CCAGTTTGGT GGAGAAAGAG
TCTGAGCCGC AAGGTGACAG GTCCGAATTT ATCGTTGAAT TTCACAATGT AGGATTGACG
TATAAAAATG CCGGTGCTGA AGCTTTAACA AATATTAATT TTTCAGTCCG CCGCGGTGAA
GTGGTTGGTA TCATCGGCGG TACAGGTTCC GGTAAAACTT CTTTGGTAAA TCTTATTCCG
AGGTTTTACG ATGCCACGGT AGGCGAGGTT ATTGTAGACG GTATTAATGT TAAGGATTAT
CCTTTGAAAA AGCTGCGTGA CAAGATAGGT GTCGTTCCGC AAAAAGCTGT GCTGTTTAAG
GGCAGTATCC GCGAAAATAT GCGTTGGGGA AATAAAAATG CAACCGATGA AGAAATCATG
GAAGCCATTA ATATTGCCCA GGCAGGAGAA ATTGTGGCTC AAAAGAAGGA AGGGCTTGAT
TTTCTTATTG AACAAGGAGG AAAAAACCTT TCGGGAGGGC AGCGTCAACG TTTTACCATA
GCACGGGCCA TTGTTAAAAA GCCGGAAATC TTAATTCTTG ATGACAGTGC TTCGGCTCTT
GACTTTGCGA CCGATGCAGC TCTTCGCAAG GCTCTTCGTG AGTTACCGTG GAATCCGACA
ATTTTTATTG TTTCGCAACG TACTTCATCG ATTCAACATG CTGATAAGAT TATAGTTCTC
GATGACGGAG AAATTGTTGG TATCGGTAAG CATGACGAGC TGCTTGAAAC TTGTGAGGTA
TATCGCGAAA TTTATGACTC ACAATTCAAG AAGGAGGAAA AATGA
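
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first and last lines of the sequence are reproduced here. Whether the record's GC content value was computed in exactly this way is an assumption.

```python
def clean_sequence(block_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAAAAGT TGTCGGTTTA TTTAAAGGGA TATATAAAGG AAAGTATTCT TGGACCCTTA"
last_line = "TATCGCGAAA TTTATGACTC ACAATTCAAG AAGGAGGAAA AATGA"

if __name__ == "__main__":
    head = clean_sequence(first_line)
    tail = clean_sequence(last_line)
    print(len(head), head[:3])   # block length and start codon
    print(len(tail), tail[-3:])  # block length and stop codon
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_percent` would give a value to compare against the GC content field.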
 
Protein sequence
MKKLSVYLKG YIKESILGPL FKLLEASFEL LVPIVIKSIV DTGIGRADKV YIIKMCLLLI 
LLGVVGMVCS VTAQYFAAKA SVGFVTKLRR ALFRHIGQLS YTEIDTLGTS SMITRMTSDI
NQVQTGMNLT LRLLLRSPFI VFGAMIMAFT VDTNAAFTFV VAIPALFIVV FAIMLLSIPL
YRKVQQRLDR VLKSTRENLT GVRVIRAFRL EEKEIAEFDK RNEELTSTQI FVGRISALMN
PLTYVIINLA IIWLIHIGAI RVSKGLLTQG AVLALYNYMS QILTELIKFA NLIISITKAV
ASANRISAVL DVESSLVEKE SEPQGDRSEF IVEFHNVGLT YKNAGAEALT NINFSVRRGE
VVGIIGGTGS GKTSLVNLIP RFYDATVGEV IVDGINVKDY PLKKLRDKIG VVPQKAVLFK
GSIRENMRWG NKNATDEEIM EAINIAQAGE IVAQKKEGLD FLIEQGGKNL SGGQRQRFTI
ARAIVKKPEI LILDDSASAL DFATDAALRK ALRELPWNPT IFIVSQRTSS IQHADKIIVL
DDGEIVGIGK HDELLETCEV YREIYDSQFK KEEK