Gene Cthe_0397 details

Gene Information

Locus tag: Cthe_0397
Symbol:
ID: 4808400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 493682
End bp: 495442
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 43%
IMG OID: 640105811
Product: ABC transporter related protein
Protein accession: YP_001036828
Protein GI: 125972918
COG category: [V] Defense mechanisms
COG ID: [COG1132] ABC-type multidrug transport system, ATPase and permease components
TIGRFAM ID:
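
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 493682
END_BP = 495442
PROTEIN_LENGTH_AA = 586


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1761
print(expected_protein_length(length_bp))  # 586
```

Under these assumptions the result matches the listed gene length (1761 bp) and protein length (586 aa).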


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000282557
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGCA GAAAAATTCA AATGGACACT ATCCGAAAGG TTCTCAGATA TATAAAAAAA 
TATCGTGTTC TTTTATTCAT TTCCATTCTT CTTGCTGCGT CCACAGTAGC CTTAACTCTC
TATGTGCCTA TTCTTATCGG TAACGCCATT GATTATATCG TAGGACCGGG TAATGTAAAT
TTTGAAGTCG TAGCGAAAAT TCTTGTCCAA ATTGCTTTTG CCGTTGGTAT TACTGCTATA
TTTGAATGGT TTATGTACAC AATAAACAAT AATATTACTT ACCAGGTGGT GCGGGATATT
CGTGAAAAAG CTTTTCGAAA AATTGAGATT TTACCTCTTT CATATATAGA TTCTCATCCC
CACGGTGAAA TTGTAAGCCG TGTGATTGCT GATGTTGAAC AGTTTGCTGA AGGTCTTCTG
ATGGGCTTTA CCCAGCTTTT CACCGGTGTT GTTACGATTA TTGCCACGTT GATTTTCATG
CTCACCATCA ATATAAAGAT TACTTTTATT GTAGTTATCC TCACACCGCT TTCTCTTTTT
GTTGCAAATT TTATTGCAAA GCATACCTAT TCCATGTTTA AGTTGCAGTC CGAAACCCGC
GGAGAGCAAA CTTCGCTTAT TGAAGAGATG ATAGGCAACG TAAAAGTGGT TCAGGCGTTT
TCCTATCAGG AAGAAGCACT GAAGAAGTTC GACGAGATTA ATGAAAGGCT TGAAAAATAC
TCCCTTCGTG CTGTCTTCTT TTCGTCTTTG ACCAATCCTT TAACCCGTTT TATCAACAGC
CTTGTTTATG CGGCAGTCGC CCTTGCGGGC GCAATAGCGG TTATCAGCGG TGATGTGGGA
AGTACCATGA CGGTGGGCGG ACTTTCCATA TTTTTAAGTT ATGCCAGCCA GTATGCTAAA
CCCTTCAATG AAATTTCCGG TGTGATAACC GAGATGCAAA ATGCCCTTGC CTGTGCAGGA
CGCGTTTTTG AGTTGATTGA AGAAAAACCT CAGGTTCCGG ATGCTGATGA TGCAGTCACG
CTCAAAAACG CCAGCGGCAA TGTTGTTTTT GACAATGTGG CTTTTTCCTA TGTTCCTGAA
CGTCCTTTGA TTCGAAATTT AAACCTTGAA GTAAAACCCG GACAGCGTGT GGCGATTGTT
GGGCCCACCG GTTCCGGCAA AACTACTGTG ATTAATCTTT TGATGCGTTT TTATGATGTT
GATTCCGGCA GCATAAAAGT GGAGGGTATA GATATTCGAA ACATCACCCG GCAAAGCCTG
AGGGAAAACT ATGGCATGGT ATTGCAGGAT ACGTGGCTCA AATCCGGTAC CATTCGCGAA
AATATCACAA TGGGAAAACC GGACGCAACG GAGGAGGAAA TCATTACTGC GGCAAAAGCC
GCTCATGCCC ACAGTTTTAT CAAACGGCTG GAAAACGGCT ATGATACCGT CATCAGCGAG
GAAGGAGGAA GTCTTTCACA GGGACAAAAG CAATTGCTCA GTATTGCCCG CGTTATGCTT
TGTTTGCCGC CGATGCTTAT TTTGGATGAA GCAACATCTT CCATTGACAC CCGTACCGAG
GTCAAAATTC AGGAAGCCTT TGCAAGACTT ATGCAAGGCC GCACAAGTTT TGTTGTTGCG
CACCGCTTGT CCACAATACG GGAAGCCGAC GTTATTCTCG TGATGAAAGA CGGTGATATT
ATCGAGCAGG GTACCCATGA AGAGTTGCTC TCGAAAAAAG GTTTTTATGC CAATCTGTAT
AACAGTCAAT TTGCGCAATG A
 
Protein sequence
MKSRKIQMDT IRKVLRYIKK YRVLLFISIL LAASTVALTL YVPILIGNAI DYIVGPGNVN 
FEVVAKILVQ IAFAVGITAI FEWFMYTINN NITYQVVRDI REKAFRKIEI LPLSYIDSHP
HGEIVSRVIA DVEQFAEGLL MGFTQLFTGV VTIIATLIFM LTINIKITFI VVILTPLSLF
VANFIAKHTY SMFKLQSETR GEQTSLIEEM IGNVKVVQAF SYQEEALKKF DEINERLEKY
SLRAVFFSSL TNPLTRFINS LVYAAVALAG AIAVISGDVG STMTVGGLSI FLSYASQYAK
PFNEISGVIT EMQNALACAG RVFELIEEKP QVPDADDAVT LKNASGNVVF DNVAFSYVPE
RPLIRNLNLE VKPGQRVAIV GPTGSGKTTV INLLMRFYDV DSGSIKVEGI DIRNITRQSL
RENYGMVLQD TWLKSGTIRE NITMGKPDAT EEEIITAAKA AHAHSFIKRL ENGYDTVISE
EGGSLSQGQK QLLSIARVML CLPPMLILDE ATSSIDTRTE VKIQEAFARL MQGRTSFVVA
HRLSTIREAD VILVMKDGDI IEQGTHEELL SKKGFYANLY NSQFAQ
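
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical. Whitespace is stripped first because the sequences here are printed in blocks of ten.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Assumption: the CDS starts with ATG. Alternative bacterial start
    codons (e.g. GTG, TTG) would need to be forced to 'M' separately.
    """
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(s), 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAAGCA GAAAAATTCA AATGGACACT"))  # MKSRKIQMDT
```

Passing the full gene sequence to `translate` and `gc_content` is one way to compare against the protein sequence and the GC content listed in this record.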