Gene Cthe_2962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2962 
Symbol 
ID4810850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3480056 
End bp3481084 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content44% 
IMG OID640108384 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001039352 
Protein GI125975442 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAACA TTGAAGCAAA AACTTCTTCA TCCGGCTATA ATTCAGCCCG GAAAAAAGAT 
GTTTTAATAG AAGTGAAAAA TTTAAAGCAA TATTTTAACA TAAAAACCAG CCTTGGTAAA
AAAGCCACAG TTAAAGCGGT GGATGATGTA ACCTTTGAGA TTTACAAAGG CGAAACCCTC
GGACTCGTTG GCGAATCCGG TTCAGGTAAA ACCACTTTGG GAAGGACAAT TCTCAGGATT
TATGAGCCTA CAGCCGGGCG AGTTGTATTT TTGGGAGTTG ATATAACCAA ATTGGGAAGG
GGACAGCTCC TTCCCTACAG GAAAAAAATG CAGTATATTT TTCAAGACCC TTACGCATCC
CTCGACCCTC GTATGACCGT TTCGGATATT GTGGGCGAAG CACTGGATAT TCATAGACTG
GTTTCTTCCA AAAAAGAGAG GGAGGAAAAA GTCAGAGAAC TGTTAAAAAT GGTAGGACTT
AATACCGAAC ACGCATCCCG TTATCCTCAT GAATTTTCAG GAGGACAGCG CCAAAGAATC
GGAATAGCCC GGGCTATCGC CGTAGAACCT GAATTTATCG TATGTGACGA GCCGGTATCC
GCACTTGACG TTTCTATAAG GGCCCAGATA ATTAACACGC TGGAAGAAAT GCAGGAAAGG
CTGAACCTGA CCTACCTTTT CATCTCCCAT GATTTGGGCG TGGTAAGGCA TACATGTGAC
AGAGTAGGGG TCATGTACCT GGGACATATA GTGGAACTGG TGGAATCGGA AGAATTGTAC
AAAAATCCTC TCCATCCATA CACTCAGGCA CTATTGACGG CTATTCCCAG ACCTAATCCT
GAGATTGCCA AGAAAAGAAA CAGAATTATC TTAAAGGGTG AAATCCCGTC ACCGGTGAAT
CCGCCATCTG GCTGCAAGTT CAGAACCAGA TGTCCCTATG CAAAGGATAT CTGTGCAAAA
GAAGTGCCCG AGTTCAAAGA TTACGGAAAC GGTCATTATG TAGCCTGCCA TTTTGCAGGT
AAATTATGA
 
Protein sequence
MTNIEAKTSS SGYNSARKKD VLIEVKNLKQ YFNIKTSLGK KATVKAVDDV TFEIYKGETL 
GLVGESGSGK TTLGRTILRI YEPTAGRVVF LGVDITKLGR GQLLPYRKKM QYIFQDPYAS
LDPRMTVSDI VGEALDIHRL VSSKKEREEK VRELLKMVGL NTEHASRYPH EFSGGQRQRI
GIARAIAVEP EFIVCDEPVS ALDVSIRAQI INTLEEMQER LNLTYLFISH DLGVVRHTCD
RVGVMYLGHI VELVESEELY KNPLHPYTQA LLTAIPRPNP EIAKKRNRII LKGEIPSPVN
PPSGCKFRTR CPYAKDICAK EVPEFKDYGN GHYVACHFAG KL