Gene Cthe_2342 details


Gene Information

Locus tag: Cthe_2342
Symbol:
ID: 4808976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2792403
End bp: 2793461
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 34%
IMG OID: 640107749
Product: hypothetical protein
Protein accession: YP_001038737
Protein GI: 125974827
COG category: [V] Defense mechanisms
COG ID: [COG2348] Uncharacterized protein involved in methicillin resistance
TIGRFAM ID:
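
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (both assumptions are consistent with the values in this record, but the record itself does not state them). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2792403, 2793461)
print(length)                     # 1059
print(protein_length_aa(length))  # 352
```

Both results agree with the Gene Length and Protein Length fields listed for Cthe_2342.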


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTTTAT TGGTGAATAT AGATGACAAT GAAAAATGGG ATAATACAGT CAAAGGATTT 
GAAAATTTTG ATGTTTATTA TCTGTCAGGA TATGTAAAGG CTTTCCATGC CCATGGTGAC
GGGGAGCCTA AACTGTTTTT TTATGAAGAT AGCAATATTA AAGCAATTAA TGTTTTTATG
AAAAGAGATA TTGAAAAGGA TCAAAACTTT TCGGGGAAAA TACCACCCAA TACTTTTTAT
GATGCAGCGA CACCATATGG ATACGGAGGT TTTTTAATTG AAGGAAATGT GACGGATGAA
AGTTTAAGGA CTCTTGAAAA AGAGTATTCC GAACTTTGCC TCAATGAAGG CATTATCAGT
GAATTTGTAC GTTTTCATCC GGTAATTAAC AACTCTGGGA CTGTTTCATC AATTTACGAT
ATTTTGGAGC TTGGAAGAAC CGTAACAATA GAATTGGATT CACGGGAAAA AGTTTGGGAA
GGCTTTGCCG GCAACAATCG CAATAAGGTT CGGAAAGCCA AAAAGTCCGG AGTCGAGATA
TTTTGGGGAC GAAGCCCAAA GCTATTTGAC ACGTTTATAC CTATGTACAA CGAAACAATG
GATAATGACG GTGCATCAGG CTACTATTAT TTTAAAAAAG ACTTTTACAA TAGTATACTT
GAAGACTTGA AATATAATTC ATTGATATTT TATGCTGTTT TTGAAGATAG AATAATTTCA
ATGTCAATTA TTTTGTTTGC AAATAAACAG ATGCATTATC ATCTGTCTGC TACTGACAGA
GAGTACAGAA ATCTTGCACC TACCAACCTT TTGCTTTATG AAGCGGCATG CTGGGGAATT
GAAAATGGAT ATAAGACATT CCATTTGGGT GGAGGTTTGG GCAGCAGGGA AGACAGCCTT
TATCAATTTA AAAAAGGATT TAACAAGAAC TCAAGCACTT ATTTTTGTAT AGGAAAAAAA
ATATTTGACA AAGAGAAATA TGATGAATTG ATAAAAATAA GAAAGCAAGA TCCATACTTT
GATGAAAACA AGTTGTTTTT CCCTTTATAT AGGGGATAA
 
Protein sequence
MVLLVNIDDN EKWDNTVKGF ENFDVYYLSG YVKAFHAHGD GEPKLFFYED SNIKAINVFM 
KRDIEKDQNF SGKIPPNTFY DAATPYGYGG FLIEGNVTDE SLRTLEKEYS ELCLNEGIIS
EFVRFHPVIN NSGTVSSIYD ILELGRTVTI ELDSREKVWE GFAGNNRNKV RKAKKSGVEI
FWGRSPKLFD TFIPMYNETM DNDGASGYYY FKKDFYNSIL EDLKYNSLIF YAVFEDRIIS
MSIILFANKQ MHYHLSATDR EYRNLAPTNL LLYEAACWGI ENGYKTFHLG GGLGSREDSL
YQFKKGFNKN SSTYFCIGKK IFDKEKYDEL IKIRKQDPYF DENKLFFPLY RG
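
The sequences above are printed in blocks of ten characters, so they need cleaning before use. The sketch below shows one possible way to strip the spacing, compute GC content, and translate the gene sequence. It assumes translation table 11 assigns codons to amino acids exactly as the standard genetic code does (the two tables differ in permitted start codons, not in codon assignments). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first row of the gene sequence is used as input here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_row = clean(
    "ATGGTTTTAT TGGTGAATAT AGATGACAAT GAAAAATGGG ATAATACAGT CAAAGGATTT"
)
print(translate(first_row))  # MVLLVNIDDNEKWDNTVKGF
print(round(gc_percent(first_row)))
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through the same functions would allow the reported GC content and protein length to be checked in the same way.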