Gene Cthe_1873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1873 
Symbol 
ID4809204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2224043 
End bp2225386 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content41% 
IMG OID640107292 
ProductHMG-I and HMG-Y, DNA-binding 
Protein accessionYP_001038287 
Protein GI125974377 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.701975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTGGGAA GTTCTAGTAC TGATGCTATA GGACCTGGCT CAGTATTCCA AATTGATGCA 
ACTGTGGCTG ATGTATATTT GGTTTCCCGT TTCAACAGAA CACATATTAT AGGCAGACCT
GTTTTATATA TTGTCCAGGA CTGCTTTTCC AAACTTATTG TGGGGCTTTA TGTAGGACTT
GAAGGGCCGT CATGGATTGG AGCAGCAATG GCTCTTGCAA ACACTGCTGG AAACAAAGTT
TCTTTTTGTA GTCAGTATGG TATAGATATT CAGGAAGAAG AATGGCCTGT ACATCATCTC
CCTCAGGCAA TTTTGGCAGA CCGAGGGGAA ATGCTGTCAG ACAATGCAGA GAGTTTGATT
ATGAATCTTG GAATTACGGT AAAGAATACC CCGCCTTTCA GGGCTGACTG GAAGCCGTTA
GTAGAAAGAT ATTTTAAATT GACCAATGAG CGTACAAAAT CATTACTTCC TGGAGCGGTA
AATACAGATT TTATGCAGCG AGGCGGGAGA GATTACAGGC TTGATGCGAA ACTTGATTTA
ATGCAATTTA CTGCCATTAT TATAAAATGT GCGTTATTCC ACAACAACCA TTATCGTATT
GACAATTACA ACAAAGATGA AATGATGGTG GCAGACGAAG TGGAACCTAT TCCAAGGGAA
ATCTGGAACT GGGGTATCGC TAACCGAATG GGCAAACTTC GGCACGTAGA TGAGGAAGTA
GTGAAACTTA ACCTGATGCC GTCGGATAAT GGGGTGGTTA CGGCAAAAGG GATAAGGTTT
AAAGGGCTGT TCTACAGTTC TAAATCAAGC ATGAAAGAGC AGTGGTTTGT AAAAGCCCGT
AGCAGTGGAA GTTGGAAAGT GCCTGTATCC TATGACCCAA GAAACATGAA TTACATATAC
ATTAAGAAAT CTGCCACCGA GTTTGAGAAA TGCTACCTGC TGGAATATCA GACGGCATTT
AAGGATAAGT ACATTGAAGA AATTGAATAC CTGATGGAGT GGGAAAAGAT GCAAAAGGCT
AAAAGTCTTG ATGAGGGATT GCAGGCCAAG GCAGATTTAA TAACAGAAAT AGAAACAATA
GTTGAAGGGG CAAAAAGCAA GACAAATAAA GAACTTTCAC TATCAACAGA AAGCGATGCA
CAAAGGAAGA AAAACATACG GCAAAACAGG CAGGTTGAAA AGGAGATAAA TCGGGAGATA
GAGGCTTTTG AATTGGATAG GCAACCTAAT AATAAGAATG CAGAGATAAT CTCTCTTAAT
GAACTGGAAG AAGAGTTACC ATCCAATCCT CTGGATTTGT TGAGAAGAAA ACAGAGGGAG
ATGCTTGGGA AGATTAACGA ATAA
 
Protein sequence
MLGSSSTDAI GPGSVFQIDA TVADVYLVSR FNRTHIIGRP VLYIVQDCFS KLIVGLYVGL 
EGPSWIGAAM ALANTAGNKV SFCSQYGIDI QEEEWPVHHL PQAILADRGE MLSDNAESLI
MNLGITVKNT PPFRADWKPL VERYFKLTNE RTKSLLPGAV NTDFMQRGGR DYRLDAKLDL
MQFTAIIIKC ALFHNNHYRI DNYNKDEMMV ADEVEPIPRE IWNWGIANRM GKLRHVDEEV
VKLNLMPSDN GVVTAKGIRF KGLFYSSKSS MKEQWFVKAR SSGSWKVPVS YDPRNMNYIY
IKKSATEFEK CYLLEYQTAF KDKYIEEIEY LMEWEKMQKA KSLDEGLQAK ADLITEIETI
VEGAKSKTNK ELSLSTESDA QRKKNIRQNR QVEKEINREI EAFELDRQPN NKNAEIISLN
ELEEELPSNP LDLLRRKQRE MLGKINE