Gene Cthe_2357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2357 
Symbol 
ID4808991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2811392 
End bp2812564 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content35% 
IMG OID640107764 
Producthypothetical protein 
Protein accessionYP_001038752 
Protein GI125974842 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000200113 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TTAAATTTAT GTTTTTAGTT GGAACTATTT TGTTGGCAAT GTTTATGTTT 
ACAGGCTGCG GTTCCACAAC AGTTGACCTC AACAAATATA TGACAATTGA AGTAACCGGA
TATGATTCCA GGGGAACTGC AAAATATACT TTTGACCGTA AATCTTTCAT CGAAGACTAT
TCCGACAAAA TAAAAATCAA TTCAAAGAAA AGCAGTGATA CAGCTTCCCT GGGTCTGATG
CTCGGTATGT CACCGGCGGA ATTATTGTTG GACTCATGTG TAAAATATGA ACTTGACAAA
TCAAAAGAAC TTAGCAATGG GGATGTTGTT ACTTTAAAAT GGAATTGCGA AGATAATTTG
ACCAAAGAAA ATTTTAACGT AAAATTAAAA TATTCCGATA TAAAGCACAA GGTATCAGAA
CTAAAAGAAG TTGATAAATT CAATCCGTTC GATTATGTTG AGGTCAGTTT TTCGGGAATA
TCCCCAAATG TTAAAGTAAC CATAACCCCA AATTATGAAC AGCCGGAAAT GCAATATATT
AGCTTTTCGG CAGATAAAGA TTCCCGTTTG AAAAACGGTG ACACAATTAC CGTAACCGCT
TTGATTTCAA AATCGAACGA GGAAAAACTT GCCGAAAAGT TTGGAAAAAT TCCCGGAGTA
ACCAAAAAAG AATATACGGT GGAATCTCTG CCCTATTATG TTACAAAAGT GGATGAAATT
CCATCGGATA CTATGGATAA AATAATTTCG CACGGTGAGT CCGTATTCAG ATCTTATGTG
GAAAGAAAAT GGGCCAAACC CGAAAATCTT ATTGACGTTG AATATATCGG CAATTACTTT
TTGGTTGACA AAAACCCTTG GTGGATTTCT TCCTATACTG AATTGTACAT GGTCTACAAA
ATTTCAGCAG TAAATCCGGA ACCTGAACAG CCTATCGAGT TTTACTACTA TGTATGTTAT
TCTGACATAA CCGCATCCGC CGACGGTACT TGTACTGTTA ATTTTGATGA CTGTAAAGTA
CCTCAAGATG GCCTTTTCAC ATCCGAATCA TTTAAAGTCG GCAAATATAA ATATATGGGA
TATGAAACTT TGGACGCACT TTTCAACAAT TGCATAGTGC CTAAGATAAA CAAGTATGAA
TATACTTCAA CAATTAAAAA CGCTGAACAA TAG
 
Protein sequence
MKKIKFMFLV GTILLAMFMF TGCGSTTVDL NKYMTIEVTG YDSRGTAKYT FDRKSFIEDY 
SDKIKINSKK SSDTASLGLM LGMSPAELLL DSCVKYELDK SKELSNGDVV TLKWNCEDNL
TKENFNVKLK YSDIKHKVSE LKEVDKFNPF DYVEVSFSGI SPNVKVTITP NYEQPEMQYI
SFSADKDSRL KNGDTITVTA LISKSNEEKL AEKFGKIPGV TKKEYTVESL PYYVTKVDEI
PSDTMDKIIS HGESVFRSYV ERKWAKPENL IDVEYIGNYF LVDKNPWWIS SYTELYMVYK
ISAVNPEPEQ PIEFYYYVCY SDITASADGT CTVNFDDCKV PQDGLFTSES FKVGKYKYMG
YETLDALFNN CIVPKINKYE YTSTIKNAEQ