Gene Cthe_2273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2273 
Symbol 
ID4809862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2702860 
End bp2703981 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content40% 
IMG OID640107679 
Producthypothetical protein 
Protein accessionYP_001038668 
Protein GI125974758 
COG category[S] Function unknown 
COG ID[COG3581] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTA CATTTCCACA TATGGGAGAC ACATATATAC CTGTAAAAGT ACTGCTGGAA 
ACCGCGGGGA TTGATTATGT CATGCCACCG GTTTCCGACA GAAGTCTGTT AGAACAGGGG
ATACTGCACT CGCCTGAATT TGCCTGTCTT CCTTTCAAAA CAATAATGGG TGATTTTATT
TACGGAATTG AACATGGAGC GGACTGGATT CTTTTCGGCG GTGGCTGTGG CCAGTGCAGA
TTCGGCTATT TTGGAAAGCT TCAGGCCGAA ATATTAAAAA GCATTGGGTA TGATGTAAAT
TTTATATATA TTGATCTTAG CAATATTTCC GTGAAAGAAG TGCTGGAGAA AATAAGGCCT
CTTACAGAAG GAAAGAGTAT TTTTGAGCTT TTAAAGGCAA TATTTTATGC CGTTAAAACC
GTTTTTGCCG TTGACAGGAT AAACGAACTG GCAAGATTCA CAAGGTGCCG GGAGATAAAC
AAGGGAGAAA CGGACAGAAT AATGACTGAA TTTCACAATG AAATCCAAAA AGCCAGGGGG
TATAAAAGCA TAAACAAAAT AATTCATTCC ACCGCCAAAA AACTGCGGAA GATGCCTTTG
GACAAAAAAT ACAGGCCAAT CAGGGTTTCC ATTGTGGGTG AAATATATAT TGCCGCCTAT
CCCGGCATTA ATTTTGAGAT AGAAAGAAAG CTTGGCAACA TGGGTGTGGA AGTGCATAAC
ACCATGAGCA TGAGCTTTTG GATAAAAGAA CATTTTATAA AGAAGCTTCT CCCCTTCAAA
ATAAAAAACA AAAACCATGA AGCCGGAAAG GAATTTATGA ATACTGACGA TATCGGCGGT
CATGGCCTCA GCTCCATAGG TGCCTCCATA AGAAGTGCCA AAAAGGGATT TGACGGCGTT
GTCCATATAT ATCCCTTCAC CTGCATGCCT GAAATAATTG CTCAAAGCAC CTTTAGCGAA
GTGCAAAAGA AATACGGTAT ACCCATTATT ACACTGATAA TTGATGAAAT GACCGGTGAA
GCAGGTTATA TGACAAGGCT TGAGGCATTT GTGGATATGA TTAAAATGAG AAGGAAGCCA
TCTTACTTCC CTATGCCCAG ATTTTTTTCG CAAAAAATTT AA
 
Protein sequence
MKITFPHMGD TYIPVKVLLE TAGIDYVMPP VSDRSLLEQG ILHSPEFACL PFKTIMGDFI 
YGIEHGADWI LFGGGCGQCR FGYFGKLQAE ILKSIGYDVN FIYIDLSNIS VKEVLEKIRP
LTEGKSIFEL LKAIFYAVKT VFAVDRINEL ARFTRCREIN KGETDRIMTE FHNEIQKARG
YKSINKIIHS TAKKLRKMPL DKKYRPIRVS IVGEIYIAAY PGINFEIERK LGNMGVEVHN
TMSMSFWIKE HFIKKLLPFK IKNKNHEAGK EFMNTDDIGG HGLSSIGASI RSAKKGFDGV
VHIYPFTCMP EIIAQSTFSE VQKKYGIPII TLIIDEMTGE AGYMTRLEAF VDMIKMRRKP
SYFPMPRFFS QKI