Gene Cthe_2641 details

Gene Information

Locus tag: Cthe_2641
Symbol:
ID: 4808952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3123065
End bp: 3124057
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 40%
IMG OID: 640108054
Product: N-acetylneuraminate synthase
Protein accession: YP_001039033
Protein GI: 125975123
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2089] Sialic acid synthase
TIGRFAM ID: [TIGR03569] N-acetylneuraminate synthase
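
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `span_length` and `coding_length_aa` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 3123065
END_BP = 3124057
GENE_LENGTH_BP = 993
PROTEIN_LENGTH_AA = 330


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3124057 - 3123065 + 1 gives 993 bp, and 993 / 3 - 1 gives 330 aa, in agreement with the listed gene and protein lengths.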


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAGTGT TTGTCATAGC CGAAGCAGGC ATAAATCACA ATGGAGAGCT AAAGCTTGCA 
AAAAAACTGG TGGATGCCGC CAAAGATGCA GGTGCTGACT GTATAAAATT TCAAACCTTT
ATTTCAAAAA ATCTTACGAC AAAAAACGCT TCAAAGGCCG AGTACCAGAA GCAAACAAAA
TCCGAAGAAT CTCAGTATGA CATGCTCAAA AGGTATGAAC TTTCTTTTGA TGAATTTTCG
GAGCTAAGCA GGTACTGCCA GGATAAAAAC ATTGAATTTC TTTCGACGGC CTTTGATTTT
GAAAGCATAG AGTTTTTAAA AAGTCTTGAT ATGAAAAGAT GGAAGATTCC TTCGGGAGAA
ATTACAAATC TTCCTTATTT AATAAAAATA GCAAAGCTAA ACAAGCCCGT TATTTTATCC
ACGGGCATGA GCACAATGGA TGAGATAAAA AAAGCGGTTT CGGTATTGAG AGAAAACGGT
ACCGGAGAAA TTACGGTTCT TCACTGCACG ACGGAGTATC CTGCGCCCTT TTCTGATGTA
AACCTTAAAG CCATGCTCAC AATAAAAAAA GAGCTCGGCG TAAAAGTAGG TTATTCCGAC
CACACGAAAG GAATTGAAGC ATCCATTGCA GCTGTGGCAC TGGGAGCTTC CGTCATAGAA
AAACATTTAA CTTTGGATAA GAATATGGAA GGTCCTGATC ACAAGTCAAG CCTTGAACCA
AATGAAATGA AAGCTATGAT TAGAGCCCTC AGAAATATTG AGCTTGCTTT GGGCGACGGA
ATAAAGAAGC CTTCAGAATC TGAGAAAAAG AATATTTGTG TGGCCCGCAA AAGCATTGTG
GCCAAAAGAT ACATCCAAAA GGGTGAAATT TTCACTGAGG AAAATTTGAC GGTAAAAAGG
CCGGGTAACG GCATCAGCCC GATGCAATGG TTTGAAGTTC TTGGAAGAAG AGCCGTAAGA
GATTTTCAGG AAGACGAGTT GATAGAGTTA TGA
 
Protein sequence
MKVFVIAEAG INHNGELKLA KKLVDAAKDA GADCIKFQTF ISKNLTTKNA SKAEYQKQTK 
SEESQYDMLK RYELSFDEFS ELSRYCQDKN IEFLSTAFDF ESIEFLKSLD MKRWKIPSGE
ITNLPYLIKI AKLNKPVILS TGMSTMDEIK KAVSVLRENG TGEITVLHCT TEYPAPFSDV
NLKAMLTIKK ELGVKVGYSD HTKGIEASIA AVALGASVIE KHLTLDKNME GPDHKSSLEP
NEMKAMIRAL RNIELALGDG IKKPSESEKK NICVARKSIV AKRYIQKGEI FTEENLTVKR
PGNGISPMQW FEVLGRRAVR DFQEDELIEL
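
The gene sequence begins with TTG while the protein sequence begins with M, which is consistent with translation table 11: an alternative start codon is read as methionine when it initiates translation. The sketch below illustrates this on the first 30 and last 33 nucleotides of the gene sequence. It is a minimal example, not the tool used to produce this record; `translate`, `HEAD`, and `TAIL` are hypothetical names, and `START_CODONS` lists only three commonly used bacterial initiators (table 11 defines additional, rarer ones).

```python
# Codon assignments in NCBI order (bases T, C, A, G); table 11 uses the
# same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: a subset of the table 11 initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA fragment; '*' marks a stop codon.

    If is_start is True and the first codon is a listed initiator,
    it is reported as M regardless of its usual meaning.
    """
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if len(dna) % 3 != 0:
        raise ValueError("fragment length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First 30 and last 33 nucleotides of the gene sequence in this record.
HEAD = "TTGAAAGTGT TTGTCATAGC CGAAGCAGGC"
TAIL = "GATTTTCAGG AAGACGAGTT GATAGAGTTA TGA"

if __name__ == "__main__":
    print(translate(HEAD))                  # MKVFVIAEAG
    print(translate(TAIL, is_start=False))  # DFQEDELIEL*
```

The two fragments translate to the first ten and last ten residues of the protein sequence, followed by the TGA stop codon.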