Gene Athe_1060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1060 
SymbolglyA 
ID7409617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1154338 
End bp1155585 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content38% 
IMG OID643715426 
Productserine hydroxymethyltransferase 
Protein accessionYP_002572934 
Protein GI222529052 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.328734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTTTT ACAATTTAGT AAAAAATACA GATCCAGAAA TAGCTGAGGC AATAAAGAGC 
GAGCTTAAAA GACAGCAGAA TAAAATTGAG CTTATTGCAT CTGAGAACTT TGTTTCAATT
GCAGTAATGG CAGCAATGGG TTCACCTTTG ACAAATAAAT ATGCAGAAGG ATATCCAGGA
AAGAGATATT ATGGTGGATG CGAATATATC GATGTTGTTG AATCTATAGC AATTGAGAGA
GCTAAAAAGC TGTTTGGAGC TGAACATGCT AATGTCCAGC CGCACTCAGG TGCACAGGCG
AACATGGCTG TGTATTTTGC AGTATTAAAT CCGGGCGATA CTATCCTTGG AATGAATCTT
TCGCATGGTG GACATCTGAC TCATGGCAGC CCTGTGAACT TTTCAGGGAA GCTTTACAAT
ATTATTTCAT ATGGGGTTGA CCCTGAAACA GAGACAATAA ATTATGATGA AGTTTTAAAA
CTTGCAAAGG AGCACAGACC AAAACTTATC TTGGCAGGCG CATCGGCGTA TCCGAGAGTA
ATAGATTTTA AAAAGTTCAG AGAGATAGCT GATGAAGTGG GAGCTTATTT GATGGTAGAT
ATGGCTCACA TTGCTGGGCT TGTTGCTGCA GGACTTCATC CATCACCTGT TGAATATGCT
GATTTTGTTA CAACCACAAC ACACAAAACG CTCAGAGGTC CACGCGGTGG TCTTATTCTT
TGCAAAGAAA AGTATGCAAA ATTAATTGAC AAGTCCATTT TCCCTGGAAT ACAAGGTGGC
CCGCTTGAAC ATGTAATAGC TGCAAAAGCT GTTGCTCTCA AAGAAGCTAT GACAGAAGAG
TTCAAAAACT ACCAGGTTCA AATATTGAAA AATGCAAAAG CTCTGAGTAC AAGACTTATT
GAAAGAGGAT TCAGACTTGT GAGTGGTGGA ACTGATAATC ATTTAATGTT AGTAGATTTG
AGAAATAAAG GTATAACAGG AAAAGACGCC GAAAAGATAT TGGATGAGCA TAATATAACA
TGTAACAAAA ATGCGGTTCC TTTTGATACT CAAAGTCCGA TGATAACAAG CGGGATAAGA
CTTGGGACGC CGGCTGTCAC AACCAGAGGG TTTAAAGAAG GGGATATGCT TGAGGTTGCA
GATATTATCC ATGATGCTTT GACAAATTCT GATACTAAAG AGAATATTTT AATCAGAGTG
AAAGCTCTTT GCGAAAAACA TCCTTTGTAT AAAGAATTTG ATGAATAA
 
Protein sequence
MYFYNLVKNT DPEIAEAIKS ELKRQQNKIE LIASENFVSI AVMAAMGSPL TNKYAEGYPG 
KRYYGGCEYI DVVESIAIER AKKLFGAEHA NVQPHSGAQA NMAVYFAVLN PGDTILGMNL
SHGGHLTHGS PVNFSGKLYN IISYGVDPET ETINYDEVLK LAKEHRPKLI LAGASAYPRV
IDFKKFREIA DEVGAYLMVD MAHIAGLVAA GLHPSPVEYA DFVTTTTHKT LRGPRGGLIL
CKEKYAKLID KSIFPGIQGG PLEHVIAAKA VALKEAMTEE FKNYQVQILK NAKALSTRLI
ERGFRLVSGG TDNHLMLVDL RNKGITGKDA EKILDEHNIT CNKNAVPFDT QSPMITSGIR
LGTPAVTTRG FKEGDMLEVA DIIHDALTNS DTKENILIRV KALCEKHPLY KEFDE