Gene Msed_1650 details

Gene Information

Locus tag: Msed_1650
Symbol: glyA
ID: 5104855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1591453
End bp: 1592748
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 50%
IMG OID: 640507541
Product: serine hydroxymethyltransferase
Protein accession: YP_001191729
Protein GI: 146304413
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0112] Glycine/serine hydroxymethyltransferase
TIGRFAM ID:
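
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes the start/end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
START_BP = 1591453
END_BP = 1592748
GENE_LENGTH_BP = 1296
PROTEIN_LENGTH_AA = 431


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should agree with the recorded lengths.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the recorded gene length and protein length are both consistent with the coordinates.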


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.278355
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTTC AAAAGGAACT CGAAAAAGTG ATTGAGCTCA CCAGAAGTCA GAACAGGTGG 
AGAAGGACTG AGACCATCAA TCTGATTGCC TCCGAGAACG TTATGAGTCC CTTGGCTGAA
GCCCTCTACA TGAGCGATTT CATGTCAAGA TACGCTGAGG GAAAACCCTT CAAGAGGTTC
TATCAGGGAA CGAAGTACGT GGATGAGGTC GAGACTCTGG CCATGGACTA CATGAACCAG
GTCACGGGGA GCAAGTTCTG CGACCTAAGA CCCACAAGTG GTACGTTGGC CAACGCCGCA
GTGTTCAGGG TTCTGGCCAA TCCTGGAGAC AAGGCTTTAA TCGCACCGGT TCAGGCTGGG
GCTCACGTTA GTCACACAAA GTTTGGAACC CTTGGGGCCC TTGGAATAGA GCACATAGAG
ATGCCATACG ATGAGGAAAA CATGAACGTT GACGTGGATA GGGCAGTGAA AATGATAGAG
CAGATTAAGC CCAAGTTCGT TGTTCTGGGA GGTAGCCTTT ACCTTTTCCC GCATCCCACT
AAGGACCTCG CGCCTCACGT TCACTCGGTG GGGGCAAAGC TGGTGTATGA TGCAGCCCAC
GTTTATGGGC TCATGACTGG TAAGGTCTGG AGTAATCCTC TTGATGAGGG AGCTGACTTC
CTTAATGTTT CTACCCACAA AACTTTTCCA GGACCTCAAG GCGGAGCCAT ATTCTCGAAC
GAGGAGGAGG AGTTCAAGAA GGTGTCTAGG ACAATCTTCC CCTGGTTCGT GAGTAACCAC
CATCTACATA GGTTACCCTC CACGGCTGTA ACTGCGCTCG AGATGAAGGT TTACGGGGAG
GATTACGCTA AGCAGATAAC CAGGAACTCG AAAGCCTTGG CGGAGGCCCT AGCCTCTTTC
GGCTTCAAGG TGATAGGAGA ACACCTGGGA TACACCAAGA GTCACCAGGT TGCTGTCGAC
GTGAAGAACC TTGGAGGAGG AGCCTACGTT GCGAAGACTC TCGAGAGTGC TAACATCATA
GTGAACAAGA ACCTCTTACC TCACGATCCA CCGGAGGCAG TTAATGATCC AAGCGGAATA
AGAATTGGGG TTCAGGAAAT GACGAGATTC GGGATGAAGG AGGGTGAGAT GGAAGAGATA
GCAGAATTGA TGAAGCAGAT CCTCGTGGAT AAGAGGGACA TTAACGAGAT GAGAAGGAAG
GTAACCGAGA TGAGATCCAG ATTCCTTGAG GTTAAGTACG CGCTAACGTA TGATCTGTCC
AAGTATAACT CAAAGCTGAT CCCGATGATA CTCTAG
 
Protein sequence
MDVQKELEKV IELTRSQNRW RRTETINLIA SENVMSPLAE ALYMSDFMSR YAEGKPFKRF 
YQGTKYVDEV ETLAMDYMNQ VTGSKFCDLR PTSGTLANAA VFRVLANPGD KALIAPVQAG
AHVSHTKFGT LGALGIEHIE MPYDEENMNV DVDRAVKMIE QIKPKFVVLG GSLYLFPHPT
KDLAPHVHSV GAKLVYDAAH VYGLMTGKVW SNPLDEGADF LNVSTHKTFP GPQGGAIFSN
EEEEFKKVSR TIFPWFVSNH HLHRLPSTAV TALEMKVYGE DYAKQITRNS KALAEALASF
GFKVIGEHLG YTKSHQVAVD VKNLGGGAYV AKTLESANII VNKNLLPHDP PEAVNDPSGI
RIGVQEMTRF GMKEGEMEEI AELMKQILVD KRDINEMRRK VTEMRSRFLE VKYALTYDLS
KYNSKLIPMI L
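
The sequences above are printed in space-separated groups of ten, so they need to be joined before any analysis. The sketch below is an illustration only, with hypothetical helper names: it strips the whitespace, computes a GC fraction, and checks that a CDS is a whole number of codons, begins with ATG, and ends in a stop codon. Translation table 11 also permits alternative start codons; this check covers only ATG, which is what this gene uses. The demo uses just the first and last rows of the gene sequence, not the full 1296 bp.

```python
STOP_CODONS = ("TAA", "TAG", "TGA")

# First and last rows of the gene sequence, copied from the record above.
FIRST_ROW = "ATGGACGTTC AAAAGGAACT CGAAAAAGTG ATTGAGCTCA CCAGAAGTCA GAACAGGTGG "
LAST_ROW = "AAGTATAACT CAAAGCTGAT CCCGATGATA CTCTAG"


def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if length is a multiple of 3, starts with ATG, ends in a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    fragment = clean_sequence(FIRST_ROW + "\n" + LAST_ROW)
    print(len(fragment))
    print(looks_like_cds(fragment))
    print(round(gc_fraction(fragment), 3))
```

Pasting the full gene sequence block into `clean_sequence` should let the same functions be compared against the Gene Length and GC content fields of the record.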