Gene Msed_1298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1298 
Symbol 
ID5104549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1275749 
End bp1277263 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content49% 
IMG OID640507187 
Productnonphosphorylating glyceraldehyde-3-phosphate dehydrogenase 
Protein accessionYP_001191380 
Protein GI146304064 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTCA AGGAATTGTC CCCAGAGTTT AAGGAGATAA CTAGCTTAGA GTCTAACACG 
ACGGTCTTCA AGACTTATCT CACCGGTGAC TGGGTGTCAG CAAGGGAGCT CGAAGATGTT
ATATCCCCAA TAGACCTCAC CACATTCGCC AGGGTTCCCA GACTTCCCTA TGAGATGGTG
GACAGTGCAC TCTCCAAGAT CTCAGAGAAG GGGAGATGGG AGATCCGTGA CCTTCCAGGA
GAAAAGAGAT TGAAAATATT TCATGATATG GCCTCGCTCC TGGACAAATT TAGGAGTGAT
CTGGTTGAAG TCCTAGTGAT AGGCAACGGA AAGACTAGGG CAGCTGCCAA CGGTGAAGTG
AACGCGTCCA TAGAACGATT AATCAGGGCA GACCTGGACG TGAGGAAACT CTATGGAGAA
TACGTTCCAG GCGACTGGAG TAGTGAGAGC TTGGAGACTG AGGCCATTGT GAGAAGGGAA
CCACTGGGAG TAGTTCTTGC AATTACTCCT TTCAACTATC CACTTTTCGA CGTGGTGAAC
AAGTTCGTTT ACTCTACGGT TGCAGGTAAC GCCATCCTAA TCAAACCTGC TAGTAAAACT
CCAGTGCCCG CAATCCTTTT CGCGAGAATC GCGGAACTTG CGGGATTTCC CAAGCACGCG
TTGGGAATAC TGACCATACC TGGGAAGGAC ATGGATAAGG TAGTCTCGGA CAGAAGGATT
GGAGTCATCT CATTCACCGG AAGCACGGAG ACTGGAGAGA GGGTAATTAG GGCTGGAGGC
GTTAAACAGT ACGTGATGGA GCTCGGAGGC GGTGACTCGG CCATTGTCCT AGATGATGCT
GATCCTGTCT CCACTGGACA GAAGCTTGTT ACCTCGATCA CCTCGTACAG CGGACAGAGG
TGCGATTCAA TAAAGTTCAT TTTCTCTGAA CCTGGCGTCT ATGACAAACT TAAGCAGACG
CTTCTCAACG AGCTATCCAA GATAAAGGTT GGCGATCCCA GGGAGGACGT GAGCATGGGA
CCGATCATAG ACAGGGGAAC GGTGGATGAG CTAGAGTTCT CCGTCAAGGA CGCGCAGGAG
AAAGGAGCCA GGGTTTTATT TGGGGGCAAA AGGCTGAAGG AGAACTATGT TGAACCTACT
TTGATCGAGG CCTCAAAGGA GATCGTGAAG AACCTTTACC TTTATCAGAA GGAGGTCTTC
CTCTCGGTTG CAGTTCTAGT TAAGATGAAC AACGTGGATG AAGCTATCTC CCTGAGCAAC
TCCAGGAGAT ATGGGCTTGA TGCCTCGGTA TTTGGAGAGG ACATCAATAA GATCAGAAAG
GCCATAAGAC TACTTGAAGT GGGAGCGGTC TACATAAATG ACTTTCCAAG GCATGGGATA
GGTTACTTCC CCTTTGGAGG AAGAAAGGAC TCGGGACTAG GAAGGGAGGG AATAGGGTAC
ACCATAGAGT ACGTAACTGC CTACAAGACG GTGGTCTACA ACTACAAGGG AAAGGGTGTA
TGGGACTACA TGTAG
 
Protein sequence
MNLKELSPEF KEITSLESNT TVFKTYLTGD WVSARELEDV ISPIDLTTFA RVPRLPYEMV 
DSALSKISEK GRWEIRDLPG EKRLKIFHDM ASLLDKFRSD LVEVLVIGNG KTRAAANGEV
NASIERLIRA DLDVRKLYGE YVPGDWSSES LETEAIVRRE PLGVVLAITP FNYPLFDVVN
KFVYSTVAGN AILIKPASKT PVPAILFARI AELAGFPKHA LGILTIPGKD MDKVVSDRRI
GVISFTGSTE TGERVIRAGG VKQYVMELGG GDSAIVLDDA DPVSTGQKLV TSITSYSGQR
CDSIKFIFSE PGVYDKLKQT LLNELSKIKV GDPREDVSMG PIIDRGTVDE LEFSVKDAQE
KGARVLFGGK RLKENYVEPT LIEASKEIVK NLYLYQKEVF LSVAVLVKMN NVDEAISLSN
SRRYGLDASV FGEDINKIRK AIRLLEVGAV YINDFPRHGI GYFPFGGRKD SGLGREGIGY
TIEYVTAYKT VVYNYKGKGV WDYM