Gene Msed_1301 details

Gene Information

Locus tag: Msed_1301
Symbol:
ID: 5104552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1279946
End bp: 1281022
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 53%
IMG OID: 640507190
Product: galactose 1-dehydrogenase / glucose 1-dehydrogenase
Protein accession: YP_001191383
Protein GI: 146304067
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
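
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 1279946
end_bp = 1281022

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1077
print(protein_length_aa)  # 358
```

Under these assumptions the results match the listed Gene Length (1077 bp) and Protein Length (358 aa).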


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCAA TTATTGTAAG GCCACCTAAT GAGGGTGTCG AGGTTAAGGA CATCACATTG 
AGGGAATCCA CGGACGGGAA GATAGTTGTC AGGACTAGAC TCAGCGGTCT GTGTGGAACA
GATAGGGGTC TAGTCACGGG AAGGCTTACC TTCGCAAGGC CTCCACCGGG ATACGATTTC
CTTATCCTGG GTCACGAGAC TCTTGGTGAA GTGGTAAAGG GTAATGGAGA GTTCAGTCCC
GGGGACCTAG TTGTTCCAGT GGTCAGGAGG GGTTGTGGAT CCTGCCTAAA CTGTATGCTG
GGAAGGCAGG ACTTCTGTGA AACCGGAAGA TTCACGGAGA TCGGAATAAG GGGAGCTCAC
GGTACCATGA GGGAGGAGTT CTTAGAGGAC CCAAAGTACC TAGTTAGGGT TCCAAGGGAA
CTAGGAGATG AGGGAGTTCT ATTGGAGCCT CTCTCAAATG TCGTGAAGGC CCTCACAGAG
ATGGAATATC TTCAGAGGAG GTCGTGGTGG AGGTGCGACG ATTCCACCTA CTCGTGCAGA
ACAGCTGTGG TACTGGGGAG TGGACCCATA GGTCTCCTGT TCTCCATGGC CCTGAGAAGT
ATGGGCTTCC GCGTGATTGT GGCGAACAGG AGGCCCCCAT CCCAGGTTGA GAGCGAAATA
ACTCGAGATA TAGGGGCAAC CTTCCTCAAC ACCTCTGAGC ATGAGGACCT TGAGCCAGAT
CTCATTGTGG ACACCTCTGG GCATCCCTCA GCCGTCGTCC CCTTACTTCC TAGAATCAGG
AAGAACGGTG CGGTGATCCT CTTTGGAACA ACTGGGCTAG AGAGATATGA GCTAACTGCA
GAGGAGATAA CCATGTTGGT TGAGAACAAC ATCCTGATCT TTGGGAGCGT GAATGCCTCA
AAGGCCGATT TCCAGGCTGG AGTTAACCTT CTAGTGGAAT GGAAGGCCAG GTATCCAGGC
GTCCTCCAAA GGATGATCAC CAAGAGGGTC AGCGTGGAAG AGGCCCCCCA AGTCCTGAAG
GAAAAGGTCC CGGGGGAGAT AAAGACGGTC ATAGACTGGA CTGCTCGTGA GAGTTAA
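
The record lists a GC content of 53% without saying how it was derived. A minimal sketch of the usual calculation follows, assuming the figure is the fraction of G and C bases over the gene sequence; `gc_fraction` is a hypothetical helper, and only the first line of the gene sequence is used as sample input, so the printed value will not equal the whole-gene figure.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, with its 10-base blocks kept as printed.
first_line = (
    "ATGAAGGCAA TTATTGTAAG GCCACCTAAT GAGGGTGTCG AGGTTAAGGA CATCACATTG"
)

print(f"{gc_fraction(first_line):.1%}")
```

Passing the full 1077 bp sequence to the same function gives the value to compare against the listed 53%.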
 
Protein sequence
MKAIIVRPPN EGVEVKDITL RESTDGKIVV RTRLSGLCGT DRGLVTGRLT FARPPPGYDF 
LILGHETLGE VVKGNGEFSP GDLVVPVVRR GCGSCLNCML GRQDFCETGR FTEIGIRGAH
GTMREEFLED PKYLVRVPRE LGDEGVLLEP LSNVVKALTE MEYLQRRSWW RCDDSTYSCR
TAVVLGSGPI GLLFSMALRS MGFRVIVANR RPPSQVESEI TRDIGATFLN TSEHEDLEPD
LIVDTSGHPS AVVPLLPRIR KNGAVILFGT TGLERYELTA EEITMLVENN ILIFGSVNAS
KADFQAGVNL LVEWKARYPG VLQRMITKRV SVEEAPQVLK EKVPGEIKTV IDWTARES