Gene Msed_1127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1127 
Symbol 
ID5103599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1056935 
End bp1058860 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content57% 
IMG OID640507020 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_001191213 
Protein GI146303897 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.814296 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTGCA GGGTTGCAAT TGACGTTGGT GGGACATTCA CAGACTTCAT AGCCCTAGTG 
AACGGGGAGA TAGTCACGGT AAAGACTCTA ACAAATCCAA CGAGGCCCTC TCAAGTCATT
AAGGACGTCC TGTCCTCGCT TGGATGTGAG GTCACGGAGG TCGTTCACGC GACCACTCTC
GCTACCAACG CCCTCCTGGG ACAGGAGAAA CTGGATCTCC CGAGGACGGC GCTCCTCACA
ACTAGGAGTT TCAGGGACGT GATAGAGATA GGGAGGCAGA ATCGACCCCG TCTCTACGAC
CTCAACTTCG AGAAGCCGAG GCAACTGGTA CCAAGGGAGT TGAGGGTTGA GGTTCAGGAG
AGGGTTGACG CTCAGGGTAA CATACTAGAA AGAGTTGAGG AGTCCGAGAT CGAGAAAATT
GCCCAGAACC TAAGGGAAAG AGGCGTTGAA TCCGTCGCGA TCAGTTACCT TCACTCTTAC
CTTAACCCCA CTAACGAGAT CAGGACTGGG GAAGTCCTCT CTCGTCACTT CAGGTTTGTG
TCCCTTTCCT CTGAGGTCGC CCCTGAACCG AGGGAGTATG AGAGAACCTC CACCACCGTG
GTTAACGCTG TCCTCATGCC CCTGATCTCC TCGTATCTGC AGGAACTCAA CTTCCTTCCC
TCCTTTCTGG TAATGTCAAG CTCAGGTGGC CTAGTTGACG TGGAGGAGGC CTCAAGGAAA
CCTGTCCAGC TCGTGGAGTC GGGCCCAGCC GCAGGGGTGA TAGCCTCAGC CTCCCTCTTT
CCAGGTAACG TGATCAGCTT TGACATGGGC GGGACCACGG CGAAGGCTGG GGTCGTCATC
GACGGAAAGT TCGAGATCAC GACCGAGTAC GAGGTAGGTG GTGAGGTTCA TCACGGCAGG
GTGGTGAAGG GTAGCGGTTA CCCCGTGAGG TTCCCCTTCG TGGACTTGGC AGAGGTCTCA
GCTGGAGGAG GGACGGTGAT CTGGAGAGAC GACGCTGGGG CCTTAAGGGT TGGTCCCTTG
AGTGCAGGGG CAGACCCAGG TCCCATGTGT TACGGAAGGG GAGGTGATAA GCCCACGGTG
ACGGACGCAA ACCTTGTCCT GGGAAGGGTG GGGGAGGTGA TCGGTGGAGG GATGAGGCTA
AAGCCCGAGT TGGCGAGGAA GGGGCTATCC AGGCTCGGTG ACCTGGAGGA CGTGAGTAGG
GATGCCCTTG CCCTGGTAAA CCTGGAGATG GCCAGGGCCA TAAGGCTTGT CACGGTGGAG
AGGGGGCTGG ATCCTTCAAG CTTCAGTCTC GTGGCCTTCG GTGGGGCTGG GCCACAGCAC
GCGGTTTACC TGGCAGAGGA ACTGGGAATT TCCAAGGTGT TGATTCCACC TTACCCTGGG
TTGTTTAGTG CCCTAGGCCT CCTCCTGGCT GACTGGCGCT TTGAGGCTAG GAAATCCTTT
CCCAGGGACC TCGAGGCCGA GTTCGTGAAG CTGGAGAGGG AGCTTTACGA CAGGTTGAAG
GGGAAGGTGG GTCACTTCCT CAGGTACGCT GACGTCAGGT ATCAGGGCCA GGGCTGGGAG
CTCACAGTCC CCGTGAACGA CGTCAACGAG ATCAGGCAAG TCTTTGAGGA GAAGCACCTC
TCAACCTACG GCTTCGTGAT GAGCGATAGG GAGATTGAGG TCGTGACCAT AAGGGTGTTC
GCCGTGAGAA GGAGACCCCT CCCACAGCTC TCAGTTGTGT CGGGGCAGGG GGACAGCCCC
GTCAAGAGGA GGAAGGCCCT CCTAGAGGAC GAGTGGGGTG AAGTGGACGT GTACGTTAGG
GAGAAGTTGA GGAGGGGGGT TAGGGTGAGA GGTCCCGCAA TCATAGAGGA GTTCAGCTCC
ACTACGGTGG TCAAGGACGG GTGGGAGGCC CTAGTAGACG AGTCCATAAC CTTGGTGAGA
CCATGA
 
Protein sequence
MECRVAIDVG GTFTDFIALV NGEIVTVKTL TNPTRPSQVI KDVLSSLGCE VTEVVHATTL 
ATNALLGQEK LDLPRTALLT TRSFRDVIEI GRQNRPRLYD LNFEKPRQLV PRELRVEVQE
RVDAQGNILE RVEESEIEKI AQNLRERGVE SVAISYLHSY LNPTNEIRTG EVLSRHFRFV
SLSSEVAPEP REYERTSTTV VNAVLMPLIS SYLQELNFLP SFLVMSSSGG LVDVEEASRK
PVQLVESGPA AGVIASASLF PGNVISFDMG GTTAKAGVVI DGKFEITTEY EVGGEVHHGR
VVKGSGYPVR FPFVDLAEVS AGGGTVIWRD DAGALRVGPL SAGADPGPMC YGRGGDKPTV
TDANLVLGRV GEVIGGGMRL KPELARKGLS RLGDLEDVSR DALALVNLEM ARAIRLVTVE
RGLDPSSFSL VAFGGAGPQH AVYLAEELGI SKVLIPPYPG LFSALGLLLA DWRFEARKSF
PRDLEAEFVK LERELYDRLK GKVGHFLRYA DVRYQGQGWE LTVPVNDVNE IRQVFEEKHL
STYGFVMSDR EIEVVTIRVF AVRRRPLPQL SVVSGQGDSP VKRRKALLED EWGEVDVYVR
EKLRRGVRVR GPAIIEEFSS TTVVKDGWEA LVDESITLVR P