Gene Msed_1128 details

Gene Information

Locus tag: Msed_1128
Symbol:
ID: 5103600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1058857
End bp: 1060389
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 57%
IMG OID: 640507021
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_001191214
Protein GI: 146303898
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit
TIGRFAM ID:
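
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 1058857
end_bp = 1060389
gene_length_bp = 1533
protein_length_aa = 510


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate interval."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid codons in a CDS whose length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1533
print(coding_codons(gene_length_bp))   # 510
```

Under those assumptions both results agree with the listed Gene Length and Protein Length.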


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCCT GGGAAATAGT GAGCAGGGCC CTCGAGTTCG TCGCCGAGGA GATGGGGGTA 
ATGCTCAAGA GGTCTGCGAT GTCGCCCAAC ATAAGGGAGA GGATGGATCA CAGCTGTGCC
ATCGTCAACG AGAGGGGTGA GGTCGTGGCC CAGGCGGAGC ACATCCCCGT TCACCTGGGA
TCGTTCAGCG TTGGGGTCAA GAATCTCCTG CAACAGGTTG AGCTCGAGGA AGGGGATATG
GCGATCGTGA ATGATCCCTA CGTGTCGGGG ACACACCTCA ACGACGTCAT GGTAATGGCA
CCGTTACCTG GGGGACTGGG ATATGTGGTG AACAAAGCAC ATCACGTTGA CGTTGGAGGG
CCGATGCCGG GAAGCCTCAA CCCCTCAGCC TCCACGCTGT ATGAGGAGGG TTTCGTCATA
CCCCCTCTCA AGTTGATGAG GAAGGGGAAA CTCAACGAGG ATGTGGTGAA GATGATCAGG
GAGAACTTTA AGGTCCCTGA CACCTCACTT GGGGACCTGA ACGCCCAGAT CTCGGCTAAC
CTTGTGGGGA TAGCCAGGGT AAAGCAACTA GTGGAGAGGT ACGGCGTCTC GCAGGTGGTT
GAAGCATGGA ACACCTCAAT GGAGTACGCA AGGAGGCTCA CGTTGACAGA GCTGGCGAGA
TGGCCCAAGG GGTCTGGGGA GGACGAGGAC TATCTGGAAC TTGATAGGCT GACCAGGATT
AGGGCGAGGG TGGAGATTAG CGACCAGGGA GTCTTGGCCG ACTTCACGGG CACGGATCCC
CAGGTTGAGT CCCCCTTTAA CGCCGTGTAT GGGGTAACGT TCTCCGCGGT TAGCTTCGTG
ATAAGGTCTC TCCTTGGGAA GGACGTCCCA ACAAACGAGG GCTTCTATAG CGTGGTCAGG
GTTAAGGCAC CCCTGGGGAG CCTCGTGAAC CCCACGAAGC CGTCAGCTGT GGGTGGAGGA
AACGTGGAAA CCTCACAGAG GATAGCTGAC GTCACGTTCA AGGCCCTGTC CCACCTGATG
CCCGTCCCGG CAGCCGGGTC CGGGACAATG ATGAACATCA TGATGGGAGG GTTGAGAGGG
AGGTACTGGG CCTACTACGA GACTGTGGGA GGAGGCATGG GGGCCAGGCC TAACCGCGAC
GGAGTCTCAG CGGTCCAGGT CAACATGACC AACACCTTGA ACACGCCTAT AGAGATAGCG
GAAAGGCAGT ACCCCCTTCT CTTCACGGCA TACAGGGTGA GGGAGGGAAG CGGAGGAAGA
GGGAGGTTCA AGGGAGGTGA CGGGATAGTT AGGGCCTTCA AGGTCATGGA CAGGACGAGG
TTGTCAGTGA TGGCTGAGAG GTTCCTTATC CCACCTTGGG GGCTATTAGG TGGTGGAAAC
GGGAAACCAG GAAGGGTGAC GATCGTGAGA GGTGGAGTCA GAAGGGAGAT GCCCAGTAAG
TTCTCAACTG TTCTGGAGTC AGGGGACGAG GTCGTGATAG AGACGCCAGG AGGCGGGGGA
CTAGGAAAAG AGGAGGAAAC CTCGGAAGGC TAA
 
Protein sequence
MTSWEIVSRA LEFVAEEMGV MLKRSAMSPN IRERMDHSCA IVNERGEVVA QAEHIPVHLG 
SFSVGVKNLL QQVELEEGDM AIVNDPYVSG THLNDVMVMA PLPGGLGYVV NKAHHVDVGG
PMPGSLNPSA STLYEEGFVI PPLKLMRKGK LNEDVVKMIR ENFKVPDTSL GDLNAQISAN
LVGIARVKQL VERYGVSQVV EAWNTSMEYA RRLTLTELAR WPKGSGEDED YLELDRLTRI
RARVEISDQG VLADFTGTDP QVESPFNAVY GVTFSAVSFV IRSLLGKDVP TNEGFYSVVR
VKAPLGSLVN PTKPSAVGGG NVETSQRIAD VTFKALSHLM PVPAAGSGTM MNIMMGGLRG
RYWAYYETVG GGMGARPNRD GVSAVQVNMT NTLNTPIEIA ERQYPLLFTA YRVREGSGGR
GRFKGGDGIV RAFKVMDRTR LSVMAERFLI PPWGLLGGGN GKPGRVTIVR GGVRREMPSK
FSTVLESGDE VVIETPGGGG LGKEEETSEG
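
The relationship between the gene sequence and the protein sequence can be checked by translating codons in frame. The following is a minimal sketch, not the database's own tooling: it builds the codon table by hand, and the names `CODON_TABLE`, `clean`, and `translate` are illustrative. It relies on the fact that translation table 11 assigns internal codons the same amino acids as the standard code; the alternative start codons that table 11 allows are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGACCTCCT GGGAAATAGT GAGCAGGGCC"))         # MTSWEIVSRA
print(translate("CTAGGAAAAG AGGAGGAAAC CTCGGAAGGC TAA"))     # LGKEEETSEG
```

The two outputs match the first and last ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` gives a way to compare the whole record.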