Gene Msed_0050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0050 
Symbol 
ID5105189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp45010 
End bp46110 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content46% 
IMG OID640505945 
Productamidohydrolase 
Protein accessionYP_001190151 
Protein GI146302835 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.352004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.512598 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAAGCTT CTTCAAGAAT AGTAAACGTA GAATATGCTC TACTGGGTCA AGATTTAGAA 
CTGGTACACA AAATTCACCT GGAGATCAGT GACGGGATAA TCTCCCACAT AGGAAAGGGA
TGGGATGTTA AGGGCGAATC CTACCCTAAT TCCCTTCTCA TGCCTGGGCT GGTAAACTCC
CACGTTCATA CCTTCGATGG GATCGCACCT GAGTTAGGAT GGAACCTAAC CCTTAAGGAG
GTGGTAGGCG ACCCTCATAG CGAGAAGTAT AGAGTACTCT CCCTGAGGAG CCTCCCTGAG
TTAAGGGCCT CAACCCTTAA CTTCCTTAAC AGGTCCTTGG AGCTTGGGAT TTTCACTGTA
ATCGACTTCA AGGAAATGGA TATCAATGGT GCAAAAATCT CGAAAGAGGT TAAGGAAATC
TCACCCATTA ACTACGTGAC TCTGGGGAGA TTGGACGGTG AGATAACAAG GGATAGACTT
GAAATCCTCA AGGAGTTGGT GGACGGGTAT GGGGTAAGTA GCGTATCCAT AGGGATGGAA
AAGCTTAGCC TCATTCGAGA GGTATTTAGG GATAAGATGA CCGCTATCCA TGTTTCTGAA
ACGCTTAGGC ATAACTTGGC CTCAGACCTT GAGACCTCTC TTTCCACTCT TAAACCTGAT
ATGGTGGTTC ATGGCATCCA TCTATCTGAG GAGGAGATGG AACTCTTGGC AGAAACGGAC
ACCAAACTGG TTATATGTCC TCGAAGTAAC CTATGGTTCT CAACGGGTAT TCCCAACATT
CCCATGATGA TAAGAAAAGG GGTTAGACTA CTAATCGGAA CCGACAACGC CGGGATCACC
GATCATGATC TCTGGAAGGA GTTAGAGGTT GCATTACTTC TATCGCGACT TCTGGATCCA
GGAAGTGATT TTTCCAGGGA TATCCTTAAG TCTGCTACCG TTAACCCTGG AAAAGGGGTT
TATCCAATAG AGGAGGGAAA TAGGATGACA GGAATAATCA TGGGGCCACT CCCCAGGTTC
GAAGTCTCAA ATAACAGGTA TATGGCACTA ATCAAGGAAC CTGGAAAAAT AATTAGAGTC
TTGGGGCTAC CCAAAATCTA A
 
Protein sequence
MEASSRIVNV EYALLGQDLE LVHKIHLEIS DGIISHIGKG WDVKGESYPN SLLMPGLVNS 
HVHTFDGIAP ELGWNLTLKE VVGDPHSEKY RVLSLRSLPE LRASTLNFLN RSLELGIFTV
IDFKEMDING AKISKEVKEI SPINYVTLGR LDGEITRDRL EILKELVDGY GVSSVSIGME
KLSLIREVFR DKMTAIHVSE TLRHNLASDL ETSLSTLKPD MVVHGIHLSE EEMELLAETD
TKLVICPRSN LWFSTGIPNI PMMIRKGVRL LIGTDNAGIT DHDLWKELEV ALLLSRLLDP
GSDFSRDILK SATVNPGKGV YPIEEGNRMT GIIMGPLPRF EVSNNRYMAL IKEPGKIIRV
LGLPKI