Gene Msed_2026 details


Gene Information

Locus tag: Msed_2026
Symbol:
ID: 5105248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1954244
End bp: 1955467
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 47%
IMG OID: 640507914
Product: amidohydrolase
Protein accession: YP_001192090
Protein GI: 146304774
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
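
The coordinate and length fields above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes one terminal stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1954244
end_bp = 1955467
gene_length_bp = 1224
protein_length_aa = 407

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which adds no amino acid.
expected_protein_aa = span_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the values reported in the record.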


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.761901
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGAGTAT CAAATAGAAC GTATACACTG AGGAATTGCG CCTTTGCCGT TGATTACAGT 
CACGTCGAGG GGCCAACGAA CATAGTCGTC GAAGACGGTT TCATTAAGCA CGTTGGTAAG
GAGGTTGAAG GAGACGAGTT GGAGTGCAGT GAGTACGTGG TAATGCCTGG TTTGGTGAAT
GCCCATACTC ACTCAGCCAT GACAGTCCTG AGGGGTGTAT TTGACGACGG GGAACTTCAC
GAGTGGTTAG CTCGAATGTG GGACGAAGAA AGGAAGTTAA CCAGGGAGAT TATGGCTGTA
GGTTCAGAAA TCGCAGTGAT TGAGATGATC TCCTCTGGGA CAACGGCCTT CGTTGATATG
TACTTCAACC CTGACCAGAT AAGGGATATC TCCACTCAGT ATGGGATTAG GGCGAGAGCG
GGCCCAACTC TCATGAAAGA CAAGAGTGTC GATGAAACCG TAAGGGAACT ACGTGCACTG
GGGGAAAGTG AGTTCTTTAG GCCCATCGTT AACGTCCACA GTCTCTATGC CACGGACCTT
CAGAAGCTTA GGGAGCTAAG AGATAACCTG AACCGAGGGT ATCATCTCCA CATTCATCTT
TCAGAAACAA GGGAAGAGGT CTTCCAGATA AAGAGAAGAT ATGGGATGTT CCCGGTGGAG
CTCATTCACA GGGAAGGTCT AACGGAACGT GTGCATGGTG TACATCTAGG CTGGATAACC
TCATGGGAAC TCAACTATCT GAGAAGTTCC ATTGCGGTCA CTCATTGTCC AACGTCTAAC
ATGAAGCTTG CCACCGGAGG GGCTTTTCCC ATGAAGGAGG CGTTGACTCA AGGACTTAAT
GTAACCATTG GGACAGATGG TGCAGCGAGC AATAACTCCC TCAACATGTT TCAAGAGATG
AAAATGGCAG TCTTGCTACA GAGACATAAT TACTGGTCCA CAGGAATAAC TGCAGTTGAC
GTGTTTAGGG CATCATCAGT TAACGGGTAT AAGATGCTGG GCATACGCGG AGGGGAGATT
AGGCCCGGAT ACGTGGCTGA TCTAGTCCTG CTAAGTAAGT ATGAGGTTTA TCCATTAACC
AAGGAGAGAC TTCTATCTCA TCTAGTTTAC AATCCGCCAA AGGAAGTTGA GAAGGTTATA
ATCCAAGGAA AGATTGTTTA TCAAAAGAAT GACTTTAGGG ATAGGTTGAA GAAACTTTTA
GAGAAGTTAA GCCTTTACCT CTAA
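
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any calculation. The sketch below shows one way to strip the layout and compute a GC fraction that can be compared with the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Sample input: the first line of the gene sequence as printed in the record.
first_line = "GTGGGAGTAT CAAATAGAAC GTATACACTG AGGAATTGCG CCTTTGCCGT TGATTACAGT"
seq = clean_sequence(first_line)
print(len(seq), "bases, GC fraction", round(gc_fraction(seq), 3))
```

Pasting all lines of the gene sequence into `clean_sequence` should give a string whose length equals the reported gene length, which is a quick way to confirm that nothing was lost in copying.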
 
Protein sequence
MGVSNRTYTL RNCAFAVDYS HVEGPTNIVV EDGFIKHVGK EVEGDELECS EYVVMPGLVN 
AHTHSAMTVL RGVFDDGELH EWLARMWDEE RKLTREIMAV GSEIAVIEMI SSGTTAFVDM
YFNPDQIRDI STQYGIRARA GPTLMKDKSV DETVRELRAL GESEFFRPIV NVHSLYATDL
QKLRELRDNL NRGYHLHIHL SETREEVFQI KRRYGMFPVE LIHREGLTER VHGVHLGWIT
SWELNYLRSS IAVTHCPTSN MKLATGGAFP MKEALTQGLN VTIGTDGAAS NNSLNMFQEM
KMAVLLQRHN YWSTGITAVD VFRASSVNGY KMLGIRGGEI RPGYVADLVL LSKYEVYPLT
KERLLSHLVY NPPKEVEKVI IQGKIVYQKN DFRDRLKKLL EKLSLYL
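
The protein sequence can be re-derived from the gene sequence as a consistency check. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns amino acids to codons as the standard code does, that the first codon (here GTG) is an initiator read as methionine, and that the sequence ends in a single stop codon. `translate_cds` is a hypothetical helper name.

```python
# Compact codon table in TCAG order (standard assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate_cds(text: str) -> str:
    """Translate a block-formatted coding sequence into one-letter amino acids."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    # Drop a terminal stop codon if one is present.
    if protein and protein[-1] == "*":
        protein.pop()
    # Assumption: the first codon is the initiator and is read as Met,
    # even when it is an alternative start such as GTG.
    if protein:
        protein[0] = "M"
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate_cds("GTGGGAGTAT CAAATAGAAC GTATACACTG"))
```

The printed residues can be compared with the first block of the protein sequence above. Passing the full gene sequence should yield a string that can be compared against the whole protein once its spaces are removed.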