Gene Msed_0671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0671 
Symbol 
ID5105277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp613353 
End bp615002 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content46% 
IMG OID640506575 
Productputative molybdopterin biosynthesis protein MoeA/unknown domain fusion protein 
Protein accessionYP_001190770 
Protein GI146303454 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00402297 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAAATT TCGTTCCCGA GGAGAACTTA CCCACTCCTC TAGAGGCCAT TGAAAAATTT 
CTTAGAACTC TTCCTCCCAA GAGGAGAACA ATCTCTCTTG GAATATTGGA TTCCGTGGGC
AAGATAACTT CTTCATCAGT ACGTGCTAAA ACCGATTATC CACCCTTCTC TAGATCAACT
GTAGATGGGT TCGCTGTTAA ATCATCTTCT ACCCCAGGTA GGTTTAGGAT AGTTGGAAAG
ATATCCATTG GGGAATGGAA GGATATAACT ATTGGGCCAG GGGAGGCGGT AGAGGTGGAT
ACTGGATCCC CACTACCTTG GGGAGCTGAT GCCGTGGTTA AGATAGAGGA TACACATGTG
GAACTTCCCT TTGTCGAGAT AAATTCTAGG TTGAGATTTG GTGCAAACGT AGCATGGGCT
GGTAGCGATA TATCCAAGGG CTCAGAAATT ATAGGAGAGT TCCAGGAAAT TACGCCAGAA
GATGTGGGAG CATTTGCGTC TGTGGGAATA GAGACAGTTG AGGTGTATGA TCTCCCAAAG
GTTTACGTTA TCGCAACCGG TGATGAATTA GTTCAACCTG GAAGAGATCT AACGCCAGGG
AAGATATATG AAAGCAACGT TCACTTCCTG GTATCAAGGC TAAAGCAACT TGGATGCGAA
ATAGTTGGAG CCGAAGTTCT ACCTGACGAC AAGGAAAAGA TAAGGAATTC CTTAAGGACG
GCCGTAGACA AGGCCGACCT TATAATAACC ACAGGAGGAA CGAGTGCAGG TGAAAAGGAT
TACGTACATC AGCTGGTAAG GGAGATGGGA AATATCGTGA TTCAGGGTTT AAATACGAAA
CCTGGAAAGC CAACGATATT AGGGGAGATA CAAGGGAAGC CGGTCTTCGG ACTTTCTGGA
AACATTGTGG CTACCATAAT GACCTTTAAT CAGTTAGTGG AGAGATACAT CGCCGAATTC
TCAGGGAGAT CACTGGCTAC CAGAACAGCT TACCACGACA AGGTCTCGGC CATCTCCATT
TTACCAATTA AGGCTGATAG GTACAGGGTC ACTAACATCC CCGTTTACAT AGTTAAAGGG
AGAGGGGGTC ACTATGCTAT CCCGGTCCCA TTCGACAGTT ACATGGTTGG GACTTTTGCT
AGCTCTGATG GTTATGTGAC CTTACCTCCT GGTACGCAAG TCAAGGAGGG AGAGAGGATA
GAGGTAACCC TGAAAAGCCT TGATAATAGG CCTGTCATTA TAGGGGAAGA GGACGTGAGA
ATAAGGGAAG TTAAGGCTAG GAAAATACTC TTGGGAACAG TGCCCGCTTG CGCTGCACTC
AAGTACGATG TGGGGGATGC GCTGATACTG AGTGACTTCC TTTGCGAAGA TAAGGTGCCT
GAGGCGGTGG AGGTAAAGCG TTGGATCCTA TCCCATGGAA GCGGAGATCC CATAGGCTAT
CATGATTGGA TAGGAATGAG CAGGTTGGTG AAGGATCCTG CAGTTAAATT GAAATCACCC
TCTACAGCAC CCCTGTTCTT GGGCAAGGGA AAGGTGATCG CACCCCATGG ATACATAGAA
GGAGAGAAAG TAACACAAGA GAAATTAAAA GTAGTAGTGA GAAACAGGGA TCTACAATTC
TTAGAGGGAA TATTTTCTGA AGATACCTAG
 
Protein sequence
MRNFVPEENL PTPLEAIEKF LRTLPPKRRT ISLGILDSVG KITSSSVRAK TDYPPFSRST 
VDGFAVKSSS TPGRFRIVGK ISIGEWKDIT IGPGEAVEVD TGSPLPWGAD AVVKIEDTHV
ELPFVEINSR LRFGANVAWA GSDISKGSEI IGEFQEITPE DVGAFASVGI ETVEVYDLPK
VYVIATGDEL VQPGRDLTPG KIYESNVHFL VSRLKQLGCE IVGAEVLPDD KEKIRNSLRT
AVDKADLIIT TGGTSAGEKD YVHQLVREMG NIVIQGLNTK PGKPTILGEI QGKPVFGLSG
NIVATIMTFN QLVERYIAEF SGRSLATRTA YHDKVSAISI LPIKADRYRV TNIPVYIVKG
RGGHYAIPVP FDSYMVGTFA SSDGYVTLPP GTQVKEGERI EVTLKSLDNR PVIIGEEDVR
IREVKARKIL LGTVPACAAL KYDVGDALIL SDFLCEDKVP EAVEVKRWIL SHGSGDPIGY
HDWIGMSRLV KDPAVKLKSP STAPLFLGKG KVIAPHGYIE GEKVTQEKLK VVVRNRDLQF
LEGIFSEDT