Gene Msed_0631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0631 
Symbol 
ID5103791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp576222 
End bp577760 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content47% 
IMG OID640506535 
Productdihydropteroate synthase-related protein 
Protein accessionYP_001190730 
Protein GI146303414 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0294] Dihydropteroate synthase and related enzymes 
TIGRFAM ID[TIGR00284] dihydropteroate synthase-related protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.498786 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTTA TTTTAACTTA CGATTCTAAG ATCTCCTGGA TCATGAGAGT ACTGCTAATT 
ACAGGTAAAT TGGCTAGGCC GATCTTGGAG GAGGCTGTCA AAAACTTGGA AGGCGTCTCC
ATTTTGGCCC TAGATTACCC TGTTGCCTCC CTAATGAGCA TAAGGTACAT TCTAGAGAAG
TTAAAGGAGA GAAAGGAGCT CCTATCTCAA TACGACTATC TTGTTTTGCC AGGGCTAGTT
TATGGTGACG CAAAGATAAT CGAGAAGGAA CTCGGAATTA AGACCTACAA GGGAACTGAA
AATGCCTGGG ACGTGGTGAG GGTGATAGAT GCTCTGAGGA ATGGTGTAGA GCTATCTACC
ATATACCCTG CTGACGTCAT TCTAAATACC TCGTTGATGG AGGATGCCAT GAAAATTCTG
GATGAGGTTG AGGCTAACGC TCAGTATGCG TTCGATATAG GGTTTAGGAT TCCGCTTCGC
CCTCCACCCT TTAGGCTTCT CCTCGAACTC GACCCATCAC AGCCTCTAGA CAAATGGCTT
CACGAGATAG GGAGAGTAAA ACCATACGTG GATGGTGTGG TGGTGGGGTT TCCGGTAGGT
TTCTCGGATC TAGATGAGGT AAGGAGAAGG GTTAAGTCTG TTAGGGACGT GATTTCGGTT
GTGGGGATAG ACAGCGACTC CCCTCAGATA CTAAAGGAGG GAGTAAGGGA AGGTGCGTCT
CTAGTCCTCA ATCTCAATGA GGACAACATG GAGAAACTTG AGGAGCTCAA GAAGGAGTCT
GCGTTCGTTG TGGCCCCCTT CTCGACAGAG AATAGGGCTG AGACAACTGT GAACCTTGTG
AGAAAGGCCA AGCACATGGG TTTCGATAGG CTCATTGCTG ACCCAGTTCT CTCACCCCCT
CTCATGGGCT TCACGGAGAG TATCTTGGAT TATGCTAGGC TAAGGGCAAC TTTACCAGAT
ACACCACTAA TGATGGGTAC GTTGAATGTT ACGGAGTTGA TTGATGCTGA CAGCCATGGG
GTAAACGCAC TTCTTTCAGT TATGGGAATG GAGCTAGGTA TTTCAGTCTT CTTGACCATG
GAGAAGGGAA AGACCAGATG GAGTGTATGG GAATTGAGAG AGGCCACTAA GATGGTTTCC
ATTGCTCACT CTCAGGGAAA GGTACCCAAG GATGTGGGAA TAGATCTTCT CCTCCTTAAG
GATAAAAAGA GATTGAAAGC CTTCTCCCCT CAAACAAAAA GGATACCTGC ACGGGAAATT
GAACCAGAAA TGGACAGGGC TGGCTTCGTT CACATTGTCC TTGAGGATCG AAAAATAGTA
GCTTCGTTTA GGGGAAAGAA AGAGGTGTCT GTTGAAGGGC AAGATGGGCT TCTAGTGGGA
AGAACGCTCC TAAGGGAAGT AGGGGATATC TCGCCAGAAC ATGCCCTGTA TATAGGGTAT
GAGCTAGCTA AAGCCGAAAT TGCTAGCTAT CTGGACAAGA ACTACATTCA AGATAAACCC
CTTGTAAGGA GGATAGGCCT TGAGGATAGT GGTGCCTAA
 
Protein sequence
MSFILTYDSK ISWIMRVLLI TGKLARPILE EAVKNLEGVS ILALDYPVAS LMSIRYILEK 
LKERKELLSQ YDYLVLPGLV YGDAKIIEKE LGIKTYKGTE NAWDVVRVID ALRNGVELST
IYPADVILNT SLMEDAMKIL DEVEANAQYA FDIGFRIPLR PPPFRLLLEL DPSQPLDKWL
HEIGRVKPYV DGVVVGFPVG FSDLDEVRRR VKSVRDVISV VGIDSDSPQI LKEGVREGAS
LVLNLNEDNM EKLEELKKES AFVVAPFSTE NRAETTVNLV RKAKHMGFDR LIADPVLSPP
LMGFTESILD YARLRATLPD TPLMMGTLNV TELIDADSHG VNALLSVMGM ELGISVFLTM
EKGKTRWSVW ELREATKMVS IAHSQGKVPK DVGIDLLLLK DKKRLKAFSP QTKRIPAREI
EPEMDRAGFV HIVLEDRKIV ASFRGKKEVS VEGQDGLLVG RTLLREVGDI SPEHALYIGY
ELAKAEIASY LDKNYIQDKP LVRRIGLEDS GA