Gene Msed_0220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0220 
Symbol 
ID5104086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp180729 
End bp182330 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content45% 
IMG OID640506125 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_001190321 
Protein GI146303005 
COG category[S] Function unknown 
COG ID[COG3379] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000131562 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0129038 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAGTTT TATTGGTTGT TGTGGACGGG CTGGCCTATC ATTTAATGGA GAGATTCATA 
GATCAACTTC CCACTATTCA GGAGATGGCG GAGGAAGGGG TGTATGGTCC CCTTGAGAGC
ACTTTCCCGT CCATAACCCC TGTGGCTTTA GCCTCTCTTT TCACTGGGGT ATCACCAAAG
GTTCACGGTG TTGTAGCCCC CAGAATATTC GTGAAGGGAA GAAAGATTCA GTCTAGCATA
TCTGCCTTTT CCAGTAGCTC ACTCATGGTA GACCCTATCT GGGCTACGCT GGGAAGGAAA
GGTTACAAGG TTGTGATAAC CTCTGCTCCT CAGGCCTTAC CAGATAAATG GAAACTAGAT
AATGTGATTC TGTTCGATCC ATACAAGGCC AAGGTTAAGA GATGCTCCGA GGGAACCCTC
CTCAAGGAGG GCGAAAACGA GTTTATTGGA AAGAAGTGGA GCGTAAAGGT AGAGGACCAG
ATTTACCTTG TGGGAGTGGA AGGACAAGAA TTCAAAGTGG AGATGGGGTC ATGGATTGGA
CCTCTGGAAG TAAACGGAAA GTGCGGAGAG GAAGAGTTGA AGGCTTCTAT CTTTCTTCAT
GCAACTCCCC GTGGAGTATA TGTAACTCCG CCAGCCTTTC TGAACTTTAA GTGGGGAAAC
AACAGGGACC TTCTCCTTGA GGTTTGGGAG AATGTAGTCA AGAAAGTTGG AATGTTGCTG
GACGGTGATT ATAAGGGATT AAACAAGGGC CTAATCACGT TCGATGAGTA TTTAAAGACA
GCTGAACTCT CCTTTAACTT TTTCGTGGAA TACTCGCTCT ACCTCCTTAG GAGAAGCGAT
TGGGATTTTG GAGTCACCTA TCTTCCGATA GTGGATAATC TTCAGCACCT TTTGTATGGC
GTGGATGATG GGAAGGCACT AGAGCACATT TTTCAGGCTT ACAAAATGGC TGATAAGTTT
CTAATGTTGC ATAGGAGTTT AGCAGAAAAT ATATTCCTTT GTTCCGACCA CGGAATAACA
AAAATAAAGA AGAGAGTCTA TGTCAATAAA ATATTGGAGA GGTTAAACGT GTTGAAAATG
GACGACGGTA AAATAAACTG GGGGAAAACT AAGGCCTACT ATGGCGGGGG AGGCTTAATC
AGGATAAACC TTAAAGATAG GGAGGAAGCC GGCGTGGTCT ATCCGAAGGA GTATCAGAAG
CTGGTAAGGT ATATCGTGAA AAACCTAGAG GATCTTAAGG ACGATGATGG TGAATCCATT
TTCACGGGTA TTTACATGAG GGACACTCCT GCCTCAGACA GACAGGGTGA CATTGAGCTT
AGCATTAGGG ACTATTATTC GCTCAGTTCC AACGTTGATC ATGAGAACGA GATAGATACC
GTGAAACCCT ATTCCACTTC CACGGGAGAT CACGGGTTTT ACAGGAAGGA GGACCTTTAT
GGAGTGATCC TAGGAATAGG ACCCAAAATA GCCAGGGGTA AGAAGATAAA GGCCAAGATC
GTCGACATAG CTCCCACAAT CCTGAAGATT ATGGACGTTC AAGGTCCTAA GATGGAGGGA
AGGGTCCTAG TGGAGGCCCT GAGTAATGGA GGTCAGGAGT AA
 
Protein sequence
MKVLLVVVDG LAYHLMERFI DQLPTIQEMA EEGVYGPLES TFPSITPVAL ASLFTGVSPK 
VHGVVAPRIF VKGRKIQSSI SAFSSSSLMV DPIWATLGRK GYKVVITSAP QALPDKWKLD
NVILFDPYKA KVKRCSEGTL LKEGENEFIG KKWSVKVEDQ IYLVGVEGQE FKVEMGSWIG
PLEVNGKCGE EELKASIFLH ATPRGVYVTP PAFLNFKWGN NRDLLLEVWE NVVKKVGMLL
DGDYKGLNKG LITFDEYLKT AELSFNFFVE YSLYLLRRSD WDFGVTYLPI VDNLQHLLYG
VDDGKALEHI FQAYKMADKF LMLHRSLAEN IFLCSDHGIT KIKKRVYVNK ILERLNVLKM
DDGKINWGKT KAYYGGGGLI RINLKDREEA GVVYPKEYQK LVRYIVKNLE DLKDDDGESI
FTGIYMRDTP ASDRQGDIEL SIRDYYSLSS NVDHENEIDT VKPYSTSTGD HGFYRKEDLY
GVILGIGPKI ARGKKIKAKI VDIAPTILKI MDVQGPKMEG RVLVEALSNG GQE