Gene Msed_1526 details

Gene Information

Locus tag: Msed_1526
Symbol:
ID: 5104054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1487501
End bp: 1488472
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 47%
IMG OID: 640507413
Product: DHH superfamily phosphohydrolase
Protein accession: YP_001191606
Protein GI: 146304290
COG category: [R] General function prediction only
COG ID: [COG2404] Predicted phosphohydrolase (DHH superfamily)
TIGRFAM ID:
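
The coordinate and length fields above are consistent with one another, which a short calculation can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon of the gene is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1487501
end_bp = 1488472

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and codes for no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 972
print(protein_length)  # 323
```

Both results agree with the Gene Length (972 bp) and Protein Length (323 aa) fields.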


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.390938
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTACT ACGCCATAGT GCATAACGAC TTTGATGGGA CTGCCTCGGC CAGCGTTTAC 
GCGAGAGCTG TCAATTCCCT CCCGAGAAAC ATCTGGTTCA CTGAGCCAAC TAAACTTCAC
GAGGTTTTAG CCAAGTTAGA GTTGAGGGGA GTCTCCAGCG TGATGATAGC AGACCTAGGT
ATCAATGAGT CCACCTTTCC TTCGATAGTT GAGGCTGTGA AACGTCTTAG AAGTGAAGGT
GCCACAATAC AATGGTTTGA TCATCATGTT TGGAAGGAGG AGTGGAAATC GAAGCTCAAG
GAGGTAGGGG TAGAAGTCTA CCACGATGTT ACTACCTGCG GTGCAGGCGT GGTAAACAAG
GTCATGAACC CCAATGACGA GGTATCCAGG AGATTAGCCT CTGCGGACTG CTCCGTGGAT
ATATGGCTTC ATGACGATCC ACTTGGTGAA AAATTGAGAA GGATTGTGGA GAATGACAGA
AGGTTTGAAT GGAAGAAGAA ATTGCTTGAG ACCTTTTATG GTGGAACCCT TTGGAACGAC
GAGTTCCAAA AAATCTTGGA GACTAGAATT AACGAGGAAT TGAAAGGATA TCAAAGGATC
TGGAAATATG TGAAGGTGTT GGACGTTGAA GGTGCTAAGG TAGTGGTTGC GATAAGGTGG
AAGGGTCCGC CTGACATAAG CTATGCCTCT CAGTTCCTTA TGACGAGAAC AGGGGCAGAC
ATATTCGTTT CAGCTAATGG GAAGGCAGTT TCGTTCAGGA GCAATACGAT AGATGTGAGG
AGGTTTGCAG CTGGACTAGG TGGCGGAGGA CATCCTCTTG CCGCAGGAGC ATCCCTTAGA
ATTCCCCTGC TCTATAGGTT TTTAAGATGG ATAGGCGTTA GAGGGCCTGT GATCGATTGG
GTCTCAAGAG TAGTAATTGA CGTAATAAGG AAGGAGGGGC TAGTTAAGTA CGAGAGAAAA
CCAGCCCATT AG
 
Protein sequence
MDYYAIVHND FDGTASASVY ARAVNSLPRN IWFTEPTKLH EVLAKLELRG VSSVMIADLG 
INESTFPSIV EAVKRLRSEG ATIQWFDHHV WKEEWKSKLK EVGVEVYHDV TTCGAGVVNK
VMNPNDEVSR RLASADCSVD IWLHDDPLGE KLRRIVENDR RFEWKKKLLE TFYGGTLWND
EFQKILETRI NEELKGYQRI WKYVKVLDVE GAKVVVAIRW KGPPDISYAS QFLMTRTGAD
IFVSANGKAV SFRSNTIDVR RFAAGLGGGG HPLAAGASLR IPLLYRFLRW IGVRGPVIDW
VSRVVIDVIR KEGLVKYERK PAH
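
The relationship between the two sequences can be checked with a small script. This is a minimal sketch, not the method used to produce the record: it assumes the gene sequence is given 5' to 3' in the coding orientation, and it builds the codon table from the standard NCBI amino acid string, whose assignments are the same for translation table 11 and table 1 (the two tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G with the
# third position varying fastest. Table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGGATTACT ACGCCATAGT GCATAACGAC"))  # MDYYAIVHND
# Last 12 bases, ending in the TAG stop codon.
print(translate("CCAGCCCATT AG"))  # PAH
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the Protein sequence and GC content fields of the record.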