Gene Msed_1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1010 
Symbol 
ID5105609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp929560 
End bp931158 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content51% 
IMG OID640506909 
Productphosphoesterase 
Protein accessionYP_001191102 
Protein GI146303786 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3511] Phospholipase C 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00457541 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGGCTC TAACTTGGAT CGGAATAGTT CTCACGCTTC TCTCGGCTCT CTCTTTTCTT 
TCAATCGTGA GCTCTGGAGC GACTCAAGCC ACAGCCACAC CAATAAAACA TGTCATATTC
ATAGAACTTG AGAACCACGC TTTCGACAGC ATTTACGGCA CTTATCCCTT TGGATACCCC
GTGATCGTGA ACAATATCAC CATGTCAGTC ATGAGGCCCG TGAATTACAT TTACAACTTA
TCCCTCCTAA ACACCCTGTC TCAGTCCCAC GGAAACGTAA CCTGGATCTC AGTTCCCGCC
GGAAAGGGCT ACCTTCACCC CTATTACGCC AATTCAACCG TCCTAGTAAA TCCCAAGGAA
GGTTACACCA ACTATCATGA GGACTGGAAT TGGGGGCAAA TGAACGGCTT CGTGAACGGA
TCTGGCCCTC AGTCACTGGC TTACGTCTCA TATGAACAGG TACCGCTCCT CTGGGATTAC
GCCGAGGAAT ACGTACTCTT TGATAACTAC TTCTCTCCCA CCCTATCCGT GACTGTACCC
AACAGGATTG CATACATTAC AGGTTTTCCC ACTCAGGTTG AGAGCGACGC CCCTCAATTT
GGGTTAATAC CTCTTAACGA GTCTATCCTT TACCAGCTCA CGGAGAACAA TGTAAGTTGG
GGCTGGTACG AATACGGTTA CTCCAAGGAC TTTCAGATAC TATCCCCTGA TCTTTACCTT
GGATACAACA ACACGGCACC CCTGCCCGTT AGCCTCTTGA AGGGAGCGAA TCAGTGGAAC
TCGCACTATC ACGACCTTTC AGACTTTCTG GCTGAGGCTA GAAACGGGTC TCTTCCATCA
GTCTCATACG TCATGTTCAC GGGTCCCATG GGGTATGACG ATCACGTGCC CGGTTACGAT
ATGCATCCTC CCTACAATAC CACACTCGCT ATGCTCATGC TCTCCACAGT GATCAACGCC
GTGATGACGG GGCCAGACTG GAACTCCACT GTGATTTTCA TCACCTTCGA CGAAGGCGGA
GGATACTACG ATCCAGTCCC TCCACCAATA GTTAATGGGT TCGGTCTCGC CAATACTCCA
ACAATATCCA AGATATTACC GGGTTACTTC ACCCTAGGGC AGAGGATCCC GCTCCTTATG
GTTTCGCCCT ACTCCAAGGA GGGATTCGTG GACAACTACA CCGCTTCGGG CTACTCAATC
CTTGCCTTCA TTGACTACAA CTGGCATCTT CCCTACCTGA ACCCCATAGT GAAGGAGTTC
GGACCAGAGT CAATCCTTTA CGGGCTTAAC TTCACTGCTC CAAGGCCTCC CCTGGTCCTG
ACCCCTGAGA ACTGGAGTTA TCCGGTTCCC CTACAGTATC CAATTCACTA CGGCTACGTG
GCAACCATTA ACAATAACTA CAGCATCTAC AACGCGATCT ACCACGATAA GCAGATGGGC
AACTACACGC CCCCGCAGTA CTTCCTTGAG GGCAACGTGG TGCAAGGCGG GGTTCAGGAA
GCCACGGGCT CCTCGGCTGG TTTCCCAACC CTCCTCCTGT GGATTCCAGT CCTCCTCATC
ATCATAGCCG TGGGAGTCCT CCTGGAGAGG CGTAAGTGA
 
Protein sequence
MKALTWIGIV LTLLSALSFL SIVSSGATQA TATPIKHVIF IELENHAFDS IYGTYPFGYP 
VIVNNITMSV MRPVNYIYNL SLLNTLSQSH GNVTWISVPA GKGYLHPYYA NSTVLVNPKE
GYTNYHEDWN WGQMNGFVNG SGPQSLAYVS YEQVPLLWDY AEEYVLFDNY FSPTLSVTVP
NRIAYITGFP TQVESDAPQF GLIPLNESIL YQLTENNVSW GWYEYGYSKD FQILSPDLYL
GYNNTAPLPV SLLKGANQWN SHYHDLSDFL AEARNGSLPS VSYVMFTGPM GYDDHVPGYD
MHPPYNTTLA MLMLSTVINA VMTGPDWNST VIFITFDEGG GYYDPVPPPI VNGFGLANTP
TISKILPGYF TLGQRIPLLM VSPYSKEGFV DNYTASGYSI LAFIDYNWHL PYLNPIVKEF
GPESILYGLN FTAPRPPLVL TPENWSYPVP LQYPIHYGYV ATINNNYSIY NAIYHDKQMG
NYTPPQYFLE GNVVQGGVQE ATGSSAGFPT LLLWIPVLLI IIAVGVLLER RK