Gene Msed_0522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0522 
Symbol 
ID5103682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp478226 
End bp479185 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content51% 
IMG OID640506426 
Producthypothetical protein 
Protein accessionYP_001190621 
Protein GI146303305 
COG category[S] Function unknown 
COG ID[COG1814] Uncharacterized membrane protein 
TIGRFAM ID[TIGR00267] conserved hypothetical protein TIGR00267 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.655797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.060241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAGT TAGTTAACAG GAATTATAGG GCAGAGCTGT TCGGTGAGGA ACTTTACTCT 
GCTCTTGCCA GGGACGAAAA GTCCGAGAAG GTCAGGGAGG TACTACAGGA ACTCTCTGAG
GGAGAGGGAA GACATGCTAA GTTCTGGGAG AGTATAGCCA GAAGCAGAGG GATTAAGCTG
AGGGGATTGG GATTTCTGGA CAAATTGAAG CTCTGGATCT TGCGAAGACT CAGGAAAGTT
CTGGGTCTAG CCCTAGTCCT AAAGATGGTT GAGGCGGGCG AGGAGAACGA TGCTGAGAAA
TATTACCGCT TCTCCACCTC GCAAGAGTTT ACGGACACGG AGAGACAGGG CTTCAGGGAC
ATTATGATGC AGGAAATGGT CCACGAAGAT ATGCTCATAC AGACGCAGGT GAACGTGGAT
ACTGTTAGGG ATACCATCTA CGCTATTAGC GATGGCCTAA TCGAGGTACT AGCTTCGGTG
TCAGGACTTG CAGGTATATT CTCGGTTCCC CTTTATGTAG CACTGGGAGG ACTCATCGTA
GGAGTATCGG GAATGATATC CATGAGCATT GGGGCTTATC TTTCGGCGAA ATCTGAGGAG
GACATAAGGA ACAATGCCCT CAGGAAGGCT AGGCTGAGGA GTCTTCTAGA AGGGGAGAAA
CATGAGGAGG ACCAGGAGGT GTCCAGAACA GGGGAGAGTG TGAGAACCAC TGCGATCTCC
TATATCATGG GAGCGATAAT TCCAATCCTA CCCTTTCTCC TGGGTTTAGG CGGTCTAGTT
GGCTTGGTTA CCTCATATGC CGTGACTGGT GTCGCCACTT TCATAGTGGG TGCGCTCATA
GGTGTACTCA GCGATGTGAG CCCCTGGAGA AAGGGGGCGG TAATGACTGG GTTGGCGCTG
GGAGCAGCAC TAGTGACTCA CGTTCTGGGA CTATTGGCAC ATCTAGCGGG TTTTTCCTAA
 
Protein sequence
MEELVNRNYR AELFGEELYS ALARDEKSEK VREVLQELSE GEGRHAKFWE SIARSRGIKL 
RGLGFLDKLK LWILRRLRKV LGLALVLKMV EAGEENDAEK YYRFSTSQEF TDTERQGFRD
IMMQEMVHED MLIQTQVNVD TVRDTIYAIS DGLIEVLASV SGLAGIFSVP LYVALGGLIV
GVSGMISMSI GAYLSAKSEE DIRNNALRKA RLRSLLEGEK HEEDQEVSRT GESVRTTAIS
YIMGAIIPIL PFLLGLGGLV GLVTSYAVTG VATFIVGALI GVLSDVSPWR KGAVMTGLAL
GAALVTHVLG LLAHLAGFS