Gene Msed_0469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0469 
Symbol 
ID5105465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp423966 
End bp425825 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content50% 
IMG OID640506375 
Producthypothetical protein 
Protein accessionYP_001190570 
Protein GI146303254 
COG category[C] Energy production and conversion 
COG ID[COG0348] Polyferredoxin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAAGT TAGAGTACAA GGTTACTGGA AAGGTTAGAA ATTATGAAAG GAGATTCAAT 
TTTTATGTGG CACTCTCAAC CGTGGGAACA GGAGCATTTA CTGGAATTGC GGTATTACTT
AAGCAGGTTT TGATGATAGA GACCGGTATA TTACTTTTCA CTACGGCTCT CACAATCCTG
GCCGTTAATC TTGTGCTGGA TCTAACTGTG AAGTCCCATT CCAATACCTG GGTCTTTGCT
TCTCCTCCTA GAGAGATAGT GAAGAAAGCT GACAGAGTGG GAAAGGAGAT CTGTGAACAT
CGCCCTTCCC TATTAGAGGG GAATAACCCA GTGTCCAGGT TGGTTTCTAA GCTGTTTAAG
AAGAGCTGGG CACACTTCGC AATCATTCTT CCAAGTTTCA TAATATTCTA CGTAGTCATG
GTTGTGGGTT TAGTGGGGTA TCAGAAGCTT GGTCCTGCAG GGATATCCTT GGTGAATTTT
GCCTCAGACA TTAGCTGGTT GTTCTGGTTT CCCTTACTCT GGTTGTTGAC CTGGCTAGCT
AACGGCAGAG CGTGGTGCCA GACCTGCCCC TTTAGCGGTC AGGCAGAGTG GGTCCACAGA
TTGCATCCCT GGAAAAAGAT GAGCAAGAAG CTGGGGCTAA ACCTCAGGTG GCCCATAAAG
TACAGCACCA TCTTCTATTC TGCTGTGGGC TTCTCGGTCC TAACCTGGAT GGAGGAGTTT
TACGGAATTG GAGGTCCTGG AATTCCGGAA CTGACCTCAG TGGTGTTGAT ATACATTGGT
GCCCTGGAGC TCTTCATTAG CCTGCTCTTC CAGGACAGGA CCTTCTGCAG GACAATTTGT
CCCCTAAGTG CCCCTTTGGC CATAACCACG ACAATCTCTC CACTTGGAAC CTTCAGGGCG
AAGAATCCTG AGGTATGTAA GTCCTGTACC ACTAAGGATT GCATGAAGGG GAACGATAAG
TTCCACGGTT GTCCTTGGTT CGCCTCCCCA GGAAGTAAGG AGAACTCACC CATGTGCGGG
CTAGCCTCGG ACTGCTACAA GGCATGCCCC CACGACAACA TTGACTGGCA GGTTAAGAGG
TTCCCATGGT TGAGCGATCT CGCCGGAGGC AAGAAGAGGT TTGATATAGC CCTCTCAGTG
ACTCTCCTAA CTGGGGTTGT CCTCTTTCAG TTCCTTAATG CACTGCCCTT CTACTCCATG
GTGGATACCT GGCTGAGCAA AGTGACAGGA TGGGTTAATT TCGCTCAGCT ACTAGTTCCT
GGACTGTCCA AGTTTGGCTA TTCGACTCAT GGCTATCCAA ACCCCCTGGA TTACTTTGCC
ATCAACATGA TACCTATCCT TGTGGTCCTG GCTGCAGCCA AGTTTGAGGA AAGGAGGGGA
GTACCCCTGA AGTGGGGATT CACGTCTATA TCTTATGCGC TGATACCCAT CTTCGCTGCA
TCGTTACTGG TGAGAAACCT ACCCAAGTTT CTAGGCGGAT CTCCGTTGAT CCTCAACGAG
ATCCTTGACC CCACTGGGGC CGGTATGCAC AACAGTGAGA TCTACTCAAC CTTCTGGGGA
AGCCTGCTTC ACTCCCTGGG TCACGATCCG CTCAACGCCA CGGCTGCCTG GTGGGTTCTC
CTTGTGATGG AAGCTGTAAT GGCCTTCGGC ATCTACCTAG GATTGAGGGC TTCAAACATG
CTGGCTGAGA CTGACGGAGT GGGCAAGTGG ACATACTATG CCGTAGTTCT GGGGTTTGGG
CTAACCTTCA TGCTAGTGAC CTACTGGATG TCTTCCCCTG CCTCTCCCAC AGCGCCCTTC
TACAACCAGT ATCTCGGAAA CCTACTCTAC AACCCACTTC AGGCTACTCC GCCGTTCTGA
 
Protein sequence
MEKLEYKVTG KVRNYERRFN FYVALSTVGT GAFTGIAVLL KQVLMIETGI LLFTTALTIL 
AVNLVLDLTV KSHSNTWVFA SPPREIVKKA DRVGKEICEH RPSLLEGNNP VSRLVSKLFK
KSWAHFAIIL PSFIIFYVVM VVGLVGYQKL GPAGISLVNF ASDISWLFWF PLLWLLTWLA
NGRAWCQTCP FSGQAEWVHR LHPWKKMSKK LGLNLRWPIK YSTIFYSAVG FSVLTWMEEF
YGIGGPGIPE LTSVVLIYIG ALELFISLLF QDRTFCRTIC PLSAPLAITT TISPLGTFRA
KNPEVCKSCT TKDCMKGNDK FHGCPWFASP GSKENSPMCG LASDCYKACP HDNIDWQVKR
FPWLSDLAGG KKRFDIALSV TLLTGVVLFQ FLNALPFYSM VDTWLSKVTG WVNFAQLLVP
GLSKFGYSTH GYPNPLDYFA INMIPILVVL AAAKFEERRG VPLKWGFTSI SYALIPIFAA
SLLVRNLPKF LGGSPLILNE ILDPTGAGMH NSEIYSTFWG SLLHSLGHDP LNATAAWWVL
LVMEAVMAFG IYLGLRASNM LAETDGVGKW TYYAVVLGFG LTFMLVTYWM SSPASPTAPF
YNQYLGNLLY NPLQATPPF