Gene Msed_0086 details


Gene Information

Locus tag: Msed_0086
Symbol:
ID: 5104664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 76329
End bp: 77633
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 43%
IMG OID: 640505985
Product: aminopeptidase
Protein accession: YP_001190187
Protein GI: 146302871
COG category: [R] General function prediction only
COG ID: [COG4882] Predicted aminopeptidase, Iap family
TIGRFAM ID:
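
The coordinates and lengths in this table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes a single terminal stop codon. Neither convention is stated in the record, although both are consistent with the numbers given. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 76329
END_BP = 77633
PROTEIN_LENGTH_AA = 434


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1305
print(expected_protein_length(length_bp))  # 434
```

Under these assumptions the computed values agree with the Gene Length (1305 bp) and Protein Length (434 aa) fields.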


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0111442
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000071727
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCATTG ATCATGGGTT AGCAGGCGAT ATAAACGAGG GAAGAAGGAT AGGGAGACTA 
AGAGAAATCT TAGAAGACGT CAGCGATGAA ATTAAGGTCT ACAGGGAGAG GATATTAACG
TGGGAGGTTA ACCATTCTCT GGTAACCATA AACGGTGAAA GGGTAGAACA TCAGGCACTA
CCATACTCGC CTACCTCGTA TATCAGAGGA CACGTAGTGG AGAATCCTGA AGACTGTAGG
AAAGACACAG TACTTGTAAA TAGGGGTAAC TCGAGGTTCG ATGTTTATTA CGACATTATT
CTGGCCAAGA GGTTTGGTTG TGAAGCCTTA CTATTAACAC AAGTCGAGAC TGTCTCGATT
TCGCCTCCTT ACCTACAGTC TGGAGCCTCG GAACCCCCAC TGGGTCTGAT ATCGATAAAA
GGTGCAAATC CTAAGAAAGG GAATCTGGTG GAGATAAATC TCGAAACCAA GGCAAGGATC
GTAGACGCTT ACGCTATCCA TGCGATAAAA AATGGAAGAG AAAGGTCCAA AAAACTCCTT
CTGGTCTCCA ATCACGATAC ATGGCTTAGC AAAAGTAAAA GCTGGGTAGT GTCAGCGAAA
ATTTTCCGAG AGCTAAATGA CCCTAGTAAC CAATGGGAAT ACCTTTGTAT ATCTGGAATG
GAGTCTGGAG CACCTGGATT TTCATCACTT TATTGGGGAT ACATGGCTAG GCAACTTAGT
AAGAAATTTA CAGACAGGGA TCTGGCAATA GAGGTTAGAG ATAACCTAGC TTTAGAGCTG
TCACCTGGAA TGAGATATCA AGACATGAAG GGTGATCTGA TTTCCCCAAT GTCAGCGTCT
TTCGAGTTAT TGAGGAATGG GATACCATCT GTGACCCTGG GAATAGGAGA TACTTCCACA
GATCAAACTG ATGTAATAAA GACTCTGAAG GCTCTCTCTT CTAATTTCAA ATTCTCTCTA
GAGGATCTAA TAAATGATCT CCTTGACGAG TATTCCCTGT TACCCCCTGA GGTGAAATCC
CTACTCACCA ACCTCAGCGG AAAACCAAGA GAGGCTAAAT ACCTTGTCAG ATACCTAGGA
AGGTACATGG GGTTGCCGGG AAGGATTGAA TATGCACTTT TTCATAAGTT GGTGGCTGTA
AGGAAATCCT TCAAACACAG GTTCATGTTC GCAGAGGACA ATCCAGCGCT AGGAATAGAG
GTAACGAGAA ATGGGTTTCT CGCAAGCTAT AGATCAGGAT TAGATAGGGA AATCACAAAC
TCTTACATTT ACAGGTTACA CGAAGATTTA CAGGAACTCC TCTAA
 
Protein sequence
MSIDHGLAGD INEGRRIGRL REILEDVSDE IKVYRERILT WEVNHSLVTI NGERVEHQAL 
PYSPTSYIRG HVVENPEDCR KDTVLVNRGN SRFDVYYDII LAKRFGCEAL LLTQVETVSI
SPPYLQSGAS EPPLGLISIK GANPKKGNLV EINLETKARI VDAYAIHAIK NGRERSKKLL
LVSNHDTWLS KSKSWVVSAK IFRELNDPSN QWEYLCISGM ESGAPGFSSL YWGYMARQLS
KKFTDRDLAI EVRDNLALEL SPGMRYQDMK GDLISPMSAS FELLRNGIPS VTLGIGDTST
DQTDVIKTLK ALSSNFKFSL EDLINDLLDE YSLLPPEVKS LLTNLSGKPR EAKYLVRYLG
RYMGLPGRIE YALFHKLVAV RKSFKHRFMF AEDNPALGIE VTRNGFLASY RSGLDREITN
SYIYRLHEDL QELL
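
The gene and protein sequences above are related by translation, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which codons may serve as start codons, which this sketch does not model), and it strips the spaces and line breaks used in the sequence blocks. All function names are hypothetical.

```python
import re

# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove whitespace from a blocked sequence and upper-case it."""
    return re.sub(r"\s+", "", seq).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in the record.
prefix = "ATGAGCATTG ATCATGGGTT AGCAGGCGAT"
print(translate(prefix))  # MSIDHGLAGD
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and the GC content field. Ambiguous bases such as N are not handled and would raise a `KeyError` in `translate`.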