Gene Information |
Locus tag | Msed_0086 |
Symbol | |
ID | 5104664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 76329 |
End bp | 77633 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640505985 |
Product | aminopeptidase |
Protein accession | YP_001190187 |
Protein GI | 146302871 |
COG category | [R] General function prediction only |
COG ID | [COG4882] Predicted aminopeptidase, Iap family |
TIGRFAM ID | |
|
|
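The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(76329, 77633)  # Start bp and End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1305 bp) and Protein Length (434 aa) fields.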
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0111442 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000071727 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCATTG ATCATGGGTT AGCAGGCGAT ATAAACGAGG GAAGAAGGAT AGGGAGACTA AGAGAAATCT TAGAAGACGT CAGCGATGAA ATTAAGGTCT ACAGGGAGAG GATATTAACG TGGGAGGTTA ACCATTCTCT GGTAACCATA AACGGTGAAA GGGTAGAACA TCAGGCACTA CCATACTCGC CTACCTCGTA TATCAGAGGA CACGTAGTGG AGAATCCTGA AGACTGTAGG AAAGACACAG TACTTGTAAA TAGGGGTAAC TCGAGGTTCG ATGTTTATTA CGACATTATT CTGGCCAAGA GGTTTGGTTG TGAAGCCTTA CTATTAACAC AAGTCGAGAC TGTCTCGATT TCGCCTCCTT ACCTACAGTC TGGAGCCTCG GAACCCCCAC TGGGTCTGAT ATCGATAAAA GGTGCAAATC CTAAGAAAGG GAATCTGGTG GAGATAAATC TCGAAACCAA GGCAAGGATC GTAGACGCTT ACGCTATCCA TGCGATAAAA AATGGAAGAG AAAGGTCCAA AAAACTCCTT CTGGTCTCCA ATCACGATAC ATGGCTTAGC AAAAGTAAAA GCTGGGTAGT GTCAGCGAAA ATTTTCCGAG AGCTAAATGA CCCTAGTAAC CAATGGGAAT ACCTTTGTAT ATCTGGAATG GAGTCTGGAG CACCTGGATT TTCATCACTT TATTGGGGAT ACATGGCTAG GCAACTTAGT AAGAAATTTA CAGACAGGGA TCTGGCAATA GAGGTTAGAG ATAACCTAGC TTTAGAGCTG TCACCTGGAA TGAGATATCA AGACATGAAG GGTGATCTGA TTTCCCCAAT GTCAGCGTCT TTCGAGTTAT TGAGGAATGG GATACCATCT GTGACCCTGG GAATAGGAGA TACTTCCACA GATCAAACTG ATGTAATAAA GACTCTGAAG GCTCTCTCTT CTAATTTCAA ATTCTCTCTA GAGGATCTAA TAAATGATCT CCTTGACGAG TATTCCCTGT TACCCCCTGA GGTGAAATCC CTACTCACCA ACCTCAGCGG AAAACCAAGA GAGGCTAAAT ACCTTGTCAG ATACCTAGGA AGGTACATGG GGTTGCCGGG AAGGATTGAA TATGCACTTT TTCATAAGTT GGTGGCTGTA AGGAAATCCT TCAAACACAG GTTCATGTTC GCAGAGGACA ATCCAGCGCT AGGAATAGAG GTAACGAGAA ATGGGTTTCT CGCAAGCTAT AGATCAGGAT TAGATAGGGA AATCACAAAC TCTTACATTT ACAGGTTACA CGAAGATTTA CAGGAACTCC TCTAA
|
Protein sequence | MSIDHGLAGD INEGRRIGRL REILEDVSDE IKVYRERILT WEVNHSLVTI NGERVEHQAL PYSPTSYIRG HVVENPEDCR KDTVLVNRGN SRFDVYYDII LAKRFGCEAL LLTQVETVSI SPPYLQSGAS EPPLGLISIK GANPKKGNLV EINLETKARI VDAYAIHAIK NGRERSKKLL LVSNHDTWLS KSKSWVVSAK IFRELNDPSN QWEYLCISGM ESGAPGFSSL YWGYMARQLS KKFTDRDLAI EVRDNLALEL SPGMRYQDMK GDLISPMSAS FELLRNGIPS VTLGIGDTST DQTDVIKTLK ALSSNFKFSL EDLINDLLDE YSLLPPEVKS LLTNLSGKPR EAKYLVRYLG RYMGLPGRIE YALFHKLVAV RKSFKHRFMF AEDNPALGIE VTRNGFLASY RSGLDREITN SYIYRLHEDL QELL
|
| |
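The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used by the source database: it uses the standard codon assignments in NCBI ordering (which translation table 11 shares for elongation codons), it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here), and it raises `KeyError` on any character other than A, C, G, or T. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCATTG ATCATGGGTT AGCAGGCGAT"))  # MSIDHGLAGD
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) would extend the same check to the whole record.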