Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_2032 |
Symbol | |
ID | 8568689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 2362364 |
End bp | 2365444 |
Gene Length | 3081 bp |
Protein Length | 1026 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | Aminopeptidase N-like protein |
Protein accession | YP_003291301 |
Protein GI | 268317582 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACCCG CCGTACGGAT CGCATCGGTG CTCCTGCTCT GGCTGCTTTC TGCCGCTTTC GCCGCGGCCC AGACCTCGCC TGACTGGCAA CAGCGCGTGC GCTACGAAAT GGACATTCGC CTCGACGCCG CGCAGCATCG CATGCGCGGC CGTTCGCGCA TCGTCTACTA CAACCATTCG CCCGACACGC TCCGTCACGT CTTTTTTCAC CTGTACTTCA ACGCCTTTCA CCCGCAGTCC ATGATGGCCG AGCGCAACCG GCATCTGCCC GACCCCGACC GGCGGGTGGT TCCGAAAATC TGGCGACTCG GTCCCGACGA ACAGGGCTTT CACCGCATCA CCTACCTGGC GCAGGAAGGC AAGCCGCTCA CGTTTCGCAT CACCGACACG GTGCTGAAGG CCGAGCTGGT CCGTCCGCTG GCACCGGGCG ACTCGACCGT CTTCGAGATC CGCTTTCACT CGCAGGTGCC GCTCCAGACA CGTCGAAGCG GCCGCGACAA TGCCGAAGGG ATCGACTTTT CGATGAGCCA GTGGTATCCG AAGATCGCCG CCTACGACGG CCGCGGCTGG CATCCCGATC CGTACATCGG CCGGGAGTTC TACGGCGTCT TCGGCACGTT CGACGTACGC ATCACGCTGC CCGCCTGCTA CACGATCGGG GCCACGGGTG TATTGCTCAA TGCGGACGAA GTCGGGCATG GCTACGACCG CATCAGTAGC CGGGGATGGA CCCGCTTCGA TCCACGCGAA GCGGGCCTGC ACTGCACACC CGGCGACTCG CTCACCTGGC ACTTCCGCGC CGAGCACGTG CACGACTTCG CCTGGGCGGC CGATCCGGAC TACATCCACG AAGCCTGGCA CGACGACAGC CTGGGCGTCA CCTATCACCT GCTCTTTCAG CCCGACGTAG CCGAACGCTG GCAACCCATG CGCCAATGGG TGCCCTGGCT CATTCGCTAC TTCAGCCGAC GCATCGGGCG CTACCCCTAC CCGCAGTTCA CCGTCGCGCA GGCGGGCGAC GGCGGCATGG AATACCCGAT GATCAACTTC ATCACGGGAC GTCGCAGTCC GTTTTCGCTG CTGGGCGTGA CGGCCCACGA AGCGGCGCAC GAGTGGTTCT ACGGCGTGCT GGCCTCCAAC GAGAACGCCT ACGCCTGGAT GGACGAAGGC TTCACGAGCT GGGCCACCAC CGAGGCCGTC GCGCACCTGC TGGGTCAGGC GCCCGACCAT CGGGACGCCG CGCTCAGCGT CGTGCGCCTG CAACAGATGG GCCTGTTCGA GCGCCTGAAC ACGCCCGCCG ACTGGTTCGC TTCCAACGTG GCCTACAGCG TGGCCGCCTA CCCCGGCGGC GAAATGCTGC TCGATCTGCT GGGCTATGTG ATCTCCGATT CGCTGCGCGA CGTCTTCCTG CAGGTCTACT TCCGCACTTA CGGCCTGCGC CATCCCAATC CTTACGACGT CGAAAAAGTC GCCGAGCAGG TCAGCGGCCT GCGCCTGGAC TGGTTCTTCG AGCAGCTGAC CAACAGCACC TACACCTGCG ACGACGCGCT GGAAGTCGCC TCACAGGAGC GCACGCCGGA AGGCTGGCGC GTAACGATCC GGCTTCATCG CCGGGGATCC ATGTTTCTGC CCGTCGATCT CCGGCTGACG CTGGCCGACG GGAGCACGCA GTGGGTGCAT GTGCCGCTCG GCCTGGCCCA GGGGCACAAG CCGGTACCGC CCGACTGGAT CGTGGCCGAG CCGTGGCTGT GGACGTTCCC CCGCTACACG CTAACGCTCA CGCTACCGGC CCGCGTCGTG CGGGCCGAGC TGGACCCGCT TCAGCGTACG CCCGATCACA ACCGCCTGAA CAACACCTGG CCGTTTCCGC TGCAGCTTTC CTTCCTGCAG CCGCTTTCGT TCGATCCGGC CGCCTACCGG GCCACGTGGC GCCCGCTGGC CGCTTACGCC TACGACTTCG GTCCCGGCAT CGGACTGCAA CTGCGCGGCC GCTACTTCTT CGACCGCCAC GAAATGCTGC TGACGCTGAA GCTCTGGCCC GAAGTGCTGC TGAGCAACGG CCGACGACCG GAGCGCCCCT GGGTGCGCGA CCGCAACGCC TGGTGGGCGG GCATCGACTA CACGCTGCGC TACAGCGATC GGCTGGCCCC CCGCACCCGC TGGCACCTGC AGCTCGAAAA GCACCTCGGC GTACTGGAAA ACCGCATCGG CCTCACGCAC ACGCTGGGCC GCTGGGCGGC TCTGGGCCAC GACTACGGAA CGGTCACGCT GGAGCTGACC CACCAGTACA CGCCGGGCTT TCGCACTTTC TCATTCGAAA ACGTCCCGGT GTGGACACCG GGCATGCATC TGGTGTGGAC CGGGCTGCGT TATCGCATCG AACGTCCCTT CGGCCAGCTG CATGCGCTGC TGGAAACCGG TGAAGGTGAA AACGGCAAAG GAGCAACCCG AGGAGTGCTC GACCTGCGCT GGCATTTCCT GCGCCGTCCC ACCTTGCGGG CGACACTGCA TGGCCAGGTA GGCTGGGGCG ATGGCCAGCT TCCCTTCAGC CGCGTCTTCC GGCTGGGAAG TGCCCCGGTC GAAGCGGCCT GGCGCCATGC GGGCTTCCGG AGCGTGGCCG CGCTGTTTGC CGACGCCCGC AAGTCGCTCC ACCTGATCCC GCTTGACGCT CCAGGTCCCG TGGCCTACTG GAACCCTGAT GGGCGCGACC CGCTGCGCGT GCAGGGCAAT GTGCTGGTAG CCGCCAGTCT GCAACTGCAA TGGACGCCTT TCCGGGCGCG TCTGCTGCGC CCGCTCCAGA TGGAAGGCTT CTTCGGAATC GGCCAGACCT GGTTCACCGA GCCCACACTG GCCAGCCGCT ACACGTTCGG CTGGGATCGT TTTCTGGCCG ACGCAGGCGT GGGCCTCGGC TATAACGTAA GCGACCTGCC CGGCCTGCGG CGCTGGACGG CCCTGTCGGA ACTGTTGCAG GACCTGCGGC TGCGCCTGCG GGTGCCGCTG TGGGTCAGCG ATCCGGACCT GATCGGCGAA CGCGACGCGC TGCGGTTCCG CTGGATGCTG GGGATTGTGG TCGGGCCTTG A
|
Protein sequence | MPPAVRIASV LLLWLLSAAF AAAQTSPDWQ QRVRYEMDIR LDAAQHRMRG RSRIVYYNHS PDTLRHVFFH LYFNAFHPQS MMAERNRHLP DPDRRVVPKI WRLGPDEQGF HRITYLAQEG KPLTFRITDT VLKAELVRPL APGDSTVFEI RFHSQVPLQT RRSGRDNAEG IDFSMSQWYP KIAAYDGRGW HPDPYIGREF YGVFGTFDVR ITLPACYTIG ATGVLLNADE VGHGYDRISS RGWTRFDPRE AGLHCTPGDS LTWHFRAEHV HDFAWAADPD YIHEAWHDDS LGVTYHLLFQ PDVAERWQPM RQWVPWLIRY FSRRIGRYPY PQFTVAQAGD GGMEYPMINF ITGRRSPFSL LGVTAHEAAH EWFYGVLASN ENAYAWMDEG FTSWATTEAV AHLLGQAPDH RDAALSVVRL QQMGLFERLN TPADWFASNV AYSVAAYPGG EMLLDLLGYV ISDSLRDVFL QVYFRTYGLR HPNPYDVEKV AEQVSGLRLD WFFEQLTNST YTCDDALEVA SQERTPEGWR VTIRLHRRGS MFLPVDLRLT LADGSTQWVH VPLGLAQGHK PVPPDWIVAE PWLWTFPRYT LTLTLPARVV RAELDPLQRT PDHNRLNNTW PFPLQLSFLQ PLSFDPAAYR ATWRPLAAYA YDFGPGIGLQ LRGRYFFDRH EMLLTLKLWP EVLLSNGRRP ERPWVRDRNA WWAGIDYTLR YSDRLAPRTR WHLQLEKHLG VLENRIGLTH TLGRWAALGH DYGTVTLELT HQYTPGFRTF SFENVPVWTP GMHLVWTGLR YRIERPFGQL HALLETGEGE NGKGATRGVL DLRWHFLRRP TLRATLHGQV GWGDGQLPFS RVFRLGSAPV EAAWRHAGFR SVAALFADAR KSLHLIPLDA PGPVAYWNPD GRDPLRVQGN VLVAASLQLQ WTPFRARLLR PLQMEGFFGI GQTWFTEPTL ASRYTFGWDR FLADAGVGLG YNVSDLPGLR RWTALSELLQ DLRLRLRVPL WVSDPDLIGE RDALRFRWML GIVVGP
|
| |