Gene Information |
Locus tag | NSE_0475 |
Symbol | pepA |
ID | 3932102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Neorickettsia sennetsu str. Miyayama |
Kingdom | Bacteria |
Replicon accession | NC_007798 |
Strand | - |
Start bp | 392294 |
End bp | 393814 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637900631 |
Product | cytosol aminopeptidase |
Protein accession | YP_506360 |
Protein GI | 88608537 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
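The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 392294
END_BP = 393814
GENE_LENGTH_BP = 1521
PROTEIN_LENGTH_AA = 506


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1521
print(expected_protein_length(GENE_LENGTH_BP))  # 506
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.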
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00320613 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGTCT TTTTTTTGAA TCCAGTAGAG GCAATGCAAA GCGAGCCTAT TCTATTCAGT GGACTGTATG AAAATTCTGA GTTCTTTGGA AGGATTTCCA TTTTGGATCA ACAAAGCTCT TCGTTTATCT CCAAAAGCAT TGCTTCGACT TCGTTTACTG GAAAAGTTGG GGAAAACGTT TGTATTTTTC TCAGTGACCT TGTCGAAGGG TATCCAAAAC TGCAACAACT GTGTCTCATT GGGCTTGGAA AACAAAAAGA TTTTGATGCT CATACCGCAG AGAAAACAGG AGCACAGATT GCTTCACTCA TGAAGAAGAA TAATGTTTCT TCCGCGACAG TACTTTTAGA TGGTTTAAAA GATGACTATG CCTTGGATCT GGCTTTTGGG GCAAAGCTAA AAGACTATTC GTTCGATAAA TATAAAACAA AAAAGACCGA TGATAAGAGC ATAACGCTAG AGCGGCTTGT AATTGGAATT GAGAACTACG AGCACATCTA TTCGGTTTTT CGAGAGAAAG TCGAGCCTTT AATTGAATCA ATGAAGTTAA CTAGGGATCT GATAAATGAA CCCGCAAATC ACCTTAATCC GGAAACTTAT GCCAATGCTG TAAAGGAGAT TACCAAGACA GCTCCAAACC TAAAAGTCGA AATCCTTGAT GAAAAGGGCA TGCAGAAATT AGGCATGAAC GCACTTCTAG GCGTTGGTCA GGGAAGCACT TACCCTTCAA AATTAGTGGT GGTAAAATAT AACGGTGCAG TGGATAAAGA TGATCCCTAC ATCGCTCTTG TTGGAAAAGG CGTTACTTTC GACAGCGGTG GTATTTCACT TAAGCCAGCA CGTGGAATGT GGCATATGGT ATCCGATATG GCTGGCTCTG CAACAGTTTT GGGGGCAGTT CATGCTATTG CAAAACAGGG TGTGAAAGCA AATGTTGCAG CCGTACTCGG GATCGTTGAA AACGCAGTTT CAGGAGTCGC ACAACGTCCA GGGGACATAG TCAGGTCGGC TTCTGGGAAA ACAATTGAAG TACTCAACAC TGACGCAGAA GGTAGGTTGG TACTCGCAGA TGCCTTGTGG TATGCGCAAG AACACTTAAA AGCAAACCAA GTAATAGACG TAGCAACACT CACAGGAGCA ATAGTTGTTG CTTTAGGACA CGATCATGCA GGATTGTTCT CTAATGATGA TGCATTAGCC GAAAACCTTG CAAATGTGGG GAAAAAGGTT GGTGAGAAAC TCTGGAGAAT GCCTATGTCA AAAAATTATG ATGATCTCAT AAATTCTGAA GTCGCAGATG TGAAGAACAT CTCTACAGAA AATCATGGCG CAGACAGCAT AACAGCAGCT CAATTCTTGA AACGATTCAT AAACGACGGA ACAAAATGGG CTCACCTCGA TATTGCCGGC GTTGCATGGA ATAACAGTAC TTCCCATTTT TCCAGCATTG GAGCTTCTGG ATTTGGTGTA CGACTACTAA CGGAATTTAT ATCGGAATCA GTTAAAGAAA CTTACAAGTA A
|
Protein sequence | MEVFFLNPVE AMQSEPILFS GLYENSEFFG RISILDQQSS SFISKSIAST SFTGKVGENV CIFLSDLVEG YPKLQQLCLI GLGKQKDFDA HTAEKTGAQI ASLMKKNNVS SATVLLDGLK DDYALDLAFG AKLKDYSFDK YKTKKTDDKS ITLERLVIGI ENYEHIYSVF REKVEPLIES MKLTRDLINE PANHLNPETY ANAVKEITKT APNLKVEILD EKGMQKLGMN ALLGVGQGST YPSKLVVVKY NGAVDKDDPY IALVGKGVTF DSGGISLKPA RGMWHMVSDM AGSATVLGAV HAIAKQGVKA NVAAVLGIVE NAVSGVAQRP GDIVRSASGK TIEVLNTDAE GRLVLADALW YAQEHLKANQ VIDVATLTGA IVVALGHDHA GLFSNDDALA ENLANVGKKV GEKLWRMPMS KNYDDLINSE VADVKNISTE NHGADSITAA QFLKRFINDG TKWAHLDIAG VAWNNSTSHF SSIGASGFGV RLLTEFISES VKETYK
|
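The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch of how such a listing could be processed, the code below joins the blocks, computes a GC fraction, and translates codons using the amino acid assignments of NCBI translation table 11 (the table named in the record). It assumes the gene sequence is given in coding orientation, which is consistent with the listing starting with ATG and ending with TAA even though the gene lies on the minus strand. All function names are hypothetical, and alternative start codons are not handled. Only the first three blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGAAGTCT TTTTTTTGAA TCCAGTAGAG"
dna = join_blocks(first_blocks)
print(translate(dna))  # MEVFFLNPVE
```

The translated fragment matches the first block of the protein sequence in the record. Applying `gc_fraction` to the full joined gene sequence would give a value to compare against the GC content field.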