Gene NSE_0475 details


Gene Information

Locus tag: NSE_0475
Symbol: pepA
ID: 3932102
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Neorickettsia sennetsu str. Miyayama
Kingdom: Bacteria
Replicon accession: NC_007798
Strand:
Start bp: 392294
End bp: 393814
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 42%
IMG OID: 637900631
Product: cytosol aminopeptidase
Protein accession: YP_506360
Protein GI: 88608537
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
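
The length fields above are consistent with the coordinates, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; neither convention is stated in this record. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


print(gene_length(392294, 393814))  # 1521
print(protein_length(1521))         # 506
```

Under these assumptions the results match the listed Gene Length (1521 bp) and Protein Length (506 aa).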


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00320613
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGTCT TTTTTTTGAA TCCAGTAGAG GCAATGCAAA GCGAGCCTAT TCTATTCAGT 
GGACTGTATG AAAATTCTGA GTTCTTTGGA AGGATTTCCA TTTTGGATCA ACAAAGCTCT
TCGTTTATCT CCAAAAGCAT TGCTTCGACT TCGTTTACTG GAAAAGTTGG GGAAAACGTT
TGTATTTTTC TCAGTGACCT TGTCGAAGGG TATCCAAAAC TGCAACAACT GTGTCTCATT
GGGCTTGGAA AACAAAAAGA TTTTGATGCT CATACCGCAG AGAAAACAGG AGCACAGATT
GCTTCACTCA TGAAGAAGAA TAATGTTTCT TCCGCGACAG TACTTTTAGA TGGTTTAAAA
GATGACTATG CCTTGGATCT GGCTTTTGGG GCAAAGCTAA AAGACTATTC GTTCGATAAA
TATAAAACAA AAAAGACCGA TGATAAGAGC ATAACGCTAG AGCGGCTTGT AATTGGAATT
GAGAACTACG AGCACATCTA TTCGGTTTTT CGAGAGAAAG TCGAGCCTTT AATTGAATCA
ATGAAGTTAA CTAGGGATCT GATAAATGAA CCCGCAAATC ACCTTAATCC GGAAACTTAT
GCCAATGCTG TAAAGGAGAT TACCAAGACA GCTCCAAACC TAAAAGTCGA AATCCTTGAT
GAAAAGGGCA TGCAGAAATT AGGCATGAAC GCACTTCTAG GCGTTGGTCA GGGAAGCACT
TACCCTTCAA AATTAGTGGT GGTAAAATAT AACGGTGCAG TGGATAAAGA TGATCCCTAC
ATCGCTCTTG TTGGAAAAGG CGTTACTTTC GACAGCGGTG GTATTTCACT TAAGCCAGCA
CGTGGAATGT GGCATATGGT ATCCGATATG GCTGGCTCTG CAACAGTTTT GGGGGCAGTT
CATGCTATTG CAAAACAGGG TGTGAAAGCA AATGTTGCAG CCGTACTCGG GATCGTTGAA
AACGCAGTTT CAGGAGTCGC ACAACGTCCA GGGGACATAG TCAGGTCGGC TTCTGGGAAA
ACAATTGAAG TACTCAACAC TGACGCAGAA GGTAGGTTGG TACTCGCAGA TGCCTTGTGG
TATGCGCAAG AACACTTAAA AGCAAACCAA GTAATAGACG TAGCAACACT CACAGGAGCA
ATAGTTGTTG CTTTAGGACA CGATCATGCA GGATTGTTCT CTAATGATGA TGCATTAGCC
GAAAACCTTG CAAATGTGGG GAAAAAGGTT GGTGAGAAAC TCTGGAGAAT GCCTATGTCA
AAAAATTATG ATGATCTCAT AAATTCTGAA GTCGCAGATG TGAAGAACAT CTCTACAGAA
AATCATGGCG CAGACAGCAT AACAGCAGCT CAATTCTTGA AACGATTCAT AAACGACGGA
ACAAAATGGG CTCACCTCGA TATTGCCGGC GTTGCATGGA ATAACAGTAC TTCCCATTTT
TCCAGCATTG GAGCTTCTGG ATTTGGTGTA CGACTACTAA CGGAATTTAT ATCGGAATCA
GTTAAAGAAA CTTACAAGTA A
 
Protein sequence
MEVFFLNPVE AMQSEPILFS GLYENSEFFG RISILDQQSS SFISKSIAST SFTGKVGENV 
CIFLSDLVEG YPKLQQLCLI GLGKQKDFDA HTAEKTGAQI ASLMKKNNVS SATVLLDGLK
DDYALDLAFG AKLKDYSFDK YKTKKTDDKS ITLERLVIGI ENYEHIYSVF REKVEPLIES
MKLTRDLINE PANHLNPETY ANAVKEITKT APNLKVEILD EKGMQKLGMN ALLGVGQGST
YPSKLVVVKY NGAVDKDDPY IALVGKGVTF DSGGISLKPA RGMWHMVSDM AGSATVLGAV
HAIAKQGVKA NVAAVLGIVE NAVSGVAQRP GDIVRSASGK TIEVLNTDAE GRLVLADALW
YAQEHLKANQ VIDVATLTGA IVVALGHDHA GLFSNDDALA ENLANVGKKV GEKLWRMPMS
KNYDDLINSE VADVKNISTE NHGADSITAA QFLKRFINDG TKWAHLDIAG VAWNNSTSHF
SSIGASGFGV RLLTEFISES VKETYK
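
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. The helper names are hypothetical. The codon table is built from the standard NCBI codon ordering; translation table 11 assigns the same amino acids to codons as the standard table and differs in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Codon -> amino acid map in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop spaces and line breaks from the block layout and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned, non-empty sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three blocks of the gene sequence above.
fragment = clean("ATGGAAGTCT TTTTTTTGAA TCCAGTAGAG")
print(translate(fragment))  # MEVFFLNPVE
```

The fragment translates to the first ten residues of the protein sequence listed above. Passing the full gene sequence through `clean` and then `gc_percent` or `translate` gives values to compare against the GC content and Protein Length fields of the record.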