Gene WD0054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD0054 
SymbolpepA 
ID2738311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp52849 
End bp54351 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content37% 
IMG OID637172291 
Productleucyl aminopeptidase 
Protein accessionNP_965884 
Protein GI42519969 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAGTT TACAATTATT TGCAACGGAG ATACCAGCAA TGAAAATAAC AATTTCTAAA 
ATTTCGCCCG ATTTTAAAAC AATAGTAATG GGTTTATTTG AAGACAACGA AACCGTAAAT
GATGGCGGAG TTTTGCAAGG AAAACAGGTC ATAGATAATA TAAAGCAATT TAGCGATTTC
AATGGAAGTT TTGGTGAATT TTTCTCTACT GCTTTGCCAG AAGAAAAAAA TGTTATAGTT
GTTGGACTGG GTAAGAAGGA TGAATGGAAT GAAAATAAAG AATTAAATAT TGGTGGTAAA
ATATATTGTG AGCTAAGCAG ATTAAAAATT AAAAAAGCAG CGGTTTTAAT CGAAGGTAGT
GCAGCAAATG TTGCATATGG TGCGTTTCTG CGTAGTTTTA AGTTTGATAA GTATAAAACT
AAAAAGGATG AGAAAATTAC AGAGGTAGAG GAGATTACCG TATTAGTAAA AGATGAACAA
TTAAGTAATG CTGAAAGATC ATTTGAGCAC TTAAGGCAAG AAGGTGAGAG TATATTCCTT
GCGCGCTCTT TTATAACAGA GCCTCCTAAC ATTCTATATC CAGAATCCTA TGCTGATCAT
ATAAAAAAAG AACTTACTAA GCTTGGCCTT GAAATCGAGG TGCTTGATAA AAAGCAGATG
GAAGAGAAAA AAATGGGAGC CTTGCTTGGA GTTGCACAAG GAAGTAGTAA AGAACCAAAA
TTAGTAGTGA TAAAATGGAA TGGAGCTTCT AAAGAACAAA AGCCTATAGC TTTTGTTGGT
AAAGGTATAA CGTTTGACAC TGGTGGAGTA TCACTCAAAC CTTCACGTGG TATGGAGTCA
ATGAAATATG ACATGGCAGG CTCTGCTACT GTGGTTGGGG TGATGCATGC TTTAGCAGGA
CGAAAAGCAA AAGTAAATGC GATCGGTGTG GTCGCACTTG CAGAAAATGC AGTGGGTGGT
AATGCTCAAA GACCAAGTGA TGTAGTAACT TCAATGTCTG GACAAACAAT AGAAGTGTTG
AACACCGATG CAGAAGGAAG GCTCATACTT GCAGATGCTT TATGGTATAC GCAAGATAGA
TTCTCACCAA AATTTATGAT TGATCTTGCA ACTTTAACTG GTGCTATAGT GGTTGCACTT
GGAAATAACG AATATGCTGG TCTTTTTTCA AATAATGATG AATTAGCAAA CCGTCTGATT
GATGCAGGAA ATGAAGTAAA TGAGAAGTTA TGGCGTTTTC CTATGAATGA AACTTATGAC
AAAATTATTG ATTCACCGAT TGCTGATGTT CAAAACATCG CTCCTGCAGG CTCTGGTGGA
GATAGCATAA TGGCTGCACA GTTTTTACAG CGTTTTGTGA ATGAAACTTG CTGGGCACAT
TTAGATATCG CAGGCACTGC TTGGCACGAG AAAGGTACTG ACATTTGTCC AAGAGGAGCA
GTAGGTTTTG GTGTAAGGTT ACTTAATAAG TTGGTTGAGA AGTACTACGA AGCCAATGAT
TAA
 
Protein sequence
MYSLQLFATE IPAMKITISK ISPDFKTIVM GLFEDNETVN DGGVLQGKQV IDNIKQFSDF 
NGSFGEFFST ALPEEKNVIV VGLGKKDEWN ENKELNIGGK IYCELSRLKI KKAAVLIEGS
AANVAYGAFL RSFKFDKYKT KKDEKITEVE EITVLVKDEQ LSNAERSFEH LRQEGESIFL
ARSFITEPPN ILYPESYADH IKKELTKLGL EIEVLDKKQM EEKKMGALLG VAQGSSKEPK
LVVIKWNGAS KEQKPIAFVG KGITFDTGGV SLKPSRGMES MKYDMAGSAT VVGVMHALAG
RKAKVNAIGV VALAENAVGG NAQRPSDVVT SMSGQTIEVL NTDAEGRLIL ADALWYTQDR
FSPKFMIDLA TLTGAIVVAL GNNEYAGLFS NNDELANRLI DAGNEVNEKL WRFPMNETYD
KIIDSPIADV QNIAPAGSGG DSIMAAQFLQ RFVNETCWAH LDIAGTAWHE KGTDICPRGA
VGFGVRLLNK LVEKYYEAND