Gene WD1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD1008 
Symbol 
ID2737737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp966880 
End bp968547 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content35% 
IMG OID637173164 
Productaminopeptidase P 
Protein accessionNP_966734 
Protein GI42520819 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAAAA TCAAAGAATT TCGTTCTTTT ATGCACGAAA TAAACGTTGA TGCATTTGTG 
TTACATACTA AAGACGAATA TTTAAATGAG TATTCAGAGG AGCTAACAAA GCTGTGTGGC
TTCACAGGAA CAAATGGGCT GCTTATTGTC ACAAAAAACA ACAAGTGCCA ATTTTTTACA
GATGGACGCT ATATCACACA AGCTCACAAT CAGCTCGATC AGGGCAATTT TCAAGTATAT
AATATACAAG AAGAGGATCC ACGCGAATGG ATAAAAGCAA ACTTAACATC GACTGCCTCA
CTAGGTTATT ATTTGCAATA TTTTACCATG GAAGATATAA GAAAGTATGA GAATATCTGT
AAATTAATAC CCTGTTTAGC TGGAAAAAAA AGTGACTATC GAAAACAAGC AGTGGTTTTA
CATTCTATTA AATATGCTGG TGAAAGTAGT AAGGACAAAT GTGAAAAAGT CGCTAAAAGT
ATAGATAAAG AAGCTGAAGC AGTGCTTTTA ACTGATCCAA ATTCAATTTC ATGGTTATTA
AATTTAAGAA ACGAAAATGC TAAATATACT CCATGTATAT TGGGTCGTGC TATATTGTAT
AAAAGCGGTA ATGTTGATTT GTTTATTCAA GATAAAGAAC ATTCAACTAT AGAAGCAAAT
TTAGGCAATC ATATAAATAT TTTTGATATC AGTGAGCTAG AAAATTCGCT GCACAAGCTA
AATTCAATAG TTATAGATCC AAACACAACT CCAATGAGTA TCATGGCTGT AATAAAAGAT
AAACAGGTAG CTGAAAGAGA GGATCCTTGT TTAATTTATA AAGCAGTAAA AAATCAAACT
GAAATAGCTG GGGCTATAAA TGCGCACATC AGAGATGGAG TGGCAGTTAC AAATTTTCTA
CATTGGCTTG AAAGTAATGT TGGTACAGAG CTTGAAGCTG AAGAAAGGAT TTTAGAATAC
AGAAAAGAGC AGAATTTGTT TAAACAATTG AGCTTTCCAA CAATTTCTGC ATTTAATGAG
AATGGGGCAA TAATTCACTA TCGTGCAAGC AGTAAGACGA ATAAAGTAAT TCAGAAAGAT
GGACTGTATT TGATTGACTC TGGTGGCCAG TACCTTGACG GCACAACTGA TGTGACAAGA
ACTGTAGTAG TTGGTAATCC GACCAATGAG CAAATAACCC ACTATACAAT AGTACTCAAA
GCTCACATTG CTATAGCAAG TGTCGTCTTT CCCCCCGGCA CTACTGGTGG AGAATTGGAT
ATATTGGCAC GTACGCATTT ATGGAAATTT GGAATGGACT ATATGCATGG TACAGGGCAT
GGAGTAGGAA GTTACCTATC AGTACACGAA GGACCACAAG CAATATCAAA AAGTAATAAA
GTGAAACTCA CGCCAGGGAT GATACTTTCC AACGAACCTG GCTATTACAT TCCGGGAGAG
TATGGAATAA GGATTGAAAA TCTGATGTAT GTCAACAGAC AAGAAAACGG CTTCTTAAAC
TTTAAACAAC TGACCTCTAT TCCATATGAT AGAAGACTAA TAAATGTGCA AATGCTTACT
AAGGATGAAA TTGAATGGAT AAATGGCTAC CATCAATTTA TCTATAAAAA CTTAGAAAAT
AGCGTCAAAG ATAAGGAGTG GTTAAAGAAA GTATGTGACC CTTTATAA
 
Protein sequence
MSKIKEFRSF MHEINVDAFV LHTKDEYLNE YSEELTKLCG FTGTNGLLIV TKNNKCQFFT 
DGRYITQAHN QLDQGNFQVY NIQEEDPREW IKANLTSTAS LGYYLQYFTM EDIRKYENIC
KLIPCLAGKK SDYRKQAVVL HSIKYAGESS KDKCEKVAKS IDKEAEAVLL TDPNSISWLL
NLRNENAKYT PCILGRAILY KSGNVDLFIQ DKEHSTIEAN LGNHINIFDI SELENSLHKL
NSIVIDPNTT PMSIMAVIKD KQVAEREDPC LIYKAVKNQT EIAGAINAHI RDGVAVTNFL
HWLESNVGTE LEAEERILEY RKEQNLFKQL SFPTISAFNE NGAIIHYRAS SKTNKVIQKD
GLYLIDSGGQ YLDGTTDVTR TVVVGNPTNE QITHYTIVLK AHIAIASVVF PPGTTGGELD
ILARTHLWKF GMDYMHGTGH GVGSYLSVHE GPQAISKSNK VKLTPGMILS NEPGYYIPGE
YGIRIENLMY VNRQENGFLN FKQLTSIPYD RRLINVQMLT KDEIEWINGY HQFIYKNLEN
SVKDKEWLKK VCDPL