Gene WD0652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD0652 
Symbol 
ID2738820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp639747 
End bp641027 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content33% 
IMG OID637172833 
ProductM48 family peptidase 
Protein accessionNP_966416 
Protein GI42520501 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.59081 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAGAA TTGCTAAGTT TTTGACCTTA TTGTTCTTTC TTGCATATTA TAATAATGCT 
TACTCTATTA ACATTATTAG AGATAGTGAG GTGGAAGCAA TAGTGAAAGA GCTAGCGCAA
CCTTTATTTT CTGCTGCAGA TATTGATCAT GATCAAGTGA AAGTTTTTGT AATTAATGAC
AGTTCGATTA ATGCTTTTGT AATTAACAAT AACAGTATCT TCATTCATTT AGGGCTTTTA
CGATATTCGG CTAAACCTTA TGTCTTGCTT GGTATATTAG CACATGAGAT TGCTCACATA
TCTGCTGGTC ATATATTGCA AATGAGTAGT GCTATGGGTT ATTTTCAATC GATAGCAATG
ATTAGTTATA TGGTAGGATT AGTTTCTAGT ATTATCATTA ACCCTCAGGT TGCTGGTGCA
ATTTTGCTTA GTGGTGTAGC ACTCAGTTCA AGGCTATTTT TTAACTATTC TCAAGAGCAA
GAAAGTGTAG CAGATAGCTA TGCTTTAAGG TACCTTGATG AATCTGGCTA TGATAATTCA
GGTATGAAAG AGATTTTTGA TTATTTTAAA AGTATTGAGC ATGAGAATAC CGAGGAATAT
TTCCGTACTC ACCCACTTAG TGAGAAACGT ATATTTGCTG TACAGAATTA TAAGGTCAAA
AACAACGTAA AACCAATTTT TGCGGATAAG TTACTAAAGT TTGAGCGTGT GGTTGCAAAG
CTAGACTCTT TCTTTGCTCC TATTCATGTG TTATCTAATA AATATGAAGA TAATTCTGAG
TATGTAAATG CTATAGTTTG CTATAGGCAA GGAAAGATAG AGGAGGCTAT TGCTAAAGTT
AATTCATTGA TTCAAGAGTC ACGCAATGAC CCATACCTAT ATGAATTAAA AGCAGAAATG
CTATACAAGG CTGGAAATTT AAGTGAAGCA ATAAAAATGT ATGAGGAATC GCTTAGGTAT
TTATCTGAGA AAAACAGTTA TTTAGTGAAA CTTGCATTAT CTCATACTTT ATTATTACAC
GGTGATGCAA AAAAGGCAAT TTTTTACCTG GAGCAGATTT TGAATGTAGA ACCAAATAAC
GCTTTTGTCT GGAAATATTT AAGCGTTGCA TATAAATGTG ACGCTGATAC GGCAATGCAT
TATTTTGCTT TGACAAAAAA GGCTTGTATT GAAGGTGATT TAAAGCAATT TACAAAATAT
GCTGAGCTAG CTGTTAAAAC TTTACCAAAA GATAGCCCTC ATTTGTTGCA AGTTGAAGAT
ATGAAGCGAT TTAATGGATA A
 
Protein sequence
MFRIAKFLTL LFFLAYYNNA YSINIIRDSE VEAIVKELAQ PLFSAADIDH DQVKVFVIND 
SSINAFVINN NSIFIHLGLL RYSAKPYVLL GILAHEIAHI SAGHILQMSS AMGYFQSIAM
ISYMVGLVSS IIINPQVAGA ILLSGVALSS RLFFNYSQEQ ESVADSYALR YLDESGYDNS
GMKEIFDYFK SIEHENTEEY FRTHPLSEKR IFAVQNYKVK NNVKPIFADK LLKFERVVAK
LDSFFAPIHV LSNKYEDNSE YVNAIVCYRQ GKIEEAIAKV NSLIQESRND PYLYELKAEM
LYKAGNLSEA IKMYEESLRY LSEKNSYLVK LALSHTLLLH GDAKKAIFYL EQILNVEPNN
AFVWKYLSVA YKCDADTAMH YFALTKKACI EGDLKQFTKY AELAVKTLPK DSPHLLQVED
MKRFNG