Gene WD0761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD0761 
Symbol 
ID2738194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp734420 
End bp735760 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content35% 
IMG OID637172937 
Productinsulinase family protease 
Protein accessionNP_966517 
Protein GI42520602 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.126087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCATA AGTTTTTTAG TTTTCTTATA CTACCTGTAC TTTTTTTTAC TTACCCAATT 
TCTTCGAATG CTTTAAACAT AGAGCACAAC ATAAAATATA CTAAACTCAG TAACGGACTA
GATGTTTATG TTGTTTCTAA TCATCGGATT CCAGCCGTTT TACATGCAGT AATATACAAA
GTTGGGGGAA TGGATGATCC AATCGGCAAA GCAGGATTGG CTCACTACTT TGAACATTTA
ATGTTTGAAA CTACAGGGAA GTTCACAGAT ATAGAAGCCA CTATGAGTAG CATTGGGGCT
CAATTTAACG CATTTACCAC TAAAGAATAT ACCTGTTATT TTGAGTTGAT CCCCAAAAAA
GATTTACCAC TTGCAATGGA AGTTGAAGCA GATAGAATGG GCAGCTTTAA TGTTACTCAA
GACAAAATAG ATAGAGAAAA AAACATCGTA TTAGAAGAAA GAAAGATGAG ATTTGATAAC
CAACCTCATA ACTTGTTATG GGAAGAAATG GATAGTGCAT TTTACCGTAC TGGCTATGGT
AGGTCTGTTA TCGGCTGGGA GAGCGATATC AAAACTTACA ATTTGGATGA CATAACGAGG
TTTCATGATA ACTATTATCA CTCCGGCAAT GCAATATTAC TGATTGTGGG TGATGTGGAA
CTTGATGAAG TAGTGAAATT AGCAGAAGAA AAATATGGCG AAATTAAAGC TAAGCCTGTA
ATGAGAAATT ACCCAAATCA GGATCCAGTG CATAATGCAG GTCTATCGGT GACCTTAGAG
AGCACTGAAG TAAAAGAATC AGTTTTATAT TTTCGTTATC GTGTTCCCTT ATTTGATCAC
ATAAATGAAG CCTCTGCTGC TCATTTAGCG GTTGACATTT TAGGTGGCGG CAAATCCAGC
AAACTGTATA AAGATTTGGT TTTAGATAAA GATGTGGCAG TATCAGTGTT TGCTTATTAT
AATAGCCTAG CTTTTAGTGA TGGCTACATT GATATTATGA TAATTCCAAA AAGTGGCGCT
AATTTAGATA TTGTTGAAAG AGAGTTAGAT AATGCTATTA ATGGCTTCGT ATCTAAAGGG
GTGACGAGCG AGGAATTACA AAGCTCAAAA TATAGGTATA AGGCAGCACA GTTTGATAAT
CTATCTGACT TAACTCATAT AGCGATGTTC TATGTGCCAC ACCTTGCACT TGGTATTCCA
CTTGATGAAA TAGATATTTC ATATAGCAAG ATTAATGATG TCAATCTAGA GGATGTAAAT
AATAAAATCC GTGCTATTTT TTCTGCTAAT AAGTTAATTG GTCGTTTATT ACCAAAAGGA
GATAATAATG AGAATAAGTA A
 
Protein sequence
MLHKFFSFLI LPVLFFTYPI SSNALNIEHN IKYTKLSNGL DVYVVSNHRI PAVLHAVIYK 
VGGMDDPIGK AGLAHYFEHL MFETTGKFTD IEATMSSIGA QFNAFTTKEY TCYFELIPKK
DLPLAMEVEA DRMGSFNVTQ DKIDREKNIV LEERKMRFDN QPHNLLWEEM DSAFYRTGYG
RSVIGWESDI KTYNLDDITR FHDNYYHSGN AILLIVGDVE LDEVVKLAEE KYGEIKAKPV
MRNYPNQDPV HNAGLSVTLE STEVKESVLY FRYRVPLFDH INEASAAHLA VDILGGGKSS
KLYKDLVLDK DVAVSVFAYY NSLAFSDGYI DIMIIPKSGA NLDIVERELD NAINGFVSKG
VTSEELQSSK YRYKAAQFDN LSDLTHIAMF YVPHLALGIP LDEIDISYSK INDVNLEDVN
NKIRAIFSAN KLIGRLLPKG DNNENK