Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | WD0761 |
Symbol | |
ID | 2738194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Wolbachia endosymbiont of Drosophila melanogaster |
Kingdom | Bacteria |
Replicon accession | NC_002978 |
Strand | + |
Start bp | 734420 |
End bp | 735760 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637172937 |
Product | insulinase family protease |
Protein accession | NP_966517 |
Protein GI | 42520602 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.126087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCATA AGTTTTTTAG TTTTCTTATA CTACCTGTAC TTTTTTTTAC TTACCCAATT TCTTCGAATG CTTTAAACAT AGAGCACAAC ATAAAATATA CTAAACTCAG TAACGGACTA GATGTTTATG TTGTTTCTAA TCATCGGATT CCAGCCGTTT TACATGCAGT AATATACAAA GTTGGGGGAA TGGATGATCC AATCGGCAAA GCAGGATTGG CTCACTACTT TGAACATTTA ATGTTTGAAA CTACAGGGAA GTTCACAGAT ATAGAAGCCA CTATGAGTAG CATTGGGGCT CAATTTAACG CATTTACCAC TAAAGAATAT ACCTGTTATT TTGAGTTGAT CCCCAAAAAA GATTTACCAC TTGCAATGGA AGTTGAAGCA GATAGAATGG GCAGCTTTAA TGTTACTCAA GACAAAATAG ATAGAGAAAA AAACATCGTA TTAGAAGAAA GAAAGATGAG ATTTGATAAC CAACCTCATA ACTTGTTATG GGAAGAAATG GATAGTGCAT TTTACCGTAC TGGCTATGGT AGGTCTGTTA TCGGCTGGGA GAGCGATATC AAAACTTACA ATTTGGATGA CATAACGAGG TTTCATGATA ACTATTATCA CTCCGGCAAT GCAATATTAC TGATTGTGGG TGATGTGGAA CTTGATGAAG TAGTGAAATT AGCAGAAGAA AAATATGGCG AAATTAAAGC TAAGCCTGTA ATGAGAAATT ACCCAAATCA GGATCCAGTG CATAATGCAG GTCTATCGGT GACCTTAGAG AGCACTGAAG TAAAAGAATC AGTTTTATAT TTTCGTTATC GTGTTCCCTT ATTTGATCAC ATAAATGAAG CCTCTGCTGC TCATTTAGCG GTTGACATTT TAGGTGGCGG CAAATCCAGC AAACTGTATA AAGATTTGGT TTTAGATAAA GATGTGGCAG TATCAGTGTT TGCTTATTAT AATAGCCTAG CTTTTAGTGA TGGCTACATT GATATTATGA TAATTCCAAA AAGTGGCGCT AATTTAGATA TTGTTGAAAG AGAGTTAGAT AATGCTATTA ATGGCTTCGT ATCTAAAGGG GTGACGAGCG AGGAATTACA AAGCTCAAAA TATAGGTATA AGGCAGCACA GTTTGATAAT CTATCTGACT TAACTCATAT AGCGATGTTC TATGTGCCAC ACCTTGCACT TGGTATTCCA CTTGATGAAA TAGATATTTC ATATAGCAAG ATTAATGATG TCAATCTAGA GGATGTAAAT AATAAAATCC GTGCTATTTT TTCTGCTAAT AAGTTAATTG GTCGTTTATT ACCAAAAGGA GATAATAATG AGAATAAGTA A
|
Protein sequence | MLHKFFSFLI LPVLFFTYPI SSNALNIEHN IKYTKLSNGL DVYVVSNHRI PAVLHAVIYK VGGMDDPIGK AGLAHYFEHL MFETTGKFTD IEATMSSIGA QFNAFTTKEY TCYFELIPKK DLPLAMEVEA DRMGSFNVTQ DKIDREKNIV LEERKMRFDN QPHNLLWEEM DSAFYRTGYG RSVIGWESDI KTYNLDDITR FHDNYYHSGN AILLIVGDVE LDEVVKLAEE KYGEIKAKPV MRNYPNQDPV HNAGLSVTLE STEVKESVLY FRYRVPLFDH INEASAAHLA VDILGGGKSS KLYKDLVLDK DVAVSVFAYY NSLAFSDGYI DIMIIPKSGA NLDIVERELD NAINGFVSKG VTSEELQSSK YRYKAAQFDN LSDLTHIAMF YVPHLALGIP LDEIDISYSK INDVNLEDVN NKIRAIFSAN KLIGRLLPKG DNNENK
|
| |