Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | WD1098 |
Symbol | engA |
ID | 2737895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Wolbachia endosymbiont of Drosophila melanogaster |
Kingdom | Bacteria |
Replicon accession | NC_002978 |
Strand | + |
Start bp | 1054135 |
End bp | 1055460 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637173251 |
Product | GTP-binding protein EngA |
Protein accession | NP_966819 |
Protein GI | 42520904 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR03594] ribosome-associated GTPase EngA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAAA TCGCCATAGT AGGCCTACCA AATGCTGGAA AATCGACCCT ATTCAACAGA TTAGTAGGAA GAAAAGCAGC AGTAGTGAGT AACATTCCAG GAGTGACAAG GGATAGGCGA GAGGGAATGG GGAGAATTAG CGATTTGGAG TTTAAAGTTA TAGATACAGG AGGATGGAAC GACCAAACTA ATTTTTCACT ACAAGTTATT GAACAAATCG AATTTTCTTT ATCCAATTCA AACATAATTT TCTTTCTTGT TGATGCAAAA GTGCAAAATG AGCGAAATGA AGAGTTTGCA AAGTGGCTGA AGAGAAAAAT AAATAAACCT GTAATACTAG TAGCAAATAA ATGCGAGAGT CATAAATCCG AAAATGTTGA TTATTTGCAG TTTTTTGATT TTCTTGGCCC GGTGTATATC TCTGCCGAGC ATAATCTTGG CATGGTTGAT CTTTATGATG CATTAGCTGG TGTTATTGAA AATTTTAACG AGAACACTGA GTTACCTAAC AATGAACTGA GTAGGCTGAG GATCGCGATC ATCGGCCGTC CGAATGTTGG AAAATCAACT TTTTTAAATG GTTTACTTGC AGAAAACAGA CTAATAACAA GTTCAGAGCC AGGTACCACA CGTGATTCTG TGGATATTAC ATATGATCAT GATGGAGAGT TAATTACTCT AATTGACACT GCCGGAATCC GCAGAAAGGC GAATGTTGTA GATGGCTTAG AATCAAGATT TGTTGAGAAA AGCATGGAAT CAATCAAGCG CTCTCATGTG GTAGTTTTAA TGCTAGACTC CCTAGTTGGT ATTGAGCAGC AGGATTTATC AATTGGTGAA GCTGCAATTA AGGGAGGGAA AGGGATTATT GTTGTTTTAA ATAAGTGGGA TTTAATAGGT AAGGATGACA GAAGCAGGTT AATAAAATTT GTCAAACAAC AGGAAGTAAC CAGGTTATTC TTGGAAGTGC CAACTATAAC AATTTCTGCG TTAAAAGGCA TGCGCTGCGG TGATGTGATA GATAAGTGTC TTGAAGTGAG TGAATCCTTG AACAAGAAGA TCAGCACTGC AAAACTGAAT AAGTGGCTCA TAGATGCTGT AGGAAAACAT TCTCACCCTC TTGTAAAAGG CAAAGCAGTT AAAATGAAGT ATATCGCTCA AATTGGTACT AAACCTCCAG CTTTTTCTTT GATATGTAAC ATACCTGAAA GTGTTGATGA AAGTTACAAA CGCTATTTAA TTAACGATCT CAGAAAAAAT TTCTTTGCAG ACGGTGTGCC AGTTAGATTG CTTTTGAAAA AAAATAAAAA TCCCTATGTA AAATGA
|
Protein sequence | MLKIAIVGLP NAGKSTLFNR LVGRKAAVVS NIPGVTRDRR EGMGRISDLE FKVIDTGGWN DQTNFSLQVI EQIEFSLSNS NIIFFLVDAK VQNERNEEFA KWLKRKINKP VILVANKCES HKSENVDYLQ FFDFLGPVYI SAEHNLGMVD LYDALAGVIE NFNENTELPN NELSRLRIAI IGRPNVGKST FLNGLLAENR LITSSEPGTT RDSVDITYDH DGELITLIDT AGIRRKANVV DGLESRFVEK SMESIKRSHV VVLMLDSLVG IEQQDLSIGE AAIKGGKGII VVLNKWDLIG KDDRSRLIKF VKQQEVTRLF LEVPTITISA LKGMRCGDVI DKCLEVSESL NKKISTAKLN KWLIDAVGKH SHPLVKGKAV KMKYIAQIGT KPPAFSLICN IPESVDESYK RYLINDLRKN FFADGVPVRL LLKKNKNPYV K
|
| |