Gene WD0247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagWD0247 
SymbolprfA 
ID2738735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameWolbachia endosymbiont of Drosophila melanogaster 
KingdomBacteria 
Replicon accessionNC_002978 
Strand
Start bp231348 
End bp232427 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content36% 
IMG OID637172468 
Productpeptide chain release factor 1 
Protein accessionNP_966056 
Protein GI42520141 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0216] Protein chain release factor A 
TIGRFAM ID[TIGR00019] peptide chain release factor 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.736319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATGG AAAATAATTT ACAAGACTTA AAGAAAAAAT TTAATGATAT AGAGAGGAAT 
TTGGAAAATC CTACCAATTT AAGTCAAAAA GAATTCATTA CCCTTTCAAA GGAATACTCT
GAGCTCAGGC CAATAATCAA GATAATTGAT GAATATAACA CGCTGAAAGA GGAAATTTCA
GACTTAGAAG AAATAATGAA AGATGAAAAC AGCGAAGGTG ATATAAAAGA GTTAGCAAAA
GAAGAATTTT TCGAAAAGCA CAAGGTATTA TTACCAAAAA TAAAAGCAAA ATTAAAGTTA
GCATTATTGC CCAAAGATGA AGATGACTCA AGAAATGCAA TATTAGAAAT TAGAGCAGGT
ACAGGCGGAG AAGAAGCAGC ATTATTTGCA GCAATGTTAT TTCGTATGTA TCAAAAATAT
GCAGAAAGAA GAAATTGGAA GTTCGAGCCA ATAAGCATTT CTAATACAGG TATAGGTGGA
TATAAGGAAG CTTCTGCACT CATTAATGGA ACAGAAGTTT TTGCAAGGTT GAAATTTGAA
TCAGGAGTGC ACAGAGTACA GAGAGTGCCA GAAACTGAAT CTTCAGGAAG GTTGCATACC
TCTGCCGCTA CTGTTGCGAT ATTACCTGAA GTAGAAGAAG TTGACTTTAA AATAGAAGAA
AAAGACTTAC GAATAGATGT TTATAGATCC AGTGGTCCTG GAGGGCAATC AGTGAATACA
ACTGACAGCG CAGTAAGGGT CACCCACTTG CCAACAGGGA TAGTTGTGAT ACAGCAAGAT
GAAAAATCGC AGCATAAAAA TAAAGCTAAA GCGCTCAAAG TATTGAGGGC AAGGCTATAC
GAAATTGAAA GACAAAAAAA AGAAATGGAA AGGTCAACAA TGAGGAAAAG TCAGATTGGC
TCTGGTGATC GTTCCGAGCG CATAAGAACA TATAATTTCC CACAATCAAG AATAACAGAC
CACAGAATTA ATCTAACTTC ACATCGGCTA GAGCAGATTA TAAAAGAAGG CGAACTAGAT
GAATTTATTG AGGCATTAAT CTCACGTAAT GAAGCAGAAA GATTGGCAGG GGAAAGTTAG
 
Protein sequence
MDMENNLQDL KKKFNDIERN LENPTNLSQK EFITLSKEYS ELRPIIKIID EYNTLKEEIS 
DLEEIMKDEN SEGDIKELAK EEFFEKHKVL LPKIKAKLKL ALLPKDEDDS RNAILEIRAG
TGGEEAALFA AMLFRMYQKY AERRNWKFEP ISISNTGIGG YKEASALING TEVFARLKFE
SGVHRVQRVP ETESSGRLHT SAATVAILPE VEEVDFKIEE KDLRIDVYRS SGPGGQSVNT
TDSAVRVTHL PTGIVVIQQD EKSQHKNKAK ALKVLRARLY EIERQKKEME RSTMRKSQIG
SGDRSERIRT YNFPQSRITD HRINLTSHRL EQIIKEGELD EFIEALISRN EAERLAGES