Gene WD1035 details

Gene Information

Locus tag: WD1035
Symbol: glyA
ID: 2738754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Wolbachia endosymbiont of Drosophila melanogaster
Kingdom: Bacteria
Replicon accession: NC_002978
Strand:
Start bp: 995981
End bp: 997258
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 40%
IMG OID: 637173190
Product: serine hydroxymethyltransferase
Protein accession: NP_966759
Protein GI: 42520844
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0112] Glycine/serine hydroxymethyltransferase
TIGRFAM ID:
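
The reported lengths can be cross-checked against the coordinates above. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the listed values are consistent with that) and that the coding sequence ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids encoded, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(995981, 997258)
print(length)                  # 1278
print(protein_length(length))  # 425
```

Both results agree with the Gene Length and Protein Length fields in this record.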


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAGTG TTTTAAAAAA AATCTGTGGC TCTAAAAATA GTTTAAAGTC TTTTGATAAC 
GAGGTTTATC AGTCTATAGA AAAAGAATTA CAACGCCAAA AATCACAATT GCAATTAATT
GCATCAGAAA ATTTTGCAAG CAAAGCGGTA ATGGAGGCAC AAGGCTCTTT TCTGACTAAT
AAATACGCAG AAGGTTATCC AGGTAAAAGA TATTACTGTG GTTGTGAGCA TGTGGACAAA
ATTGAAAGTC TGGCTATAGA AAGACTTTGT AAGTTGTTTG GTGTTAAATT TGCAAACGTT
CAACCTCATT CTGGTTCTCA GGCAAACCAG GCGGTATTTG CTTCACTGCT TACTCCAGGC
GATACAATAC TTGGATTATC ACTGAGTTGC GGTGGGCATC TAACTCATGG TGCGGCACCA
AGCCTTTCTG GTAAATGGTT TAAGTCAATT CAATATACAG TGAATAAAGA CACTTATCTG
CTCAATATGG ATGAGATAGA AAAGCTGGCG CTGGAGCATA AACCGAAATT GATCATAGCT
GGTGCTTCTG CTTATCCAAG AAAAATGGAC TTCAAACGCT TTCGCGAGAT TGCAGATAAA
GTTGGTGCTT ATTTGCTTGC AGACATTGCT CACTATGCAG GGCTTATTGC AGCGGGCGAA
TATCCATCCC CTGCTGAATA TGCACATGTT ATGACTTCCA CGACTCACAA AACTTTGCGT
GGTCCTCGTG GTGGAATAGT GATGACCAAT GATGAAGCAT TACACAAAAA AATTCAATCC
GCAGTTTTTC CAGGATTGCA GGGCGGGCCA CTTATGCATG TGATAGCTGC AAAAGCTGTT
GCATTTAAAG AAGCATTAGC ACCAGAGTTT AAGACTTATA GCAAGAAAGT CGTGGAAAAT
GCGAAAGTGC TGGCTCAAGA ATTGCAAAAG CATGGACTTG ACATTATAAC CGGTGGCACT
GACTCTCATA TAGTGCTAGT TGACTTAAGA TCGCAGAAAT TAACTGGAAA AGACGTTGTA
GATAGCCTTG AGAGAGCCGG TATTACCTGT AATAAAAACT CTGTACCATT TGATACAGCA
AAGCCGACCA TCACTTCAGG GCTCCGTTTT GGCACCGCTG CTGAGACAAC ACGCGGACTT
GAGGCAGAAA ATTTTAAAGA GATAGCTGGT CTAATAAATG AAGTAATTCA AGGATTAATC
AGCGGAAATA GCTCAAGTGT CGAAAAAGCA GTAAAAGCTA AAGTTGAAAG GATTTGTAGT
AATTTTCCTA TTTATTAA
 
Protein sequence
MMSVLKKICG SKNSLKSFDN EVYQSIEKEL QRQKSQLQLI ASENFASKAV MEAQGSFLTN 
KYAEGYPGKR YYCGCEHVDK IESLAIERLC KLFGVKFANV QPHSGSQANQ AVFASLLTPG
DTILGLSLSC GGHLTHGAAP SLSGKWFKSI QYTVNKDTYL LNMDEIEKLA LEHKPKLIIA
GASAYPRKMD FKRFREIADK VGAYLLADIA HYAGLIAAGE YPSPAEYAHV MTSTTHKTLR
GPRGGIVMTN DEALHKKIQS AVFPGLQGGP LMHVIAAKAV AFKEALAPEF KTYSKKVVEN
AKVLAQELQK HGLDIITGGT DSHIVLVDLR SQKLTGKDVV DSLERAGITC NKNSVPFDTA
KPTITSGLRF GTAAETTRGL EAENFKEIAG LINEVIQGLI SGNSSSVEKA VKAKVERICS
NFPIY
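
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal Python sketch, not the database's own procedure. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last lines of the gene sequence are used as input.

```python
# Codon table built in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGATGAGTG TTTTAAAAAA AATCTGTGGC"))  # MMSVLKKICG
# Final 18 bases of the gene sequence (in frame, ends with the TAA stop codon)
print(translate("AATTTTCCTA TTTATTAA"))               # NFPIY
```

The two outputs match the first ten and the last five residues of the protein sequence listed above.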