Gene WD1151 details

Gene Information

Locus tag: WD1151
Symbol: gltA
ID: 2738866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Wolbachia endosymbiont of Drosophila melanogaster
Kingdom: Bacteria
Replicon accession: NC_002978
Strand:
Start bp: 1100282
End bp: 1101532
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 35%
IMG OID: 637173300
Product: citrate synthase
Protein accession: NP_966867
Protein GI: 42520952
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID: [TIGR01798] citrate synthase I (hexameric type)
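
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not appear in the protein. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1100282
end_bp = 1101532
gene_length_bp = 1251
protein_length_aa = 416

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be a stop
# and is therefore not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1251 0 416
```

Under these assumptions the reported gene length matches the coordinate span, and the reported protein length matches the codon count minus the stop codon.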


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00353381
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAAAA AAGTGCTATT AGAACTAAGT AATGGATTAA AAATCGAGCT ACCTGTATTA 
AGCGGAACAA CAGGTCCTGA TGTGTTAAAT ATCAAGGATT TATATAAGAT GACAGGATTA
TTCACTTACG ATCCAGGGTT TGTTTCTACA GCTTCATGCT CTTCTACAAT TACATTCATT
GATGGAGATG AAGGAGTTCT TAAATATAGG GGACATAATA TAGCTGATTT GGCAGAGAAT
AATAATTTTA CTGCTGTGAT TTATTTATTG CTCTATGGTG AATTACCCAG TTCAGAGCAA
CACAAAAAAT TTCTTCTCAA AATACAAGAA TCATCCAAAG TATCAGAGCA AGTTACAAAT
GTAATTAAAG CATTTCCAAA AACTGCTCAC CCTATGTCAA TCTTAGTTGC ATGTTTTGCA
AGTTTGTCAG CATCTTATCA TGAAAAGCAT GGCAACAATG TCAATGGTGA AGACCTAGAT
TTTGGAATTT CTGCAATAGC GCAAGTTTCC ACAATTATTG CAATGATTTA TAGGCATATC
AACAATCAGG AATTCATAAA TGCTAACAAT GAATTAAGTT ACAGTGAAAA TTTCTTAAAG
ATGATATTTG GCGATGCTGT TGATAATGAT AAAAGCGCCC TTTTTGCAAA AGCTCTGGAT
AAAATATTTA CTCTCCATGC TGATCATGAA CAGAATGCTT CTACGGCGGC TGTCAGATTG
GTGGGATCGG CTGGTTCTAA TCTGTTTGCA AGCCTCTCTG CAGGAGTTGC TACACTTTGG
GGACCAGCAC ATGGTGGAGC TAATGAGGCA GTTATAAATA TGCTAAAAGA GATAGAGCAA
AGTGGAGATA TAGATAAATT CATCGAAAAA GCTAAAGATG ATAAAGATCC ATTTAAATTG
ATGGGATTTG GACATCGTGT TTATAAAAAT TATGATCCGC GCGCGCGTAT ATTGGAAGGC
GCTTGCCATG AGGTTCTAAG TAAACTAGAA CAAAATAATG AACTGCTTAA AATTGCAAAA
AAACTTGAAG AAATAGCTTT AAAGGATGAA TATTTTATCG TGCGTAAGTT ATATCCAAAT
GTTGATTTTT ACTCAGGTAT AATAATGAAT GCTATTGATA TCCCCTCAAA TATGTTCACG
CCTATTTTTG CACTCGCAAG AACCACTGGT TGGGTTACTC AGTGGTATGA AATGATAAAT
GATAAAGAAA CTAAGATCTG TAGACCAAGG CAACTCTATT TTGGTAAATA A
 
Protein sequence
MDKKVLLELS NGLKIELPVL SGTTGPDVLN IKDLYKMTGL FTYDPGFVST ASCSSTITFI 
DGDEGVLKYR GHNIADLAEN NNFTAVIYLL LYGELPSSEQ HKKFLLKIQE SSKVSEQVTN
VIKAFPKTAH PMSILVACFA SLSASYHEKH GNNVNGEDLD FGISAIAQVS TIIAMIYRHI
NNQEFINANN ELSYSENFLK MIFGDAVDND KSALFAKALD KIFTLHADHE QNASTAAVRL
VGSAGSNLFA SLSAGVATLW GPAHGGANEA VINMLKEIEQ SGDIDKFIEK AKDDKDPFKL
MGFGHRVYKN YDPRARILEG ACHEVLSKLE QNNELLKIAK KLEEIALKDE YFIVRKLYPN
VDFYSGIIMN AIDIPSNMFT PIFALARTTG WVTQWYEMIN DKETKICRPR QLYFGK
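
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example and not the method used to produce this record. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which start codons it permits), and it does not handle alternative start codons or ambiguous bases. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGATAAAA AAGTGCTATT AGAACTAAGT"))  # MDKKVLLELS
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function would be one way to compare the whole translation against the listed protein.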