Gene Gdia_1557 details

Gene Information

Locus tag: Gdia_1557
Symbol:
ID: 6974967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1735722
End bp: 1736861
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 70%
IMG OID: 643391088
Product: homocitrate synthase
Protein accession: YP_002275951
Protein GI: 209543722
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR02660] homocitrate synthase NifV
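
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1735722
end_bp = 1736861

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1140 bp) and protein length (379 aa).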


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.319892
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGAAA ACGCGCGTCT GCTGATCAGC GACACCACCC TGCGCGATGG AGAGCAGGCG 
CCGGGGGTGG CCTTTACCGC GTGGGAAAAG CTGGCCATCG CCGGCGCCCT CGACGCGGCC
GGCGTGGATG AGATCGAGGC CGGCGTTCCG GCCATGGGGG ATGCCGAGAT CGCCATGATC
GCGGCGATCG GCGACGAGGT GGAACGCGCC CGCGTCATCC CCTGGTGTCG CATGCGCGAC
GAAGACGTGC ACGCGGCGCG GCGGACGGGG CTGGGCAGCG TGCACCTGTC CGTTTCGACC
TCGGTCCGGC AGATCCAGGC GAAATACCGC ATGTCGCCGC GTCGCACGCT GGACATGGCG
CGCGACGTCG TCTCCCGCGC CAGGGATTAC GGCCTGGCGG TGTCGGTGGG GGGCGAGGAC
GCCAGCCGGG CCGATCCGGC CTATCTGGTC GATCTGCTGG GCGTGATCGC CGAGGCAGGG
GCGTTCCGCT TCCGCTTCGC CGATACGCTG GGCGTGATGG ACCCGTTCGG CGTGCACGAG
GTCATGCGGT ATCTCTGCCA GTCCAGCCCG CTTCAGCTTG AATTTCACGG CCATGACGAT
CTGGGCCTGG CAACGGCCAA TACGCTGGCC GCCGTGCGGG CCGGCGCGGC GTGCGCCAGT
GTCACCGTCC TGGGACTGGG CGAACGCGCG GGCAACGCCC CGCTGGAGGA AGTGGTGGCG
GGGGTATATC GCCTGCTGGG CCGTCCGGCG GGCGTGAAGC TGGACAGCCT GCCCGGGCTT
GCCACCCTGG TCTCGCGCGC CGCCGTGCGT GATATCCCGC CCGACAAGGC GATCGTCGGC
GACGCGGTCT TTCGTCATGA ATCGGGCATC CATGTCTCGG GCCTGCTGCG TGACGCCGCG
ACGTATGAGG CACTGGACCC CGTGCAGTTC GGCCGCCAGC GCGAAATCGT GCTGGGCAAG
CATTCCGGCC GGGCCGCGGT ACGGCACGCG CTGGCGGCGC TGGGCCTGGA TGCCGACGAA
ACCGTCATCG CGGCCACGCT TGCCGCCGTG CGCGCGCGTG CCTCCGCCGC CAAGCGCACG
GTGGCGCTGG CCGAACTGGC CGAAATGCAT GCGGGCCTGA TGGCAGGCGT GTCGAAATAA
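
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as whitespace-separated blocks of ten bases as shown above; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the gene is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence as displayed; paste the full block to check the whole gene.
first_row = "ATGTCGGAAA ACGCGCGTCT GCTGATCAGC GACACCACCC TGCGCGATGG AGAGCAGGCG"
seq = clean_sequence(first_row)
print(len(seq), round(100 * gc_fraction(seq)))
```

Running the same functions over all rows of the gene sequence gives a value to compare against the GC content field.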
 
Protein sequence
MSENARLLIS DTTLRDGEQA PGVAFTAWEK LAIAGALDAA GVDEIEAGVP AMGDAEIAMI 
AAIGDEVERA RVIPWCRMRD EDVHAARRTG LGSVHLSVST SVRQIQAKYR MSPRRTLDMA
RDVVSRARDY GLAVSVGGED ASRADPAYLV DLLGVIAEAG AFRFRFADTL GVMDPFGVHE
VMRYLCQSSP LQLEFHGHDD LGLATANTLA AVRAGAACAS VTVLGLGERA GNAPLEEVVA
GVYRLLGRPA GVKLDSLPGL ATLVSRAAVR DIPPDKAIVG DAVFRHESGI HVSGLLRDAA
TYEALDPVQF GRQREIVLGK HSGRAAVRHA LAALGLDADE TVIAATLAAV RARASAAKRT
VALAELAEMH AGLMAGVSK
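
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below builds the codon table by hand; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which this sketch does not handle (this gene starts with ATG, so it is not affected). The function name `translate` and the table construction are illustrative assumptions, not the database's own method.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGTCGGAAA ACGCGCGTCT GCTGATCAGC"))  # prints MSENARLLIS
```

The printed residues match the first ten residues of the protein sequence listed above.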