Gene Gdia_0390 details


Gene Information

Locus tag: Gdia_0390
Symbol:
ID: 6973784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 428641
End bp: 429789
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 66%
IMG OID: 643389922
Product: galactonate dehydratase
Protein accession: YP_002274801
Protein GI: 209542572
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
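
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene ends in a single stop codon; the record itself does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 428641
END_BP = 429789
GENE_LENGTH_BP = 1149
PROTEIN_LENGTH_AA = 382


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, ASSUMING one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under those assumptions, 429789 - 428641 + 1 gives 1149 bp, and 1149 / 3 - 1 gives 382 aa, matching the listed values.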


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.100514
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.488186
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCA CGAAGCTGAC GACGTTCCAG GTCCCGCCGC GCTGGCTGTT CCTGAAGATC 
GAGACCGACG AGGGGATCAG CGGCTGGGGT GAGCCGGTGG TCGAGGGCAG GGCCGACACC
GTTGCCGCCG CCGTGGCCGA ACTGGCGGAC TACCTGGTCG GCAAGGACCC GTTCCGCATC
GAGGATCACT GGACCGTCCT GTATCGCGGC GGCTTCTACC GGGGCGGCGC GGTGCATATG
AGCGCCATTG CCGGTATCGA TCAGGCGCTG TGGGACATCA AGGGCCGGGC CTTCGGTGTG
CCGGTGCACG ATCTGCTGGG CGGGCGCTGC CGCGACCGTA TCCGCGTCTA TTCCTGGATC
GGCGGCGACC GGCCGGCGGA CACGGCCCAG GCGGTCCGCG CCGTGGTCGA TCGCGGCTTT
ACCGCGATCA AGATGAATGC GACCGAAGAA CTGCAATATG TCGACAGCCA CGCCAAGGTG
GACGACGTGA TCGCCCGTGT CGCCGCGATC CGCGAGGAGG CGGGGCCCTA TCTGGGCATC
GGCGTGGATT TCCACGGCCG CGTGCACAAG CCGATGGCCA AGGTCCTGGC CAGGGAACTG
GAACCCTACG ACCTGATGTT CATCGAGGAG CCGGTCCTGA GCGAGCATCT GGAAGACCTG
CCTGAAATCA CCAAGCACAC CTCGATTCCC ATCGCGCTGG GCGAACGCCT GTTTTCGCGC
TGGGACTTCA AGCGGGTGTT CGAACAGGGG TGCGTGGACA TCATCCAGCC CGACCCGTCG
CATGCCGGCG GCATCACCGA AACCCGCAAG ATCGCGGCGA TGGCGGAGGC CTATGACGTC
GCGGTGGCGC TGCACTGCCC GCTGGGGCCG ATCGCGCTGG CGGCGAACCT GCAGCTCGAT
GCCCTGTGCT ACAATGCGTT CATCCAGGAA CAGAGCCTGG GCATCCACTA CAACAAGACC
AACGACCTGC TGGACTATCT GGTGGATCCG GATGTCTTCG CCTATCGCGA TGGGCACGTG
GACATCCCGA CCGGCCCCGG CCTGGGGATC GAGATCAACG AGGACTATGT CCGCGCCCGC
GCCGCCGAGG GCCATCGCTG GCGCAACCCG GTCTGGCGGC ATCGCGACGG GTCGTTCGCG
GAATGGTAG
 
Protein sequence
MKITKLTTFQ VPPRWLFLKI ETDEGISGWG EPVVEGRADT VAAAVAELAD YLVGKDPFRI 
EDHWTVLYRG GFYRGGAVHM SAIAGIDQAL WDIKGRAFGV PVHDLLGGRC RDRIRVYSWI
GGDRPADTAQ AVRAVVDRGF TAIKMNATEE LQYVDSHAKV DDVIARVAAI REEAGPYLGI
GVDFHGRVHK PMAKVLAREL EPYDLMFIEE PVLSEHLEDL PEITKHTSIP IALGERLFSR
WDFKRVFEQG CVDIIQPDPS HAGGITETRK IAAMAEAYDV AVALHCPLGP IALAANLQLD
ALCYNAFIQE QSLGIHYNKT NDLLDYLVDP DVFAYRDGHV DIPTGPGLGI EINEDYVRAR
AAEGHRWRNP VWRHRDGSFA EW