Gene Gdia_2998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2998 
Symbol 
ID6976432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3273561 
End bp3275327 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content61% 
IMG OID643392506 
ProductHAD-superfamily hydrolase, subfamily IA, variant 1 
Protein accessionYP_002277343 
Protein GI209545114 
COG category[R] General function prediction only 
COG ID[COG5610] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.211302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGTCCC ACCTGAAGCA GTCCATCGCA CGAGCCGACG CCGTCTCTTT CGATGTGTTC 
GATACGCTGT TCGTCCGCCC GCTTGCCGAT CCGGAAGACC TCTTCGACAT CATCGGCGAG
AAATTCGGCA TCGCCTCCTT CCGCCGCCTG CGCCAGGAAG CGCAGGTACG GGCGTTCCAG
CGCATGCGGG AGAACGGACA GAAGGAAATC ACGCTCGACG GCATCTATGC GTGCTTCGAT
TCCGTGTCGG TGCCGGCATC CGTGCTGCGC GATGCCGAGT ACCAGCTCGA ACTCGCCCTG
ACGCTGCCCA ATCCCGATCT CATGGACGTG TTCAGGCAGA CGATCGCCGA TAAACCCGTC
GTCATCACGT CGGATATGTA CCTGCCGCAA GCCTTCTTCG ACGATCTTTT CCACAAGCAC
CGGCTGCAGC CCAGCGCGAC CTTCATTTCG TCGGAGCGAA ACGCAACCAA GCGCGATACC
GGTGAACTGT TCGACCGGGT GTCACAGGAA CTCGGCATAG ACCCGGGGCG CATCCTGCAT
ATCGGGGACA ATCCGCTGTC GGACGTGGAA CGGGCCAGGC AAAAAGGCCT GTCCGCCTAT
CATTACGTCG ATCCCACACG ACAGCAGAAA TCCAGTCGCT TTCCCCCGTC GGCATCGATC
GCCGGCAGCC TCATCCGCTC GATCGCCGAT CGGCCGCCGC CGGGATCGTT TACCGAACTC
GGGTTTCGTT TCGGCGGGCC GGCGGCAGTG GGCTTCCTCG ACTGGATTGT CCGCAAATCA
GCGCAGGACA AGATCGACAT CGTGCTGTTC GTATCGCGAG ACGGATATGT TCTTGAACGC
CTCGCCCGCA CGATGCCCGC GGGGACCTTG CCGCGTTTCA CCTATTTCAT GGGCTCGCGC
GTCGCCTTTA CGCTCGCCGC CACCGACGAG TCCAACTTCA ATACGCAGAT GGAATTCTTC
CTTGCGGGCG CACATGGATT GCGGCCGATC GAGGTGCTGG AGCGGCTGGG CGTCACGCCA
CCGGCCGACC GGGTGATGGA TGACCTCGGC CTCGGAGCCG GAATCGTCAT CAGCAATGAC
AATATCAGCC GCATCCGGGA TTTCGTGGGC GCCTTCCGCG GAGACATCCT GCAGGTATGC
CGTCGCAACC GGCGCGGCCT CCTCAACTAC CTCAAACAGG TGGGCGTTGA ACCGGGCATG
CGCGTCGCCA TGGTCGATGT GGGCTGGAAC GGAACGACGC AGGATGCCTT CGACCTCGCC
CTCGGCAAGC TGATGCAGGT CGAACTGTTC GGCTACTACC TGTGCCTGAA CGAATCGGAT
GATTGCCGGC GGCGGCGGCA AAGACTGAGG ATGGACGCCC TGCTGTCGCG CGAATCAATC
GGCCCGGAAC GGGTAACCGC CGTTTATGCC AATCGTGTCG CCGTCGAACT GTTCTTCTCG
GCACCCCATG ACGCCGTCAT CGGCTACCAG GATGCGATTG GAAAGGATGT CGCCATCATC
GAGGATTCCG GGCGAATTGC CATTGATGGC CATGCCCGAA TTTCGACGGA GATCACGGAC
GGCATCGAAC AGTTCGCGCT GACATTCCGT AATCTTTGCG CCGAGATCGG CCTTGTTGCC
GATCCGCTGG CGACTGCACT GCCGGTTGTG GACTTTGTCG AATCGATTGA CGCGGAAACG
CGCGGCTTAC TGGCGTCCGT CGAAAATTTC GATGCATGGG GCAGTACGCG AAACCAGCGC
GTCGCGCTGA CGACATACCT GCCGTAA
 
Protein sequence
MVSHLKQSIA RADAVSFDVF DTLFVRPLAD PEDLFDIIGE KFGIASFRRL RQEAQVRAFQ 
RMRENGQKEI TLDGIYACFD SVSVPASVLR DAEYQLELAL TLPNPDLMDV FRQTIADKPV
VITSDMYLPQ AFFDDLFHKH RLQPSATFIS SERNATKRDT GELFDRVSQE LGIDPGRILH
IGDNPLSDVE RARQKGLSAY HYVDPTRQQK SSRFPPSASI AGSLIRSIAD RPPPGSFTEL
GFRFGGPAAV GFLDWIVRKS AQDKIDIVLF VSRDGYVLER LARTMPAGTL PRFTYFMGSR
VAFTLAATDE SNFNTQMEFF LAGAHGLRPI EVLERLGVTP PADRVMDDLG LGAGIVISND
NISRIRDFVG AFRGDILQVC RRNRRGLLNY LKQVGVEPGM RVAMVDVGWN GTTQDAFDLA
LGKLMQVELF GYYLCLNESD DCRRRRQRLR MDALLSRESI GPERVTAVYA NRVAVELFFS
APHDAVIGYQ DAIGKDVAII EDSGRIAIDG HARISTEITD GIEQFALTFR NLCAEIGLVA
DPLATALPVV DFVESIDAET RGLLASVENF DAWGSTRNQR VALTTYLP