Gene Information |
Locus tag | Gdia_2998 |
Symbol | |
ID | 6976432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 3273561 |
End bp | 3275327 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643392506 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 1 |
Protein accession | YP_002277343 |
Protein GI | 209545114 |
COG category | [R] General function prediction only |
COG ID | [COG5610] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
| |
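The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (both assumptions fit the values listed but are not stated in the record); the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information fields for Gdia_2998.
length = gene_length_bp(3273561, 3275327)
print(length, protein_length_aa(length))  # prints: 1767 588
```

Under these assumptions the computed values agree with the listed Gene Length (1767 bp) and Protein Length (588 aa).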
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.211302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGTCCC ACCTGAAGCA GTCCATCGCA CGAGCCGACG CCGTCTCTTT CGATGTGTTC GATACGCTGT TCGTCCGCCC GCTTGCCGAT CCGGAAGACC TCTTCGACAT CATCGGCGAG AAATTCGGCA TCGCCTCCTT CCGCCGCCTG CGCCAGGAAG CGCAGGTACG GGCGTTCCAG CGCATGCGGG AGAACGGACA GAAGGAAATC ACGCTCGACG GCATCTATGC GTGCTTCGAT TCCGTGTCGG TGCCGGCATC CGTGCTGCGC GATGCCGAGT ACCAGCTCGA ACTCGCCCTG ACGCTGCCCA ATCCCGATCT CATGGACGTG TTCAGGCAGA CGATCGCCGA TAAACCCGTC GTCATCACGT CGGATATGTA CCTGCCGCAA GCCTTCTTCG ACGATCTTTT CCACAAGCAC CGGCTGCAGC CCAGCGCGAC CTTCATTTCG TCGGAGCGAA ACGCAACCAA GCGCGATACC GGTGAACTGT TCGACCGGGT GTCACAGGAA CTCGGCATAG ACCCGGGGCG CATCCTGCAT ATCGGGGACA ATCCGCTGTC GGACGTGGAA CGGGCCAGGC AAAAAGGCCT GTCCGCCTAT CATTACGTCG ATCCCACACG ACAGCAGAAA TCCAGTCGCT TTCCCCCGTC GGCATCGATC GCCGGCAGCC TCATCCGCTC GATCGCCGAT CGGCCGCCGC CGGGATCGTT TACCGAACTC GGGTTTCGTT TCGGCGGGCC GGCGGCAGTG GGCTTCCTCG ACTGGATTGT CCGCAAATCA GCGCAGGACA AGATCGACAT CGTGCTGTTC GTATCGCGAG ACGGATATGT TCTTGAACGC CTCGCCCGCA CGATGCCCGC GGGGACCTTG CCGCGTTTCA CCTATTTCAT GGGCTCGCGC GTCGCCTTTA CGCTCGCCGC CACCGACGAG TCCAACTTCA ATACGCAGAT GGAATTCTTC CTTGCGGGCG CACATGGATT GCGGCCGATC GAGGTGCTGG AGCGGCTGGG CGTCACGCCA CCGGCCGACC GGGTGATGGA TGACCTCGGC CTCGGAGCCG GAATCGTCAT CAGCAATGAC AATATCAGCC GCATCCGGGA TTTCGTGGGC GCCTTCCGCG GAGACATCCT GCAGGTATGC CGTCGCAACC GGCGCGGCCT CCTCAACTAC CTCAAACAGG TGGGCGTTGA ACCGGGCATG CGCGTCGCCA TGGTCGATGT GGGCTGGAAC GGAACGACGC AGGATGCCTT CGACCTCGCC CTCGGCAAGC TGATGCAGGT CGAACTGTTC GGCTACTACC TGTGCCTGAA CGAATCGGAT GATTGCCGGC GGCGGCGGCA AAGACTGAGG ATGGACGCCC TGCTGTCGCG CGAATCAATC GGCCCGGAAC GGGTAACCGC CGTTTATGCC AATCGTGTCG CCGTCGAACT GTTCTTCTCG GCACCCCATG ACGCCGTCAT CGGCTACCAG GATGCGATTG GAAAGGATGT CGCCATCATC GAGGATTCCG GGCGAATTGC CATTGATGGC CATGCCCGAA TTTCGACGGA GATCACGGAC GGCATCGAAC AGTTCGCGCT GACATTCCGT AATCTTTGCG CCGAGATCGG CCTTGTTGCC GATCCGCTGG CGACTGCACT GCCGGTTGTG GACTTTGTCG AATCGATTGA CGCGGAAACG CGCGGCTTAC TGGCGTCCGT CGAAAATTTC GATGCATGGG GCAGTACGCG AAACCAGCGC GTCGCGCTGA CGACATACCT GCCGTAA
|
Protein sequence | MVSHLKQSIA RADAVSFDVF DTLFVRPLAD PEDLFDIIGE KFGIASFRRL RQEAQVRAFQ RMRENGQKEI TLDGIYACFD SVSVPASVLR DAEYQLELAL TLPNPDLMDV FRQTIADKPV VITSDMYLPQ AFFDDLFHKH RLQPSATFIS SERNATKRDT GELFDRVSQE LGIDPGRILH IGDNPLSDVE RARQKGLSAY HYVDPTRQQK SSRFPPSASI AGSLIRSIAD RPPPGSFTEL GFRFGGPAAV GFLDWIVRKS AQDKIDIVLF VSRDGYVLER LARTMPAGTL PRFTYFMGSR VAFTLAATDE SNFNTQMEFF LAGAHGLRPI EVLERLGVTP PADRVMDDLG LGAGIVISND NISRIRDFVG AFRGDILQVC RRNRRGLLNY LKQVGVEPGM RVAMVDVGWN GTTQDAFDLA LGKLMQVELF GYYLCLNESD DCRRRRQRLR MDALLSRESI GPERVTAVYA NRVAVELFFS APHDAVIGYQ DAIGKDVAII EDSGRIAIDG HARISTEITD GIEQFALTFR NLCAEIGLVA DPLATALPVV DFVESIDAET RGLLASVENF DAWGSTRNQR VALTTYLP
|
| |
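The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal Python sketch of that cleanup plus two basic checks (GC fraction and a rough complete-CDS test). The function names are hypothetical, and the CDS check is a simplification: it accepts only ATG as a start codon, although translation table 11 also permits alternative starts.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start (simplification), and a terminal stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# The first two blocks of the gene sequence listed above, used as a short example.
example = clean_sequence("ATGGTGTCCC ACCTGAAGCA")
print(len(example), gc_fraction(example))
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` would give a value to compare against the listed GC content, and `looks_like_complete_cds` offers a quick sanity check on the start and stop codons.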