Gene Gdia_2076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2076 
Symbol 
ID6975503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2303408 
End bp2304826 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content67% 
IMG OID643391606 
Product2-nitropropane dioxygenase 
Protein accessionYP_002276451 
Protein GI209544222 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.321315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.310329 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGA TCAATGCGAT CCGCATGGGC GGGGTGGATG TCCTGCCGCT GATCGAAGGT 
GGAAAGGGCG TGTCGGTTTC GACCGGCATA TCGTCCGGAC ATTGGGCGGC GGCAGGTGGC
GCCGGCACCG TGTCGATCGT CAACGCCGAT TCGTACGACG AACAGGGTCG TCCGGTTCCG
CAGGTCTATC ACGGCCGGAC CCGGCGGGAA CGGCACGAGG AACTGATCCG CTACGCCATC
CGGGGCGGCG TCGCCCAGGC CCGCATCGCC CACGACCTTT CGGGTGGCCG GGGCCGCGTC
CACGCCAACA TCCTGTGGGA AATGGGCGGG GCCGAGGACG TGATCACCGG CGTGCTGGAA
GAAGCACCCG GCCTGATCCA CGGCCTGACC TGCGGGGCGG GCATGCCCTA TCGCCTGTCG
GGCATCGCGA TGCGGTTCGG CATCCATTAT TATCCCATCG TGTCGTCCGC CCGGGCCTTC
AATGCCCTGT GGAAGCGCTC GTTCCACAAG AGCGCCGACC TGCTGGGCGG CGTGGTGTAC
GAGGACCCGT GGCGGGCCGG CGGCCATAAC GGCCTGTCGA ACACCGAGGA CCCGGGCAGC
CCCGAGGACC CGTTTCCGCG CGTCCTGGCG CTGCGCAAGC TGATGCGCAC CTTCGGCCTG
GACGACACCC CGATCATCAT GGCCGGCGGC GTATGGTGGC TGGAGGAATG GCAGGACTGG
ATCGACAGTC CCGAACTGGG GCCGATCGCC TTCCAGTTCG GCACCCGCCC GCTGCTGACG
CAGGAAAGCC CGATCCCCGA CGCGTGGAAG CGCAAGCTGC TGACGCTGAA GAAGGGCGAC
GTGTTCCTGA ACCGGTTCTC GCCCACCGGC TTCTATTCCT CGGCGGTGAA CAACCCGTTC
CTGCGGGAAC TGCAGGGGCG GTCGGAACGC CAGGTCGCCT ATTCCACCGA ACCGGTGGGC
GAGCATACCG CGTCCTACGG CGTGGGCGCC CGCGCCCGGC AGGTCTTCAT GACCGAGGCC
GACCGCGAGC ATGTCCGCCT GTGGGAACTG GAAGGCTATA CCGAGGCGAT GCGCACGCCG
GATTCCACCC TGATCTTCGT CACGAAGGAC AAGGCGCAGG AAATCCTGAC CGACCAGGTG
GACTGCATGG GCTGCCTGTC GGAATGCCGC TTCTCGAACT GGAGCCAGCG GGGCCCGAAC
TATACCAACG GCCACAAGGC CGATCCGCGT TCCTTCTGCA TCCAGAAGAC GCTGCAGGCC
GTCGCCCATG CCCACGGGGA CACGGGCGAT GCGGCGATGG ACAACAACCT GATGTTCGGT
GGCACGAACG CGTGGCGGTT CGGAACCGAT CCGTTCTATG CCAACGGGTT CGTGCCGACG
GTGGGGCAAC TGGTGGACCG GATTCTCACC GGCCGCTGA
 
Protein sequence
MKAINAIRMG GVDVLPLIEG GKGVSVSTGI SSGHWAAAGG AGTVSIVNAD SYDEQGRPVP 
QVYHGRTRRE RHEELIRYAI RGGVAQARIA HDLSGGRGRV HANILWEMGG AEDVITGVLE
EAPGLIHGLT CGAGMPYRLS GIAMRFGIHY YPIVSSARAF NALWKRSFHK SADLLGGVVY
EDPWRAGGHN GLSNTEDPGS PEDPFPRVLA LRKLMRTFGL DDTPIIMAGG VWWLEEWQDW
IDSPELGPIA FQFGTRPLLT QESPIPDAWK RKLLTLKKGD VFLNRFSPTG FYSSAVNNPF
LRELQGRSER QVAYSTEPVG EHTASYGVGA RARQVFMTEA DREHVRLWEL EGYTEAMRTP
DSTLIFVTKD KAQEILTDQV DCMGCLSECR FSNWSQRGPN YTNGHKADPR SFCIQKTLQA
VAHAHGDTGD AAMDNNLMFG GTNAWRFGTD PFYANGFVPT VGQLVDRILT GR