Gene Gdia_0133 details


Gene Information

Locus tag: Gdia_0133
Symbol:
ID: 6973525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 146879
End bp: 147937
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 71%
IMG OID: 643389667
Product: 2-nitropropane dioxygenase NPD
Protein accession: YP_002274548
Protein GI: 209542319
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
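
The length fields in this record are consistent with the coordinates: with inclusive coordinates, the gene length is End bp - Start bp + 1, and for an unspliced CDS the protein length is one less than the codon count because the stop codon is not translated. A minimal sketch of that check, assuming Start bp and End bp are inclusive 1-based positions (the function names are illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(146879, 147937)
print(length, protein_length(length))  # 1059 352
```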


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.899957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.134277
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCTGT CGGATGACGC AATGTTTCGG ACCGCCGCGC GCGCGCGGCT GGACCGCCTG 
TGGTCCGCCG GCACGGCCTT CCTGGGCAGC GAGGTCGCGA TCCTGGGCGG CGCCATGTCC
TGGGTCAGCG AGCGCCATCT GGTATCGGCC ATTTCCAACG CCGGCGGCTT CGGCGTGCTG
GCCTGCGGCG CCATGGAACC CGACCGGCTG GCCGAGGAAA TCGCCGCCAC CCAGGCGCTG
ACCAGCCGCC CCTTCGGCGT CAACCTGATC ACCATGCACC CACGCCTGGA CGACCTGATC
CAGGTCTGCC TTTCGGCCGG GGTCACGCAT GTCGTGCTGG CCGGCGGCAT TCCGCCGGGG
CCGGCCATCC GCGCGATCAA GGATGGCGGG GCGCGGGTCG TGGCGTTCGC GCCGGCCCTG
GTGCTGGCCA AGCGGCTGGT GCGCATGGGC GTCGATGCCC TGGTGATCGA GGGCGCGGAG
GCCGGCGGGC ATGTCGGCCC GGTCTCGCTG ACCGTCCTGG CGCAGGAAGT GCTGCCCCAT
ATCCGCTCGG TTCCCGTCTT CGTCGCGGGC GGGCTGGGAC GGGGCGAGGC CATCCTGTCC
TATCTGGAGC AGGGGGCGGC CGGCGCGCAG CTCGGCACCC GCTTCGCCGC GTCGGCCGAA
AGCATTGCCC ACGAACGGTT CAAGGCCGCG TTCGTCCGCG CCAACGCCCG CGACGCCGTG
ACGTCGGTCC AGCTTGACGA ACGCTTCCCC GTCATTCCGG TGCGTGGCCT GTCGAACGAG
GGCGGACGCG CCTTCCTGCG CCATCAGGCG GAAACGATCC GCCGCTACCT GGACGGCGAA
CTGACGCGTG AGGCCGCGCA ACTGGATATC GAGCATTTCT GGGCCGGGTC GCTGCGCCGG
GCGGTGATCG AGGGCGACGT GGAACAGGGT TCGGTCATGG CCGGCCAGTC GGTCGGCATG
ATCTCCTCCG TCCAGCCGGT CGCGGCCATC ATCGCCGAAC TGGTCGAACA GGCGGTCGAT
GCGCTGGTTC GGCGCGACAT GCCGGCGGGG GATGCGTGA
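
The GC content field can be compared against the gene sequence directly. The sketch below assumes GC content means the percentage of G and C among the A/C/G/T characters, and it ignores the spaces and line breaks used to group the sequence into blocks of ten; `gc_percent` is a hypothetical helper name, and the record does not say how its 71% figure was rounded.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the nucleotides in a DNA string.

    Whitespace and any other non-ACGT characters are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence: 13 of 20 bases are G or C.
print(gc_percent("ATGCCCCTGT CGGATGACGC"))  # 65.0
```

Passing the full gene sequence instead of the two-block sample gives the value to compare with the reported GC content.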
 
Protein sequence
MPLSDDAMFR TAARARLDRL WSAGTAFLGS EVAILGGAMS WVSERHLVSA ISNAGGFGVL 
ACGAMEPDRL AEEIAATQAL TSRPFGVNLI TMHPRLDDLI QVCLSAGVTH VVLAGGIPPG
PAIRAIKDGG ARVVAFAPAL VLAKRLVRMG VDALVIEGAE AGGHVGPVSL TVLAQEVLPH
IRSVPVFVAG GLGRGEAILS YLEQGAAGAQ LGTRFAASAE SIAHERFKAA FVRANARDAV
TSVQLDERFP VIPVRGLSNE GGRAFLRHQA ETIRRYLDGE LTREAAQLDI EHFWAGSLRR
AVIEGDVEQG SVMAGQSVGM ISSVQPVAAI IAELVEQAVD ALVRRDMPAG DA
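
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that using only the standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the alternative start codons it permits; since this gene starts with ATG, that difference does not come into play here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position
# varying fastest. The amino-acid assignments are those of the standard
# code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.upper().split())  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCCCCTGT CGGATGACGC AATGTTTCGG"))  # MPLSDDAMFR
```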