Gene Gdia_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0040 
Symbol 
ID6973429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp45162 
End bp46664 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content68% 
IMG OID643389573 
Productinosine-5'-monophosphate dehydrogenase 
Protein accessionYP_002274457 
Protein GI209542228 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0516] IMP dehydrogenase/GMP reductase 
TIGRFAM ID[TIGR01302] inosine-5'-monophosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.116515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0387479 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCGC CCCATAATCC GCTATCCCAT CCGCGCGTCG ACGATCGCAT CCGCGAGGCG 
CTGGCGTTCG ACGACGTTCT GGTCGTCCCG GCGGAATCCA ATGTCCTGCC CGGACAGACC
TCGACGAAAA GCCGCCTGAC GCGCCGCATC GGCCTGAATA TCCCGCTGAT TTCCTCGGCC
ATGGACACGG TGACCGAGGA CGCGATGGCC ATCGCCATGG CGCAGCAGGG CGGCATGGGC
GTGATCCACA AGAATCTGAG CGTCGAGGAA CAGGCCGAGC AGGTCCGGCG CGTGAAGCGT
TTCGAATCCG GCATGGTCGT CAATCCGGTG ACGGTGTGGC CCGACCAGAC CCTGGCCGAC
GTCAATGCGA TCATGTCGCG CCACGGCATC AGCGGCCTGC CGGTGATCGA GCGCGAGACC
AAGCGGCTGG TCGGCATGCT GACCAACCGC GACGTCCGCT TCGCCACCGA TCCCGCCCTG
CGCGTGGATT CGCTGATGAC GCGGGAAAAC CTGGTGACCG TCGGCGCCGA TGTCGGCCAC
GACCAGGCGC GGCAGTTGCT GCACCGCCAC CGGATCGAGA AGCTGCTGGT AGTCGATGAC
GAAGGGCGCT GCGTCGGACT GATCACCGTC AAGGACATCG AAAAGGCGGT CCTGCACCCG
CTGGCCAACA AGGATGAGAT GGGGCGCCTG CGCTGCGCCG CCGCGACCGG CGTGGGCGAG
GACGGGTTCA CCCGCGCCCG GGCGCTGATC GAGGCCGGGG TGGATGTCGT GGTCGTCGAT
ACCGCGCACG GCCATTCCTC GGGCGTGCTG GACACGGTGG CGCGGGTCAA GGCGGTGGAT
GACCGGATCC AGGTCGTCGC CGGCAACGTC GCGACGCCCG AGGCCGCCGT GGCGCTGATC
GAGGCCGGGG CCGACTGCGT GAAGATCGGC ATCGGCCCGG GGTCGATCTG CACCACGCGG
GTCGTGGCCG GCGTGGGCGT GCCGCAGTTC AGCGCGGTGC TGGAAACCTC GGCCGCGTGC
CATGAGCTGG ACGTGCCAGC CATCGCCGAT GGCGGCATAC GGACGTCGGG CGACATCGTC
AAGGCGATCG GGGCCGGGGC GGACGTGGTC ATGATCGGCT CGCTGCTGGC CGGGACCGAG
GAAGCGCCGG GCGAGGTGTT CCTGTATGAA GGCCGGTCCT ACAAATCCTA TCGCGGGATG
GGCAGCCTGG GCGCCATGGC GCGCGGCTCG GCGGACCGGT ATTTCCAGCA GGAGATCAAG
GAAACCCACA AGATGGTCCC CGAGGGGATC GAGGGGCGCG TCGCCTACAA GGGCGGCATG
GACGCCGTGG TGCACCAACT GGTCGGTGGC CTGCGCGCCG GCATGGGCTA TACCGGGTCG
GCCACGATCG CGGACCTGCA GGTTCGTGCG CGTTTCCGCC GCATCACGGG GGCGGGACTG
CGCGAAAGCC ACGTCCATGA CGTGGCGATC ACGCGCGAGG CGCCGAATTA CCGCCGCGAC
TGA
 
Protein sequence
MSSPHNPLSH PRVDDRIREA LAFDDVLVVP AESNVLPGQT STKSRLTRRI GLNIPLISSA 
MDTVTEDAMA IAMAQQGGMG VIHKNLSVEE QAEQVRRVKR FESGMVVNPV TVWPDQTLAD
VNAIMSRHGI SGLPVIERET KRLVGMLTNR DVRFATDPAL RVDSLMTREN LVTVGADVGH
DQARQLLHRH RIEKLLVVDD EGRCVGLITV KDIEKAVLHP LANKDEMGRL RCAAATGVGE
DGFTRARALI EAGVDVVVVD TAHGHSSGVL DTVARVKAVD DRIQVVAGNV ATPEAAVALI
EAGADCVKIG IGPGSICTTR VVAGVGVPQF SAVLETSAAC HELDVPAIAD GGIRTSGDIV
KAIGAGADVV MIGSLLAGTE EAPGEVFLYE GRSYKSYRGM GSLGAMARGS ADRYFQQEIK
ETHKMVPEGI EGRVAYKGGM DAVVHQLVGG LRAGMGYTGS ATIADLQVRA RFRRITGAGL
RESHVHDVAI TREAPNYRRD