Gene Gdia_1536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1536 
Symbol 
ID6974946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1712229 
End bp1713329 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content71% 
IMG OID643391067 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002275930 
Protein GI209543701 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0350] Methylated DNA-protein cysteine methyltransferase
[COG2169] Adenosine deaminase 
TIGRFAM ID[TIGR00589] O-6-methylguanine DNA methyltransferase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.26366 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGAGA TGACAGACAT GACGGAACGG GACCCGCGCT GGGCCCGCAT CGTGGCACGG 
GACCGCCGCG CGGACGGCCT GTTCTGGTAT TCGGTGCGCA CGACCGGGGT CTATTGCCGC
CCGTCCTGTC CCTCCCGAAT GGCAAAGCCC GGCAACGTGC GCCTGCATGA CACGCTGGGG
GCCGCACGGG CCGCGGGCTT CCGTCCCTGC CGCCGCTGCA ACCCGGACGG CGCATCGCCG
CAGGATATCG GCCAGGCCCT GGTCGTCCAG GCCTGCCGGC TGATCGCGGC GCGGGACGAC
ATGCCGTCCA TCGCCGACCT GGCCCGCGCG GTGGAACTGA GCCCCAGCCA TTTCCACCGG
CTGTTCCGGG CCGAGACCGG CCTGACGCCC CATGCCTATG CCGCCGCCCA TCGCGCCGGC
CGCATCCGCG CGGAACTGCC CCGCGCGGAC AGCGTGACCG AGGCGATTTT CGATGCCGGA
TACGGGTCCA GCGGACGGTT CTACGCCCAG GCGGCCGATC TGCTGGGCAT GACACCGGCC
CGCTATCGCG CCGGCGGCGC GGGCGAACGG ATTCGCTTCG CCGTCGGGCA ATGCAGCCTC
GGCGCGATCC TGGTCGCGTC CAGTGCCAGG GGCGTGGTCG CGATCCTGAT CGACGACGAT
CCGAACACCC TGGTCCGCGA CCTGCAGGCA CGTTTTCCCC GCGCCGAGCT GATCGGCGGT
GACGAGGAGT ACGAACGGCA TGTCGCACTG GTGGTGGGCT TCGTGGAGGC ACCCGGAATC
GGGCTGGACC TGCCGCTCGA TATTCGCGGC ACCGCGTTCC AGCATCGTGT CTGGAGCGCG
CTGCGCGACA TTCCGGCAGG GCAAACCGTC ACCTATGCCG ACATCGCCCG ACGGATCGGC
CAGCCCCAGG CTGTCCGGGC GGTCGCCGGG GCCTGCGCGG CCAACCGGAT TGCCGTGGCC
ATTCCCTGCC ATCGCGTCGT CCGCAACGAC GGCGCACTGT CGGGCTATCG CTGGGGGGTG
GACCGCAAGC GCATCCTGAT CGATCGCGAG CGCACGGCCG CGCCGGTTCC TTCCGCCCGT
CCAGAAGGCC GATCGTCTTG A
 
Protein sequence
MVEMTDMTER DPRWARIVAR DRRADGLFWY SVRTTGVYCR PSCPSRMAKP GNVRLHDTLG 
AARAAGFRPC RRCNPDGASP QDIGQALVVQ ACRLIAARDD MPSIADLARA VELSPSHFHR
LFRAETGLTP HAYAAAHRAG RIRAELPRAD SVTEAIFDAG YGSSGRFYAQ AADLLGMTPA
RYRAGGAGER IRFAVGQCSL GAILVASSAR GVVAILIDDD PNTLVRDLQA RFPRAELIGG
DEEYERHVAL VVGFVEAPGI GLDLPLDIRG TAFQHRVWSA LRDIPAGQTV TYADIARRIG
QPQAVRAVAG ACAANRIAVA IPCHRVVRND GALSGYRWGV DRKRILIDRE RTAAPVPSAR
PEGRSS