Gene Gdia_1018 details


Gene Information

Locus tag: Gdia_1018
Symbol:
ID: 6974415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1146545
End bp: 1149451
Gene Length: 2907 bp
Protein Length: 968 aa
Translation table: 11
GC content: 57%
IMG OID: 643390540
Product: protein of unknown function DUF1156
Protein accession: YP_002275416
Protein GI: 209543187
COG category: [L] Replication, recombination and repair
COG ID: [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon
TIGRFAM ID:
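
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the CDS ends in a single stop codon, as the TGA at the end of the gene sequence suggests. The function names `gene_length` and `protein_length` are illustrative only and do not come from any database API.

```python
# Values copied from the record; the helper names are hypothetical.
start_bp = 1146545
end_bp = 1149451
reported_gene_length = 2907      # bp
reported_protein_length = 968    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residue count for a complete CDS: codon count minus one stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length = gene_length(start_bp, end_bp)
print(length == reported_gene_length)
print(protein_length(length) == reported_protein_length)
```

Under these assumptions both comparisons agree with the record: 2907 bp is 969 codons, and dropping the stop codon leaves 968 residues.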


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.174443
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCT CGACCCGGCC CAACCCGACT ATTCGACGCA AGCTGATCGA GGTGTCGCTC 
CCGCTGGATG AGATCAACGC CGCTAGCGCC AAGGAAAAGT CCATCCGGCA TGGCCATCCA
TCCACGTTGC ATCTGTGGTG GGCGCGTCGG CCACTTGCCG CCTGTCGCGC CGTGCTGTTC
GCGCAACTGG TGGACGATCC TTCCGCATGG CCCGACCGCT TTCCGACAGA GGAGGCGCAG
GAAGCCGAAC GCAACCGGCT CCACGACGTC ATCCGGGCGA TGGTTCCGTG GGATGCAAGC
GGGAACGAGA CGATCCTGAA TGCCGCTCGC TGGGAAATCG CCCGTTCTGT TGCATGGAAC
ATCGGGGAGG AACCGCCTCC TCGCGAAAAC GGTGAGGCCA TCCTTCGCTA CCTTCAGGAA
AAGGCCCCAC CCGTCTATGA CCCGTTCTCC GGCGGTGGTT CCATCCCGCT GGAAGCGCAA
CGCCTCGGCC TGCGTGCCTA TGGCTCGGAC CTTAACCCAG TCGCCGTTCT GATCGGCAAG
GCACTCGTCG AGATACCACC CAAATTCGCG GGGCAGGCCC CCGTCAATCC CGATGCGAGA
GCAGAGGCTG CTCGTGGTGG GGCATGGCAG GGACGCGGCG CACAGGGACT AGCCGAAGAT
GTGCGCTATT ACGGCAAATG GATGCGCGAC GAGGCGGAAA TGCGCATTGG CCATCTCTAC
CCGAAAGCGG CGCTGCCCGA TGGATCGGAA GCGACTGTTA TCGCCTGGCT CTGGGCGCGA
ACGGTCCGCA GTCCTGACCC GGCGGCAAAA GGCGCAATGG TGCCCTTGGT CTCTTCCTTC
ATGCTGTCCA CCAAAGAAGG AAAGAAGGCG TGGGCGGAGA TCGTTCATGA CGCAAATGCG
TCTGACGGCT GGCGCTTTGT GGTCCGGACG GGGGAACTAT CGAAAGATGA TGAAATACAG
CTGAAGAATG GAACAAAGAC AGGTCGTGGT GCAAATTTTC TCTGTGCTTT AACCGGTGCG
GCAATTAGCG GAGATTGGAT CAAGGCTGAA GGTGTTGCAG GCCGTCTTGG AGAGCGTTTG
ATGGCTGTTG TTGCAGAAGG TAAACGCACT CGTCTCTACT TATCTCCGAG GAATTCAGAC
GAGCTTGTAG CGAAGTCTGC TATCCCAAAA TGGTTGCCAG AGGGAGAAAT TGCTAACGAT
CCACGTGCAT TATGGGTAAT TTCTTATGGC CTAAAGACTT TCGCATCGCT CTTTACACCA
CGTCAACTTG TGGCGCTAAC CACATTCTCC GATCTCGTCG CCGAAGCACG CGAGAAAGTG
CTGCAGCACG CCATCGCCGC CGGTCGTTCA ACTGCCTCAA CTCCACTTCA TGAAGGCGGC
ACAGGAGCGA CCGCCTATGC GGATGCTGTG GCGACGTATC TGGCATTGGG TGTAAGTAAA
ATAGCCGACT ACTCATCCAC AATTGTTCTT TGGAGTTCCT CTCGAGATCA AGCTAAGTCT
ACTTTTGCTC GACAGGCATT GCCCATGGCA TGGGACTTTG CAGAGGTTAA CGCTTTTGCT
GAAGCAGCGG GGGACTTCTC TGTGTCGATT GCCGGCATCT CCCGAACATT GGGAGACCTT
CCCGCGATGT CCGGGGGAGT AGTTTATAAT ATAAACGCTG CTACAAATTC ATTTCCCGTT
CGTCCTGTTC TGATCTCCAC TGATCCGCCG TACTATGACA ACATCGGCTA CGCTGACCTT
TCTGATTTTT TTTATACTTG GTTGAGACAT TCTCTGGCGG ACGAATGGCC CGGTTTGTTC
CGACGCCTTG TTACGCCCAA AAGAGAAGAG CTTATTGCGA CCCCATATCG ACACGGCGGA
AAAGAGGGGG CTGAAGCATT CTTTATGGCT GGCATGAAAG ATGCTTTGGC CTCGATACGG
GAGGCATCAG TCAAGACAGA ACCTCTGACA ATCTACTACG CGTTTAAACA GTCAGAGATT
GAGCAGGAAG GAGTTACTTC AGCGGGATGG GCTTCTTTCT TGCAGGCTGT CGTCGATACC
GGTTTATTGA TTGACGGTAC ATGGCCAGTG AGAACGGAAA GGGGCGCAAG AACAATTGCT
AGTGGGACAA ATGCGCTTGC CTCCTCTATC GTGCTTGTCT GTCGGACACG TTCAGATAGA
TCCGGCGTTA TCACCCGTTC TGACTTCCTT CGAGCGTTGC GTAGAGAACT CCCCGCAGCC
CGTGAGCGCC TGCGCGATGA TGGCGTCTCG CCAGTCGATA TGCCGCAGTC CATCATCGGT
CCCGGAATGG GTGTCTTCAC CCGATACGCC AGCGTGCTGG AAGACGACGA CAGCGTCATG
AGCGTGCGTA CGGCCCTGGC GCTCATCAAC CGCGTCTGGG AAGAAATAGA GAACGCGCTG
GACGCGGATT TCGATCCCGA GACGCAGGTG GCCCTGGCCT GGTACGCTTC GCATGGTTTC
GACACGCGAC CTTCCGGCGA ACTCATCACA CTGGCCAATG CCAAGAATAT CTCATTGTCT
TCACTCTTCC AGTCCGAGGT GTTTCTTGAC CGGCGCGGCA AGGCGCAACT GACGCCGCGT
GAAAACCTTC CAGCAGGCTG GTCACCACAG ACAGACGGAA CGCTGACTGT CTGGGAGTGT
GTTCAGCACG TTGCCCGCAC GCTGGAGGCC AAAGAAGGTG GTCAGGAGGC AGCAGCCCGT
CTCGTCGCAG GCATGGGCGG AAAGACAGAA GCCGCACGAG CGCTGGCCTA TCGCCTCTTC
CAGATCGCCA CGGACAAGGG ATGGTCCGCC GAGGCACTGG TTTATAACGC GCTGGCCGAT
GAATGGCCGA CCCTTGAGAG GCTGGCGTCG GAAATCCCCA ATCCTGTTGC ATCACCCGTG
GCGGAAGAAA CGCCACGTCT GCTTTGA
 
Protein sequence
MSTSTRPNPT IRRKLIEVSL PLDEINAASA KEKSIRHGHP STLHLWWARR PLAACRAVLF 
AQLVDDPSAW PDRFPTEEAQ EAERNRLHDV IRAMVPWDAS GNETILNAAR WEIARSVAWN
IGEEPPPREN GEAILRYLQE KAPPVYDPFS GGGSIPLEAQ RLGLRAYGSD LNPVAVLIGK
ALVEIPPKFA GQAPVNPDAR AEAARGGAWQ GRGAQGLAED VRYYGKWMRD EAEMRIGHLY
PKAALPDGSE ATVIAWLWAR TVRSPDPAAK GAMVPLVSSF MLSTKEGKKA WAEIVHDANA
SDGWRFVVRT GELSKDDEIQ LKNGTKTGRG ANFLCALTGA AISGDWIKAE GVAGRLGERL
MAVVAEGKRT RLYLSPRNSD ELVAKSAIPK WLPEGEIAND PRALWVISYG LKTFASLFTP
RQLVALTTFS DLVAEAREKV LQHAIAAGRS TASTPLHEGG TGATAYADAV ATYLALGVSK
IADYSSTIVL WSSSRDQAKS TFARQALPMA WDFAEVNAFA EAAGDFSVSI AGISRTLGDL
PAMSGGVVYN INAATNSFPV RPVLISTDPP YYDNIGYADL SDFFYTWLRH SLADEWPGLF
RRLVTPKREE LIATPYRHGG KEGAEAFFMA GMKDALASIR EASVKTEPLT IYYAFKQSEI
EQEGVTSAGW ASFLQAVVDT GLLIDGTWPV RTERGARTIA SGTNALASSI VLVCRTRSDR
SGVITRSDFL RALRRELPAA RERLRDDGVS PVDMPQSIIG PGMGVFTRYA SVLEDDDSVM
SVRTALALIN RVWEEIENAL DADFDPETQV ALAWYASHGF DTRPSGELIT LANAKNISLS
SLFQSEVFLD RRGKAQLTPR ENLPAGWSPQ TDGTLTVWEC VQHVARTLEA KEGGQEAAAR
LVAGMGGKTE AARALAYRLF QIATDKGWSA EALVYNALAD EWPTLERLAS EIPNPVASPV
AEETPRLL