Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1018 |
Symbol | |
ID | 6974415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1146545 |
End bp | 1149451 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643390540 |
Product | protein of unknown function DUF1156 |
Protein accession | YP_002275416 |
Protein GI | 209543187 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.174443 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCT CGACCCGGCC CAACCCGACT ATTCGACGCA AGCTGATCGA GGTGTCGCTC CCGCTGGATG AGATCAACGC CGCTAGCGCC AAGGAAAAGT CCATCCGGCA TGGCCATCCA TCCACGTTGC ATCTGTGGTG GGCGCGTCGG CCACTTGCCG CCTGTCGCGC CGTGCTGTTC GCGCAACTGG TGGACGATCC TTCCGCATGG CCCGACCGCT TTCCGACAGA GGAGGCGCAG GAAGCCGAAC GCAACCGGCT CCACGACGTC ATCCGGGCGA TGGTTCCGTG GGATGCAAGC GGGAACGAGA CGATCCTGAA TGCCGCTCGC TGGGAAATCG CCCGTTCTGT TGCATGGAAC ATCGGGGAGG AACCGCCTCC TCGCGAAAAC GGTGAGGCCA TCCTTCGCTA CCTTCAGGAA AAGGCCCCAC CCGTCTATGA CCCGTTCTCC GGCGGTGGTT CCATCCCGCT GGAAGCGCAA CGCCTCGGCC TGCGTGCCTA TGGCTCGGAC CTTAACCCAG TCGCCGTTCT GATCGGCAAG GCACTCGTCG AGATACCACC CAAATTCGCG GGGCAGGCCC CCGTCAATCC CGATGCGAGA GCAGAGGCTG CTCGTGGTGG GGCATGGCAG GGACGCGGCG CACAGGGACT AGCCGAAGAT GTGCGCTATT ACGGCAAATG GATGCGCGAC GAGGCGGAAA TGCGCATTGG CCATCTCTAC CCGAAAGCGG CGCTGCCCGA TGGATCGGAA GCGACTGTTA TCGCCTGGCT CTGGGCGCGA ACGGTCCGCA GTCCTGACCC GGCGGCAAAA GGCGCAATGG TGCCCTTGGT CTCTTCCTTC ATGCTGTCCA CCAAAGAAGG AAAGAAGGCG TGGGCGGAGA TCGTTCATGA CGCAAATGCG TCTGACGGCT GGCGCTTTGT GGTCCGGACG GGGGAACTAT CGAAAGATGA TGAAATACAG CTGAAGAATG GAACAAAGAC AGGTCGTGGT GCAAATTTTC TCTGTGCTTT AACCGGTGCG GCAATTAGCG GAGATTGGAT CAAGGCTGAA GGTGTTGCAG GCCGTCTTGG AGAGCGTTTG ATGGCTGTTG TTGCAGAAGG TAAACGCACT CGTCTCTACT TATCTCCGAG GAATTCAGAC GAGCTTGTAG CGAAGTCTGC TATCCCAAAA TGGTTGCCAG AGGGAGAAAT TGCTAACGAT CCACGTGCAT TATGGGTAAT TTCTTATGGC CTAAAGACTT TCGCATCGCT CTTTACACCA CGTCAACTTG TGGCGCTAAC CACATTCTCC GATCTCGTCG CCGAAGCACG CGAGAAAGTG CTGCAGCACG CCATCGCCGC CGGTCGTTCA ACTGCCTCAA CTCCACTTCA TGAAGGCGGC ACAGGAGCGA CCGCCTATGC GGATGCTGTG GCGACGTATC TGGCATTGGG TGTAAGTAAA ATAGCCGACT ACTCATCCAC AATTGTTCTT TGGAGTTCCT CTCGAGATCA AGCTAAGTCT ACTTTTGCTC GACAGGCATT GCCCATGGCA TGGGACTTTG CAGAGGTTAA CGCTTTTGCT GAAGCAGCGG GGGACTTCTC TGTGTCGATT GCCGGCATCT CCCGAACATT GGGAGACCTT CCCGCGATGT CCGGGGGAGT AGTTTATAAT ATAAACGCTG CTACAAATTC ATTTCCCGTT CGTCCTGTTC TGATCTCCAC TGATCCGCCG TACTATGACA ACATCGGCTA CGCTGACCTT TCTGATTTTT TTTATACTTG GTTGAGACAT TCTCTGGCGG ACGAATGGCC CGGTTTGTTC CGACGCCTTG TTACGCCCAA AAGAGAAGAG CTTATTGCGA CCCCATATCG ACACGGCGGA AAAGAGGGGG CTGAAGCATT CTTTATGGCT GGCATGAAAG ATGCTTTGGC CTCGATACGG GAGGCATCAG TCAAGACAGA ACCTCTGACA ATCTACTACG CGTTTAAACA GTCAGAGATT GAGCAGGAAG GAGTTACTTC AGCGGGATGG GCTTCTTTCT TGCAGGCTGT CGTCGATACC GGTTTATTGA TTGACGGTAC ATGGCCAGTG AGAACGGAAA GGGGCGCAAG AACAATTGCT AGTGGGACAA ATGCGCTTGC CTCCTCTATC GTGCTTGTCT GTCGGACACG TTCAGATAGA TCCGGCGTTA TCACCCGTTC TGACTTCCTT CGAGCGTTGC GTAGAGAACT CCCCGCAGCC CGTGAGCGCC TGCGCGATGA TGGCGTCTCG CCAGTCGATA TGCCGCAGTC CATCATCGGT CCCGGAATGG GTGTCTTCAC CCGATACGCC AGCGTGCTGG AAGACGACGA CAGCGTCATG AGCGTGCGTA CGGCCCTGGC GCTCATCAAC CGCGTCTGGG AAGAAATAGA GAACGCGCTG GACGCGGATT TCGATCCCGA GACGCAGGTG GCCCTGGCCT GGTACGCTTC GCATGGTTTC GACACGCGAC CTTCCGGCGA ACTCATCACA CTGGCCAATG CCAAGAATAT CTCATTGTCT TCACTCTTCC AGTCCGAGGT GTTTCTTGAC CGGCGCGGCA AGGCGCAACT GACGCCGCGT GAAAACCTTC CAGCAGGCTG GTCACCACAG ACAGACGGAA CGCTGACTGT CTGGGAGTGT GTTCAGCACG TTGCCCGCAC GCTGGAGGCC AAAGAAGGTG GTCAGGAGGC AGCAGCCCGT CTCGTCGCAG GCATGGGCGG AAAGACAGAA GCCGCACGAG CGCTGGCCTA TCGCCTCTTC CAGATCGCCA CGGACAAGGG ATGGTCCGCC GAGGCACTGG TTTATAACGC GCTGGCCGAT GAATGGCCGA CCCTTGAGAG GCTGGCGTCG GAAATCCCCA ATCCTGTTGC ATCACCCGTG GCGGAAGAAA CGCCACGTCT GCTTTGA
|
Protein sequence | MSTSTRPNPT IRRKLIEVSL PLDEINAASA KEKSIRHGHP STLHLWWARR PLAACRAVLF AQLVDDPSAW PDRFPTEEAQ EAERNRLHDV IRAMVPWDAS GNETILNAAR WEIARSVAWN IGEEPPPREN GEAILRYLQE KAPPVYDPFS GGGSIPLEAQ RLGLRAYGSD LNPVAVLIGK ALVEIPPKFA GQAPVNPDAR AEAARGGAWQ GRGAQGLAED VRYYGKWMRD EAEMRIGHLY PKAALPDGSE ATVIAWLWAR TVRSPDPAAK GAMVPLVSSF MLSTKEGKKA WAEIVHDANA SDGWRFVVRT GELSKDDEIQ LKNGTKTGRG ANFLCALTGA AISGDWIKAE GVAGRLGERL MAVVAEGKRT RLYLSPRNSD ELVAKSAIPK WLPEGEIAND PRALWVISYG LKTFASLFTP RQLVALTTFS DLVAEAREKV LQHAIAAGRS TASTPLHEGG TGATAYADAV ATYLALGVSK IADYSSTIVL WSSSRDQAKS TFARQALPMA WDFAEVNAFA EAAGDFSVSI AGISRTLGDL PAMSGGVVYN INAATNSFPV RPVLISTDPP YYDNIGYADL SDFFYTWLRH SLADEWPGLF RRLVTPKREE LIATPYRHGG KEGAEAFFMA GMKDALASIR EASVKTEPLT IYYAFKQSEI EQEGVTSAGW ASFLQAVVDT GLLIDGTWPV RTERGARTIA SGTNALASSI VLVCRTRSDR SGVITRSDFL RALRRELPAA RERLRDDGVS PVDMPQSIIG PGMGVFTRYA SVLEDDDSVM SVRTALALIN RVWEEIENAL DADFDPETQV ALAWYASHGF DTRPSGELIT LANAKNISLS SLFQSEVFLD RRGKAQLTPR ENLPAGWSPQ TDGTLTVWEC VQHVARTLEA KEGGQEAAAR LVAGMGGKTE AARALAYRLF QIATDKGWSA EALVYNALAD EWPTLERLAS EIPNPVASPV AEETPRLL
|
| |