Gene Information |
Locus tag | Gdia_2772 |
Symbol | |
ID | 6976203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 3038867 |
End bp | 3041845 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643392281 |
Product | hypothetical protein |
Protein accession | YP_002277120 |
Protein GI | 209544891 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | |
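The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3038867
end_bp = 3041845

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not
# counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2979
print(protein_length)  # 992
```

Under these assumptions the computed values agree with the Gene Length (2979 bp) and Protein Length (992 aa) fields.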
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAACA GGGGTTCCGA GTGGCGGCGA TGGGAGCCGC ACATTCACGC GCCCGGCACC GCCATGAACA ACCAGTTCAG CGGCCCGACG GCGTGGCACG ATTATCTGAC CGCTCTGGAG AAGGCGACGC CGGTGATCGA GGCGATCGCG GTGACCGACT ATTATGTGAC GGACACCTAC GAGGAAGTGC TCCGCCAAAA GGCGCTGGGC CGCTTGCCCA CGGCCGCGCT GGTCTTCCCG AATGTCGAGC TGCGGCTGGA TGTGGCGACG GCCAAAGGCG GCTTCGTCAA TCTCCACCTC TTCGTGAGCC CTGAAGACCC GAACCATGTT GAGGAGCTGC AGCGCCTGCT GTCGCGCTTG CAGTTCAACG TCATGCAGGA TCGGTTCGAC TGTACCCGTG CCGACCTGAT CCGCCTCGGT AAGAAAGCCG ATCCGGACAT CACCGACGAC CGCGCCGCGC TCGCCTACGG CGCAAACCAG TTCAAGGTGA ATTTCCAGAA GCTGCGCGAA GTGTTTTCGG AGAGCGGTTG GGCGAAGAAG AATATCCTGA TTGCGGTCGC CGGCGGCGCC ACCGACGGCA CATCGGGCGT TCGCGAAGCC GCCGACCAGA CCATTCGGCG CGAGATCGAG GGCTTCGCCC ACGTCATTTT CGCCAGCAGC GTTGCACAGC GCGAGTTCTG GCTCGGCGAT CGCGATCTCA GTCCCACCGA GATCCGCACG CGCTATGGCG GCCTCAAGCC CTGCCTGCAC GGCAGCGACG CGCACAAGCT GGACGACGTC GCCACGCCGT TCGGCGACCG CTTCTCGTGG ATCAAGGGCG GCCTTGAGTT CGATGCCCTC CGCCAAGCCT GCATCGACCC GGGCGGCCGC GCTTATGTTG GCGCCGAGCC GCCGGCATCG GCCACACCGT CGCAGGTCAT TTCGAAGATC GAGATTCTTG ACTCACCGTG GGCGGCAACG CCGGTGATCC CGCTGAACCC CGGCCTTGTC GCTATCGTCG GCGCCCGTGG TTCGGGCAAG ACCGCGCTGG CGGACGTCAT CGCGGCGGGC TGCGACTCCA TTACGGACGA GGCGTGGAAT GCGGATGAAT GGGCGAACCC GTCATTCCTC GTCCGGGCCC GACCGCTGAT CGGCGAAGGC AAGGTCAAGG TTAGTTGGTC GGCGGGCGAG CCGAGCGTCC GCGCGCTGGA TGGCTCCGAC GCCAATGGCC CCTTCGCTTA CGAGCGCGTC CGCTATCTGT CGCAGCAATT CGTGGAGGAG CTTTGCTCCT CCAACGGGCT GACCGACGGT CTGTTGCGCG AGATCGAGCG GGTGATCTTC GAGGCCCACC CCGACGACGA GCGCGATGGC GCACTGGATT TCGACGAGCT GCTCGAACAG CGCGCGACAC GGCACCGCCT TGCGCGCGAG CGCGAAGCCG ACGCGGTCGC CCAGATTTCC GATCGCATCA GTACGGAGAT CGAGAAGGAA AAGATGGCCG CGACCTATGA GTCGCAGGCA GCGCAAAAGA AGAAGCTGGT CGACGCCTAC ACGGCTGATC GATCCAAATT GGTCTCTGCC GGGAGCGAGA AGCGCGCCGA ACGGCATACC GAACTGGCTG GTGCGGCGAA CACGGTGCGC GCTACGCTGC GTCGGTTTAC GGGCCAGCGG CAGACTTTCC TTGCCCTCCA GGACGAGGTG AAGGATCAGC GGCGCAATCA AGCCCCCGAG GGCCTTCGAC AGGCGCAGGC CCGCCACTCG AACAGCGGCA TGTCGGCGGA GCAATGGTCC GCCTTCCTGT TGGATTACAC CGGGCCGGTC 
GACAACGATC TCGACGGCTA CGTGAAATGG GCAGATGGGA AGATTGCCGA GCTGAAAGGC ACGCCTCCCG TACCGGGCGA TCCAAACAAA CCGTATTTCC CGGACGATGC CGATCTGAGC AAGCTGTCCC AGGCCATGCT CGACGCCGAG ATGTCGCGGC TGGAAAAGCT CGTCAGCGCC GACAAGGAGA CCCAGCGTCG CTACACGGCC CTATCCGGCA ACATCGCGAC CGAGACTGCG GCGCTGCAAA CGCTGAACGA GAAGCTGAAG GATGCTCAGG GCGCCAAGGA TCGCGCCAAG GAACTGCAGC GCGATCGCGA AGATGCCTAC GGGCGGGCGT TCGACGCTCT TGTCGCCGAA CAGTCGGTGT TGGAGGCGCT CTACGCGCCG CTGATGGAGC GCCTGTCGAA GGCCTCCGGG ACGCTTCAGA AACTGTCGTT CTCGGTCGCC CGCGTGGCCA ATGTCGAGCA GTGGGCGTCG GAAGCGGAAG ACGGACTGAT CGATCTCCGC AAGGCGGGCG CGTTCCGTGG CAAGGGCACC TTGCTGCAGA AGGCCAACGA AGTGATTAAG TCGGCGTGGG AAACCGGCGA CGCGGCCGCG GTGCGAACCG CGATGGCCGA GTTTCGACGG CTCTACCAAA GGGCGCTGCT GGACCATTCG CCGGTAGCGC ATACCGACCA GGTCGAATTC CGCGCCTGGT CGAAGCGGTT TGCGCAATGG TTGTTCAGCA CCGACCATAT CTCGATCCGC TACGGGATCG ACTATGATGG GGTCGACATC AGGAAGCTGT CACCGGGCAC GCGCGGCATC GTCCTGCTGC TCCTCTATCT GGCGCTGGAT GACAGCGACA ATCGCCCGCT CGTGATCGAC CAACCAGAGG AGAATCTGGA CCCGAAGTCG GTGTTCGACG AGTTGGTCCA TCTCTTCGTT GAGGCAAAGG CCCACCGGCA GGTCATCATG GTCACGCACA ACGCGAACCT GGTGATCAAC ACCGACGCCG ACCAGATCAT CATCGCCGAG TCGGGGCCGC ATCCGCAGGG CGCGCTTCCG CCGATCACCT ACAAGTCCGG AGGGCTTGAG AACGCAGAGA TCAGGAAGGC GGTGTGCGAC ATTCTGGAGG GCGGCGAAGG CGCGTTTCAA GAGCGAGCAC GCCGCCTGCG GGTAAGGCTC GAACGGTAG
|
Protein sequence | MLNRGSEWRR WEPHIHAPGT AMNNQFSGPT AWHDYLTALE KATPVIEAIA VTDYYVTDTY EEVLRQKALG RLPTAALVFP NVELRLDVAT AKGGFVNLHL FVSPEDPNHV EELQRLLSRL QFNVMQDRFD CTRADLIRLG KKADPDITDD RAALAYGANQ FKVNFQKLRE VFSESGWAKK NILIAVAGGA TDGTSGVREA ADQTIRREIE GFAHVIFASS VAQREFWLGD RDLSPTEIRT RYGGLKPCLH GSDAHKLDDV ATPFGDRFSW IKGGLEFDAL RQACIDPGGR AYVGAEPPAS ATPSQVISKI EILDSPWAAT PVIPLNPGLV AIVGARGSGK TALADVIAAG CDSITDEAWN ADEWANPSFL VRARPLIGEG KVKVSWSAGE PSVRALDGSD ANGPFAYERV RYLSQQFVEE LCSSNGLTDG LLREIERVIF EAHPDDERDG ALDFDELLEQ RATRHRLARE READAVAQIS DRISTEIEKE KMAATYESQA AQKKKLVDAY TADRSKLVSA GSEKRAERHT ELAGAANTVR ATLRRFTGQR QTFLALQDEV KDQRRNQAPE GLRQAQARHS NSGMSAEQWS AFLLDYTGPV DNDLDGYVKW ADGKIAELKG TPPVPGDPNK PYFPDDADLS KLSQAMLDAE MSRLEKLVSA DKETQRRYTA LSGNIATETA ALQTLNEKLK DAQGAKDRAK ELQRDREDAY GRAFDALVAE QSVLEALYAP LMERLSKASG TLQKLSFSVA RVANVEQWAS EAEDGLIDLR KAGAFRGKGT LLQKANEVIK SAWETGDAAA VRTAMAEFRR LYQRALLDHS PVAHTDQVEF RAWSKRFAQW LFSTDHISIR YGIDYDGVDI RKLSPGTRGI VLLLLYLALD DSDNRPLVID QPEENLDPKS VFDELVHLFV EAKAHRQVIM VTHNANLVIN TDADQIIIAE SGPHPQGALP PITYKSGGLE NAEIRKAVCD ILEGGEGAFQ ERARRLRVRL ER
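As a rough consistency check, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. The sketch below assumes that the gene sequence is shown in coding orientation (already reverse-complemented for the minus strand) and that translation table 11 assigns internal codons the same amino acids as the standard code. The helper `translate` and the constant names are hypothetical and not part of any database tool.

```python
# Amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these assignments with the
# standard code and differs only in which start codons are allowed.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate coding-strand DNA; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 9 bases of the gene sequence in the record.
head = translate("ATGTTGAACA GGGGTTCCGA GTGGCGGCGA")
tail = translate("GAACGGTAG")

print(head)  # MLNRGSEWRR
print(tail)  # ER*
```

The translated head matches the first ten residues of the protein sequence, and the tail matches its final two residues followed by a stop codon.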