Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2997 |
Symbol | |
ID | 6976431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 3271252 |
End bp | 3273501 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643392505 |
Product | hypothetical protein |
Protein accession | YP_002277342 |
Protein GI | 209545113 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.286656 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.38238 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAATA TCATAAAAAA AAATAAATCG GTCGATATAA ATAAAAACGC CCTTTGTGTC ATATTGCTGG TGTGTGCTCC ACTTCTCCTG AAGTTTCCAC TTCTGATCGG CCTGCTGATC GCCGATCCTG CGCTGCTCTA TGCCGGGCTC CAGACCGGTC TGCATGCCGG TCCGCTAACC GGCTATCCGC CGATGCCGAC CATCGACCCG AACATCGGTT TCACGAGTCA TGCACTGGGG TATCGGGCGG CCGAGGATGT TCTGAGCGGA CGCCTGCCGT GGTGGAATCC CTATGAGGGC GTGGGTATGC CCCTTGCCGG GGAAATGCAG TCGGCGGCGC TGTTCCCATT GACGTTATTG CTGGCGCTGC ATAATGGCCA GTTGTACATG CATCTATGCT TGCAGATCAT TGCGGGCCTT TCGACCTATG CAGTACTGCG CAAGATCGGA TGCATGCGCT TCGGCGCGCT TGCCGCGGCC CTTGTATTCG AATTCGATGG CACATTCGCC TGGCTGGCCA ATGCCGTTAT CAATCCCATC GCCTTCCTGC CATTGACCCT CCTGGGCGTC GAAACGATAC GGGAGCGGGT CGAGGCCGGA CGCAGGGGAG GCGGCGCATG GGTCACGATC GGGCTCTCCG CTTCGCTCTA TGCCGGCTTT CCCGAGGTCG CCTATCTGGA CGGCCTGCTG GTCCTTGCCT GGACGTTGGT GCGGCTGGGT TCACTCGCGC GTCCGATGCG CTGGGCCTTC CTGCGCCGGA TCGTCGTTTC TGGCCTGGCC GCACTGGCGA TTTCCGCACT GATCCTCGTT CCGTTCGGCG ACTATATGCT TGTTGCCAAC ACGGGGGGAC ACGCGAATGG TGGTTTCGCG TTCATCTCGC TGAGTCCGGC CTTCCTGTTC GCCCTGCTGG TGCCTTATGC CTTCGGCGGA ATCTTCCAGA TTCCGCAATA TACGGAGTTC TGGTCATCGG TGGGGGGATA CACAGGCTGC ATTCTGCCTG TCCTTGCGCT GTGCGGCCTG GGAGGACAGG CGTTGCGCGG CGTGAGGGTC GCGCTGGGGC TGTGGATTGT CGTGACGATC GGCATCGCGT ACGGCGCACC GGGGGCCGGC CTCCTTGCCC AGTTCTTCCC GGGGTTCAAG TTCGTGGCGC TCTACCGTTA TTTGTCGCCG TCCTGGGAAT TCTGCCTGTG CGTGCTCGCC GCATTCGGGT TGACGGACCT GGCACGAAAC CACAGGTTCG CCCGGGTTAC CGCCGCCTGC ATTGCCGTTG CGGCCGTCTG CGTCGTGACC GCCTATGTGA CGCATCGACA TCATCTGCCC CTGGCGCGCA ACAGGCTTGC GCTGGACAGC ATCGTGTTTG CCGCGATCCT GCTGATCGGT ACCGGCGTGC TGGCCTGGCT GCCCCTGACG GCGGCGACAC GGTCACGCGG CATGGCGGGC GTGCTCGTCG CGGAAGTCGC CGTGCTGTTT GCCTTGCCGT CACTCAGTTC TCCGTCGCAT GGCGCCATCG ACCTGACAGG GGTACAGTAC CTGCAGCAGC ATCTCGGTTT TCAGCGTTTC GCGACATTCG GGCCGGTTCA ACCCAACTAT GGCAGCTATT TCGGGATTGC GTCCATCAAC CACAACGATC TTCCGCTCCC CAGGGACTGG ACGGATTACG TTGCCCGGCA CCTCGATGCC AATGCGCCGT CGATCCTCTT CACAGGCTTC AGCCGGAACG ATCCGAACGG TCCGTCCGCG GCGGACAATC TGGTGGTGAA CGTCGAGGCC TATCGGAAGG CGGGCGTGCG ATTTGTCCTG GTGCCTGCGG GATCACTGGA CAGCCCGGGC TTCGCATCGT TCAAGCACTA TGCCGCAGCC GATCATGGCG TGCGGGAAGC ATTCGGAAAC GGCGTGATGC GGATTTTCGA ACTGCCCGAT CCGACACCCT ATGCCGCGGC ACCCGGCTGT GTGCTGACGC CTCGTTCCCG CGATCTCATG GACGTCGATT GCACCGGACC GTCGCAACTG ACCCGCCTGG AAATGTACAT GGAGGGCTGG CACGCGTCCG TCGACGGCAA CCCGGTGCCG ATCAGCCGGA CGGGCGAGAT TTTTCAGCAG ATACCGGTTC GACAGGGTCG TTCCGTCGTG ACGTTCCGCT TTGTGCCACC CCATATCCAG TGGGCGTTTT TCGCCTTCAT TGTCGGCTGG ATCCTTTTCG CGATCGACCT TCTGGGAGGG AAGGAGCGCA CGGCAAGGAA AATCCTGTAG
|
Protein sequence | MINIIKKNKS VDINKNALCV ILLVCAPLLL KFPLLIGLLI ADPALLYAGL QTGLHAGPLT GYPPMPTIDP NIGFTSHALG YRAAEDVLSG RLPWWNPYEG VGMPLAGEMQ SAALFPLTLL LALHNGQLYM HLCLQIIAGL STYAVLRKIG CMRFGALAAA LVFEFDGTFA WLANAVINPI AFLPLTLLGV ETIRERVEAG RRGGGAWVTI GLSASLYAGF PEVAYLDGLL VLAWTLVRLG SLARPMRWAF LRRIVVSGLA ALAISALILV PFGDYMLVAN TGGHANGGFA FISLSPAFLF ALLVPYAFGG IFQIPQYTEF WSSVGGYTGC ILPVLALCGL GGQALRGVRV ALGLWIVVTI GIAYGAPGAG LLAQFFPGFK FVALYRYLSP SWEFCLCVLA AFGLTDLARN HRFARVTAAC IAVAAVCVVT AYVTHRHHLP LARNRLALDS IVFAAILLIG TGVLAWLPLT AATRSRGMAG VLVAEVAVLF ALPSLSSPSH GAIDLTGVQY LQQHLGFQRF ATFGPVQPNY GSYFGIASIN HNDLPLPRDW TDYVARHLDA NAPSILFTGF SRNDPNGPSA ADNLVVNVEA YRKAGVRFVL VPAGSLDSPG FASFKHYAAA DHGVREAFGN GVMRIFELPD PTPYAAAPGC VLTPRSRDLM DVDCTGPSQL TRLEMYMEGW HASVDGNPVP ISRTGEIFQQ IPVRQGRSVV TFRFVPPHIQ WAFFAFIVGW ILFAIDLLGG KERTARKIL
|
| |