Gene Information |
Locus tag | Gdia_1815 |
Symbol | |
ID | 6975237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 2008847 |
End bp | 2011723 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643391340 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002276190 |
Protein GI | 209543961 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
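The length fields in the table above can be cross-checked against the coordinates. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2008847
end_bp = 2011723

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2877
print(protein_length)  # 958
```

Both values agree with the Gene Length and Protein Length rows, and the gene length is a multiple of three, as expected for an unspliced CDS.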
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.613695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAATTTT CGGTTCATGC CGAGGCCGGG ATACCGGGAA TGATCCTCGG ACTGTCGGAC GACCTGTTTC GGACCGGCGC CGAGGGGGAT GTGGGCGTAC TGCTCGATGG CGTGTATCGG GGGCAGGCGT CCGTCCGGCG CGAGGCCGGC CGGGTCCTTG TTGGCCTGCC GACCCATCTG CTGGCGCGCG AGGTCGATCT GCTGGATCTG CGGGACGGGC GAAGCCTTCT GAAGACGGCG TGCTTCATCC TGCCCTGTTA CGAACTGACG ACGGGTGCGA TCACGGTATC CGGCGGCGCC ATCGTGGGCG ATTTCTCGGT GCGCGGACTG GACGACCATG TCCTGGTCGA ATGTATCGAG AACGGCCAGG TCCTGGCGCG GGGCTTCGCT ACCCGTGCCG GGGGGACGGA TTACCGGTTT CATCTTCCTT TCCCCACCCT GCTGACTCCG CAGCAGCAGG TAGCCTGCCT GTTCAGGATC GCCGGCCTGT TCCTGGACGG GCCGCCCTTC CTGCTGACGA TGCAGACGTT GGGCTATCTG GGTTATGTCG ACCGGCCGGA ACCGGGGTGC GTCACCGGCT GGGTGTGCGA CATGACCATG CCGGACCGGC GGGTGGCCGT CGATCTGGTG CGCGACGGCG TGGTGCTGCA GACCGTCCTG GCGGACCGGT TCCGCGAGGA TCTGCTGGCC CTCGGCATCG GGGACGGTCA CGCCGGGTTT TCCTTCACGC TGCCGCATGA CGGATCGTTC CGCAATCCGG CCGATGTCTC GGTCGTCGTA TCCGGCGGCA GCCTGAACCT GGTCAATTCC CCCCTCATGG TCTTCGCGCC GCCGCCGTTC CGGGGCGCGT TCGACCGGCT GCACGGCATG TCCGCCCATG GCTGGGCCCT GAACATGGCG GACCCGAAAA CGCCCGTGCA GGTCGAGGCC GTCTGCAACG GCCAGGTCCT GGCCACGGCC ACCGCGCGGC TGTTTCGCGG CGACCTGCTG GATGCGGGTC TGAACGAAGG TTTCTGCGCC TACAAGATCG ACATCGGCAA GCAGATCCTG GATCTGCTGG GGCAGGAGAT CCAGGTCCGC ATCGCCGGCC ATCCCGACCT GGTCCTGACG GGTTCGCCGC AGGTGGCGAC GCAGAACCCC AACCTGCTGC GCTACCTGAA GCCCGCGCGC GGGATGAATC CGGCCTTGCT GCCGCGCCTG CGCCGGACAC TGGATCACCG CGTGGGGCCT TTGCGATTAT CCATCGTCAT GCCGGTGTAC AATACGCCGC GCGACTGGCT GGTCCAGGCG ATCGAAAGCG TCCTGGCCCA ATGGTGCGGA CGGTGGGAAC TGATCTGCAT CGACGATTGT TCGACCGCGC CGCATGTGGG CGCGGTGCTG CGCGCGTACG CGGACAGGGA CCCGCGCATC CGCGTGCTGA CGCCGCAGGT CAATGGCGGC ATCGCGGTTG CGACCAATCT GGGCCTGCGC GCCGCGCGCG GCGATTATGT CACATTCCTG GACCATGACG ATGTCCTGGA ACCCGATGCG ATCTACCACC TTCTGAAGAC GGCGCGGGAG ACGGATGCCG ATTTCATCTA TTCCGACGAG GCGACGACCG ACGAGAACAT CGACAGCATC GCGGACGTCA AGGCGCGGCC CGCCTTTTCC TATGATTACT ATCTGTCGCA CCCGTATTTC GTGCATATGC TCTGCGTCCG GCGTCGCCTC GCCCATGAAA TCGGCGGATG GGACGAACGG ATGGCGATTT CCGCCGACGT GGATTTCGTG CTGCGCGTCC TGGCCCGGGC CGCCAGCATC 
GCCCATGTTC CCCGAATCCT GTATCGCTGG CGGACCCATG GCGGCAGCAC AGGCCATGCG AAGAAGAAAG CGGTGATGGA GGCGACATGC GCCGCCATCC AGCGGCACCT GGACCAGGCC CACCCGGGGG CGATGGTGTC CGCCGGGTTG GGGTTCAACC AGTTCCGTGT CGACTGGCCG GCGACGGAAG GGAAAATCCT GATCGTCATC CCGACGAAGA ACAAGGCCGA CCTGGTGCGT GTCGCCATCG ACTCCATCGC GCGGACATCC GCCGGCGCGG ACTACCGGAT CGTCGTCGTC GATCATGATT CCACCGAGCC CGAATCGGTC GCCTATTTCA AGTCGATCCG CGATCGCTGC ACGGTCATGA AATATACCGG CGAATTCAAC TATTCCCGCA TGAACAACCT GGCCGTCCGG AAACATGGCA GGGACGCCGA TTTCATCCTG TTCCTGAACA ACGATATCGA GGCGATCACC GACGGGTGGC TGGACAGGAT GCGGCGGCTG GCCCATCGGC CGGAGGTCGG GATCGTCGGT GCCCTGCTGC TCTATCCGGA CCGGCGCGTA CAACATGCGG GCGTCATCAT GGGATTCAAT GGATCGGCCG ATCATGCCTT CAAGTTCGAG GATGTCTATC TGAACGACGG AAACCAGAGG AGTTTCGGCT ATAATTGCAG CCTGACATCG GTGCGCGATT TTTCCGCCGT CACCGCGGCC TGCATGATGA TGCGCAGATC CGTCTTCGAT CAGGTGGGGG GATTCGACGA AACCCTGAAG GTCGGGTTCA ACGATACCGA TTTCTGCCTG CGTGTTCGCG AGGCGGGGCT GAAGGTCCTG TATGACGGCT ATACCGTCCT GTTCCATCAT GAAAGCGCCA CGCGCAGCCA GACCAGACAG GTCATGCACC CCGAGGATAC CGAGCGGATG CTGCACCGGC ATGGCCGGAT ACTGGAAGGC GGCGACCCGT TCTATAATCC CAATCTGAGC GTGACGGCAC AGGATCATGT CGTGCGCGAC GATAACGGAT GCGGGCGGGG CCGCGTGCGC GTCACGCGCC TTTGTGCCTC GATGTAA
|
Protein sequence | MEFSVHAEAG IPGMILGLSD DLFRTGAEGD VGVLLDGVYR GQASVRREAG RVLVGLPTHL LAREVDLLDL RDGRSLLKTA CFILPCYELT TGAITVSGGA IVGDFSVRGL DDHVLVECIE NGQVLARGFA TRAGGTDYRF HLPFPTLLTP QQQVACLFRI AGLFLDGPPF LLTMQTLGYL GYVDRPEPGC VTGWVCDMTM PDRRVAVDLV RDGVVLQTVL ADRFREDLLA LGIGDGHAGF SFTLPHDGSF RNPADVSVVV SGGSLNLVNS PLMVFAPPPF RGAFDRLHGM SAHGWALNMA DPKTPVQVEA VCNGQVLATA TARLFRGDLL DAGLNEGFCA YKIDIGKQIL DLLGQEIQVR IAGHPDLVLT GSPQVATQNP NLLRYLKPAR GMNPALLPRL RRTLDHRVGP LRLSIVMPVY NTPRDWLVQA IESVLAQWCG RWELICIDDC STAPHVGAVL RAYADRDPRI RVLTPQVNGG IAVATNLGLR AARGDYVTFL DHDDVLEPDA IYHLLKTARE TDADFIYSDE ATTDENIDSI ADVKARPAFS YDYYLSHPYF VHMLCVRRRL AHEIGGWDER MAISADVDFV LRVLARAASI AHVPRILYRW RTHGGSTGHA KKKAVMEATC AAIQRHLDQA HPGAMVSAGL GFNQFRVDWP ATEGKILIVI PTKNKADLVR VAIDSIARTS AGADYRIVVV DHDSTEPESV AYFKSIRDRC TVMKYTGEFN YSRMNNLAVR KHGRDADFIL FLNNDIEAIT DGWLDRMRRL AHRPEVGIVG ALLLYPDRRV QHAGVIMGFN GSADHAFKFE DVYLNDGNQR SFGYNCSLTS VRDFSAVTAA CMMMRRSVFD QVGGFDETLK VGFNDTDFCL RVREAGLKVL YDGYTVLFHH ESATRSQTRQ VMHPEDTERM LHRHGRILEG GDPFYNPNLS VTAQDHVVRD DNGCGRGRVR VTRLCASM
|
| |
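The gene sequence is displayed in blocks of ten bases, so the spaces have to be removed before any analysis. The sketch below is illustrative only: it strips the whitespace, computes a GC fraction, and translates a CDS with the standard codon assignments that translation table 11 uses for elongation. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, and `START_CODONS` is a deliberately partial set of the initiation codons that table 11 allows. For real work an established library such as Biopython would be a safer choice.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a partial set of bacterial start codons; table 11 lists a few more.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
head = "TTGGAATTTT CGGTTCATGC CGAGGCCGGG"
print(translate_cds(head))          # MEFSVHAEAG
print(f"{gc_fraction(head):.3f}")   # 0.567
```

The translated fragment matches the first ten residues of the protein sequence in the record. The gene starts with TTG, which encodes leucine internally but is read as methionine at the initiation position.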