Gene Gdia_1815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1815 
Symbol 
ID6975237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2008847 
End bp2011723 
Gene Length2877 bp 
Protein Length958 aa 
Translation table11 
GC content64% 
IMG OID643391340 
Productglycosyl transferase family 2 
Protein accessionYP_002276190 
Protein GI209543961 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.613695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAATTTT CGGTTCATGC CGAGGCCGGG ATACCGGGAA TGATCCTCGG ACTGTCGGAC 
GACCTGTTTC GGACCGGCGC CGAGGGGGAT GTGGGCGTAC TGCTCGATGG CGTGTATCGG
GGGCAGGCGT CCGTCCGGCG CGAGGCCGGC CGGGTCCTTG TTGGCCTGCC GACCCATCTG
CTGGCGCGCG AGGTCGATCT GCTGGATCTG CGGGACGGGC GAAGCCTTCT GAAGACGGCG
TGCTTCATCC TGCCCTGTTA CGAACTGACG ACGGGTGCGA TCACGGTATC CGGCGGCGCC
ATCGTGGGCG ATTTCTCGGT GCGCGGACTG GACGACCATG TCCTGGTCGA ATGTATCGAG
AACGGCCAGG TCCTGGCGCG GGGCTTCGCT ACCCGTGCCG GGGGGACGGA TTACCGGTTT
CATCTTCCTT TCCCCACCCT GCTGACTCCG CAGCAGCAGG TAGCCTGCCT GTTCAGGATC
GCCGGCCTGT TCCTGGACGG GCCGCCCTTC CTGCTGACGA TGCAGACGTT GGGCTATCTG
GGTTATGTCG ACCGGCCGGA ACCGGGGTGC GTCACCGGCT GGGTGTGCGA CATGACCATG
CCGGACCGGC GGGTGGCCGT CGATCTGGTG CGCGACGGCG TGGTGCTGCA GACCGTCCTG
GCGGACCGGT TCCGCGAGGA TCTGCTGGCC CTCGGCATCG GGGACGGTCA CGCCGGGTTT
TCCTTCACGC TGCCGCATGA CGGATCGTTC CGCAATCCGG CCGATGTCTC GGTCGTCGTA
TCCGGCGGCA GCCTGAACCT GGTCAATTCC CCCCTCATGG TCTTCGCGCC GCCGCCGTTC
CGGGGCGCGT TCGACCGGCT GCACGGCATG TCCGCCCATG GCTGGGCCCT GAACATGGCG
GACCCGAAAA CGCCCGTGCA GGTCGAGGCC GTCTGCAACG GCCAGGTCCT GGCCACGGCC
ACCGCGCGGC TGTTTCGCGG CGACCTGCTG GATGCGGGTC TGAACGAAGG TTTCTGCGCC
TACAAGATCG ACATCGGCAA GCAGATCCTG GATCTGCTGG GGCAGGAGAT CCAGGTCCGC
ATCGCCGGCC ATCCCGACCT GGTCCTGACG GGTTCGCCGC AGGTGGCGAC GCAGAACCCC
AACCTGCTGC GCTACCTGAA GCCCGCGCGC GGGATGAATC CGGCCTTGCT GCCGCGCCTG
CGCCGGACAC TGGATCACCG CGTGGGGCCT TTGCGATTAT CCATCGTCAT GCCGGTGTAC
AATACGCCGC GCGACTGGCT GGTCCAGGCG ATCGAAAGCG TCCTGGCCCA ATGGTGCGGA
CGGTGGGAAC TGATCTGCAT CGACGATTGT TCGACCGCGC CGCATGTGGG CGCGGTGCTG
CGCGCGTACG CGGACAGGGA CCCGCGCATC CGCGTGCTGA CGCCGCAGGT CAATGGCGGC
ATCGCGGTTG CGACCAATCT GGGCCTGCGC GCCGCGCGCG GCGATTATGT CACATTCCTG
GACCATGACG ATGTCCTGGA ACCCGATGCG ATCTACCACC TTCTGAAGAC GGCGCGGGAG
ACGGATGCCG ATTTCATCTA TTCCGACGAG GCGACGACCG ACGAGAACAT CGACAGCATC
GCGGACGTCA AGGCGCGGCC CGCCTTTTCC TATGATTACT ATCTGTCGCA CCCGTATTTC
GTGCATATGC TCTGCGTCCG GCGTCGCCTC GCCCATGAAA TCGGCGGATG GGACGAACGG
ATGGCGATTT CCGCCGACGT GGATTTCGTG CTGCGCGTCC TGGCCCGGGC CGCCAGCATC
GCCCATGTTC CCCGAATCCT GTATCGCTGG CGGACCCATG GCGGCAGCAC AGGCCATGCG
AAGAAGAAAG CGGTGATGGA GGCGACATGC GCCGCCATCC AGCGGCACCT GGACCAGGCC
CACCCGGGGG CGATGGTGTC CGCCGGGTTG GGGTTCAACC AGTTCCGTGT CGACTGGCCG
GCGACGGAAG GGAAAATCCT GATCGTCATC CCGACGAAGA ACAAGGCCGA CCTGGTGCGT
GTCGCCATCG ACTCCATCGC GCGGACATCC GCCGGCGCGG ACTACCGGAT CGTCGTCGTC
GATCATGATT CCACCGAGCC CGAATCGGTC GCCTATTTCA AGTCGATCCG CGATCGCTGC
ACGGTCATGA AATATACCGG CGAATTCAAC TATTCCCGCA TGAACAACCT GGCCGTCCGG
AAACATGGCA GGGACGCCGA TTTCATCCTG TTCCTGAACA ACGATATCGA GGCGATCACC
GACGGGTGGC TGGACAGGAT GCGGCGGCTG GCCCATCGGC CGGAGGTCGG GATCGTCGGT
GCCCTGCTGC TCTATCCGGA CCGGCGCGTA CAACATGCGG GCGTCATCAT GGGATTCAAT
GGATCGGCCG ATCATGCCTT CAAGTTCGAG GATGTCTATC TGAACGACGG AAACCAGAGG
AGTTTCGGCT ATAATTGCAG CCTGACATCG GTGCGCGATT TTTCCGCCGT CACCGCGGCC
TGCATGATGA TGCGCAGATC CGTCTTCGAT CAGGTGGGGG GATTCGACGA AACCCTGAAG
GTCGGGTTCA ACGATACCGA TTTCTGCCTG CGTGTTCGCG AGGCGGGGCT GAAGGTCCTG
TATGACGGCT ATACCGTCCT GTTCCATCAT GAAAGCGCCA CGCGCAGCCA GACCAGACAG
GTCATGCACC CCGAGGATAC CGAGCGGATG CTGCACCGGC ATGGCCGGAT ACTGGAAGGC
GGCGACCCGT TCTATAATCC CAATCTGAGC GTGACGGCAC AGGATCATGT CGTGCGCGAC
GATAACGGAT GCGGGCGGGG CCGCGTGCGC GTCACGCGCC TTTGTGCCTC GATGTAA
 
Protein sequence
MEFSVHAEAG IPGMILGLSD DLFRTGAEGD VGVLLDGVYR GQASVRREAG RVLVGLPTHL 
LAREVDLLDL RDGRSLLKTA CFILPCYELT TGAITVSGGA IVGDFSVRGL DDHVLVECIE
NGQVLARGFA TRAGGTDYRF HLPFPTLLTP QQQVACLFRI AGLFLDGPPF LLTMQTLGYL
GYVDRPEPGC VTGWVCDMTM PDRRVAVDLV RDGVVLQTVL ADRFREDLLA LGIGDGHAGF
SFTLPHDGSF RNPADVSVVV SGGSLNLVNS PLMVFAPPPF RGAFDRLHGM SAHGWALNMA
DPKTPVQVEA VCNGQVLATA TARLFRGDLL DAGLNEGFCA YKIDIGKQIL DLLGQEIQVR
IAGHPDLVLT GSPQVATQNP NLLRYLKPAR GMNPALLPRL RRTLDHRVGP LRLSIVMPVY
NTPRDWLVQA IESVLAQWCG RWELICIDDC STAPHVGAVL RAYADRDPRI RVLTPQVNGG
IAVATNLGLR AARGDYVTFL DHDDVLEPDA IYHLLKTARE TDADFIYSDE ATTDENIDSI
ADVKARPAFS YDYYLSHPYF VHMLCVRRRL AHEIGGWDER MAISADVDFV LRVLARAASI
AHVPRILYRW RTHGGSTGHA KKKAVMEATC AAIQRHLDQA HPGAMVSAGL GFNQFRVDWP
ATEGKILIVI PTKNKADLVR VAIDSIARTS AGADYRIVVV DHDSTEPESV AYFKSIRDRC
TVMKYTGEFN YSRMNNLAVR KHGRDADFIL FLNNDIEAIT DGWLDRMRRL AHRPEVGIVG
ALLLYPDRRV QHAGVIMGFN GSADHAFKFE DVYLNDGNQR SFGYNCSLTS VRDFSAVTAA
CMMMRRSVFD QVGGFDETLK VGFNDTDFCL RVREAGLKVL YDGYTVLFHH ESATRSQTRQ
VMHPEDTERM LHRHGRILEG GDPFYNPNLS VTAQDHVVRD DNGCGRGRVR VTRLCASM