Gene Gdia_2999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2999 
Symbol 
ID6976433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3275377 
End bp3278343 
Gene Length2967 bp 
Protein Length988 aa 
Translation table11 
GC content60% 
IMG OID643392507 
Productglycosyl transferase family 2 
Protein accessionYP_002277344 
Protein GI209545115 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACAT CTGTTCCATG TGTTGGTCCG GCCTCCTTCT GGGCACCACA GCATATCCTG 
GAATCGGCAT GGCTGGAGCA CGCACCGTTC GCGTTCTGGC TGATGGAGAA GGTTCAGCCG
GCCGTCTTCG TCGAACTGGG GACGCATGCC GGGTTCTCGT TCCTCGCGTT CTGCCAGGCG
GTCCAGCGGC TCAGGCTGCC GACACGGTGC TATGCGGTCG ATACCTGGGC GGGGGACGAG
CACGCCGGCT TTTACGGCGA ACAGGTATTC AACACCCTGT CCGGCCTTCA GGGGCAGCAT
TATGCGGGAT TCTCGCGACT CATCCGCGCG TATTTCCACG ATGCGCTCGT CCATTTCACG
GACGGCGAGA TCGATCTCCT GCACATCGAC GGACGACACC GCTACGAAGA TGTCCTCGAG
GACTATACGA CCTGGCTGCC GAAAATGTCC GAACACGGCG TGGTGCTGTT CCACGACATC
AACGTTCGGG AAGGCGATTT CGGGGTCTGG CGCCTGTGGG AGGAACTGCG CGCCAGACAC
CCGTCATTTG AATTCTTTCA CGGGCACGGG CTGGGCGTAC TCTGTCCGGG TTCGCGCGTG
CCGCCCGGAT TGCGCCCGCT GCTGGAGAGC GGCACCGAGG CCCGCGTGGC GATACGCGAA
GCCTACTCAC GGCTGGGCGC CGCCGTTTCG ACCCAGTACG TGCTGGAACG GGCCCACAGG
GAACTGAATG CCGAAATCGG CGCCTTGCAT CAGCAACTGG CGACCCGTGC CCAGGACGAA
GAGCGGCTGC GCGCGGATCT GTCGTCGATC CAGCACGAAC GCGACAGCAT CCAGAAGGAT
CTTTCCGCGA ATATCGACGA CCTGAACCAG CAGTTGGATG CCCGTGACCA GGAGGACGAG
CGCCTGCGTA CCGACCTGTC GTCGGTTCAA GGCGAACTGT CCATTACCCA ACGGGATCTT
TCCAGCGTCC AGAACGTGCT GGCGCTGACG AATACCGAGG TCGCGCAGAT CCGGGCCAGC
ACGACCTGGC GCGCCACGAA GGTCGCCCGG GACGTGGGGT CGCGCCTTTC GCCCCGTGTG
CGTCACCAGT TGCGGCGCGG TGCCAAGGCG GTGTGGTGGG CCATGACGCC CCATCGGATG
CCCGCGCGAC TCCGTTTCCG CCGCGCGCGC GCGGTACAGC ACGCCGTCTT CAGCGAGACG
CCAGGCGACC CGTCCTGCCC GCACATCGCC TATCAGCCCT GGCCGGCATC ATCCGGCACC
GCACTGCCGA ACGGGATGCC GCAAGGCACC TATCGGCTCG CCTCCGACCC GTCGGGCTAC
GTCTTCGTAC CCCGGCGGCG CCCCGACAAT CTGGACGCGC AGATCGCGCG CCTGGCGAAG
CGCCCCGCAT TCTCGATCGT CGTGCCGCTG TACAACACGC CGGACGATCT GTTCCAGCGA
ATGGTGGGCT CCGTCCTGGC GCAATGGTAT CCGCACTGGG AACTGATCCT CGTCGACGAC
AAGAGTCCGC AGCAATCCGT GCGCGACAAT GCGTCGAAGC TCGTCGATCC CCGGATCAGG
ACCATTCTGC TGGAAAGCAA CATGGGAATA TCGGGTGCGA CCAATCGGGG GCTGGCCGAA
GCCGGCGGCG ACTACATCGT GTTCCTCGAT CACGACGACG AACTGACCGA CGACTGCCTG
TTCGAACTGG CGAAATGCAT CGACGCGGAA GATCCCGACT ACGTCTACAG CGACGAGGAC
AAGATCGAGC CGGACGGGCG TTTCAGCCAG CCGTTCTTCA AGCCCGACTG GTCGCCCGAT
ACGCTGATGA GCACGATGTA CACGTGTCAC GTGTCGTGCG TGCGCCGCGC GCTGCTCGAA
ACGGTGGGCG ATCTGCGATC GGAATTCGAC GGAAGTCAGG ATTGGGATTT CGTCCTGCGC
GTGACGGAAG CGGCCAAACG CATCTCTCAC GTGCCGAAGG TGCTCTATCA CTGGCGTATC
ATCCCGCAAT CAGTGGCTTC AGATTTGAAT GCGAAGCCTT ATGCGGTCGA CGCGGGCCGT
CGCGCCCGCA TGGCGGCGCT CGAACGTCGT GGACTGAAGG GAACGATCGA AGCCGTCCCG
CAACTGGCGG GATATTTCCG CGTCAATTAT GACGTGCAGG GAACCCCGCT GGTATCCATC
ATCATTCCCA CCAAGAACAA CGGGACCGTA TTGAAAAACT GTCTCGATTC CATTTTTGGA
CATTCGCATT ACCGAAACTT CGAAATCGTC CTGCTTGATA ATGGTTCAAC CGACGCGGCA
ACGGTGAATT ACCTGGACTC CCTTCACGCC AACCCGAACG TGCGGGTGAT AAGGCATGAC
GCACCGTTCA ACTATTCCGA GATCAACAAT ATCGGCGTCG GCGATGCGAA GGGCAGCCTA
TTGCTGTTCC TGAACGACGA CACTCAGGTG ATCTCACCTG ACTGGATCGG ACGGATGGCC
GGATATGCGC AACTTACGCA TGTCGGCGCT GTCGGTGCCA AGCTGTTGTA TCCAGACAGC
CGGAAAATCC AGCATAGTGG CGTCCTGAAT CTGGCGGATG GTCCCAATCA CGCCTTTTGG
TCTGCCGACG CATACACTCC GGGCTATTTC GCCCGCAATT TGCTCGAATA TGACTGGATC
GCCGTGACCG GAGCCTGCCT CATGATCGAG CGAACGAAAT TTGATGCTGT CGGCGGGTTC
GACGAGTCCT TCCCGATCGC CTATAACGAC GTCGATCTGT GTTTCCGTCT GGTAGAACAC
GGCTTCTATA ATGTGGTGTG TCCTGGCGCT GAGCTTTTCC ATTATGAATC TCTAAGCCGA
GGAAATGACA ACAAAAACAA AGAAAAACGG CGACGCCTCG ATCAGGAAAA AAATCGGCTT
TATTATAAAA ATCCACATTT CCTGATGCAC GATCCTTTCT ACAATCCCAA TTTGGGACCC
AATGATTTTT ATTTTTCGCT TTCATGA
 
Protein sequence
MDTSVPCVGP ASFWAPQHIL ESAWLEHAPF AFWLMEKVQP AVFVELGTHA GFSFLAFCQA 
VQRLRLPTRC YAVDTWAGDE HAGFYGEQVF NTLSGLQGQH YAGFSRLIRA YFHDALVHFT
DGEIDLLHID GRHRYEDVLE DYTTWLPKMS EHGVVLFHDI NVREGDFGVW RLWEELRARH
PSFEFFHGHG LGVLCPGSRV PPGLRPLLES GTEARVAIRE AYSRLGAAVS TQYVLERAHR
ELNAEIGALH QQLATRAQDE ERLRADLSSI QHERDSIQKD LSANIDDLNQ QLDARDQEDE
RLRTDLSSVQ GELSITQRDL SSVQNVLALT NTEVAQIRAS TTWRATKVAR DVGSRLSPRV
RHQLRRGAKA VWWAMTPHRM PARLRFRRAR AVQHAVFSET PGDPSCPHIA YQPWPASSGT
ALPNGMPQGT YRLASDPSGY VFVPRRRPDN LDAQIARLAK RPAFSIVVPL YNTPDDLFQR
MVGSVLAQWY PHWELILVDD KSPQQSVRDN ASKLVDPRIR TILLESNMGI SGATNRGLAE
AGGDYIVFLD HDDELTDDCL FELAKCIDAE DPDYVYSDED KIEPDGRFSQ PFFKPDWSPD
TLMSTMYTCH VSCVRRALLE TVGDLRSEFD GSQDWDFVLR VTEAAKRISH VPKVLYHWRI
IPQSVASDLN AKPYAVDAGR RARMAALERR GLKGTIEAVP QLAGYFRVNY DVQGTPLVSI
IIPTKNNGTV LKNCLDSIFG HSHYRNFEIV LLDNGSTDAA TVNYLDSLHA NPNVRVIRHD
APFNYSEINN IGVGDAKGSL LLFLNDDTQV ISPDWIGRMA GYAQLTHVGA VGAKLLYPDS
RKIQHSGVLN LADGPNHAFW SADAYTPGYF ARNLLEYDWI AVTGACLMIE RTKFDAVGGF
DESFPIAYND VDLCFRLVEH GFYNVVCPGA ELFHYESLSR GNDNKNKEKR RRLDQEKNRL
YYKNPHFLMH DPFYNPNLGP NDFYFSLS