Gene Information |
Locus tag | Gdia_0509 |
Symbol | |
ID | 6973905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 559186 |
End bp | 562413 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643390041 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002274918 |
Protein GI | 209542689 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
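The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; the variable names are illustrative only and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 559186
end_bp = 562413

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 3228 1075
```

Under these assumptions the result agrees with the listed Gene Length (3228 bp) and Protein Length (1075 aa).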
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.253023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0268717 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCACCC CGCCCGGCGA CACGGTCGTC GGACTGCTGC TGGATATCGC CTGCCTTCTG CTGAGCAACG GTGCGGACGC CCCGGCGATC GAAAACACCG TCGGGGCCTT CGCATCCTGC CTGGGCCATG ACGCGGGATT GACCATTACG TACCGGGTGG ACGCCTTCCT GCTGGACATC GGGACCAGTT CGGGCGAGCA GGCGCGGCAC GCCGTGCCGA TCGCGACCAT GCGTGTGGCG CCGTCGGTGA TCGAGCCGCT GCTGGCGCTG GCCGGCCACG CGTCCTGCAC GCAGGACGAC ATCCATGCCG CGCGGGCCCG CGCGCTGCGG CGGGACGCGG CGGCCGATCC CGGGCACCAG CCGGCCTGGT ACCGGTTCGA CCCGGACTGG TATCGGGCGG CCTATCCCTT CGTCGCCGAG CAGATGGTCT TCCTGGGCTG CGACGACGTC GTGGCCTATT TCCGGGATTT CGGCATCGGG CTGGGCCATT CCCCCAACCC GTTCTTCGAC GAGGCATGGT ATCGCACGGC CCATCCCGAC ATTGCGCGAC TGATCGCGGA CGGCGTGGTC CAGAACGGAT TCGTCCATTA CCTGACCACC GGTTTTGCCG ATCGTTCTCC GCACTGGCTG TTCGATTCCG GCCTGTATCG CAGGGCGCAC CCCGACCTGT CGCCCGAAGG CCTGGCGATA CGGGGGTACC GCAACCTTTA CGACCATTAC CTGGAAGTCG GCGACCCGTC CGGCCTGCGG GGGCAATGGC TGTTCGACCC GTCCGGCAGG TTCCGGCAGG TGGCGGCGTC CCTGCCCCAC GTGGCACCGA CCCTGTCGCT GTCGCCGTGT TTCGACGCGA TCTGGTATCT GAAGACCTAT CCTGAAGTCG CCGCGCTGAT CGCCGCCGGT GCCTATTCCT GCGCGCTGCA TCACTACCTG GCCAATCCGA CCCCGACCCG CTTCTGTGCG ACGCCCTGGT TTTCCGAAGA TTACTACCGT GTTTTTTACG AAGACGTGGA CAGCGCGCTG CGGAACGGCA CCTTCCGCAC GGGATACGAG CATTTTCTGG AATTCGGCCT GCGGGAATGG CGGCGCCCCC ACCCGGATGT CGACCTGCGC GCCTTTCGCG ACCGGCTGGC GCGGACGAAC CCGGAAATGC GCCTGGAACC GGACCCGTTC CGTTTCTGGC TGGCCGTACC GGCCGACCTG CGCATCGCGC CGGCCCCGCC ACGCATCGAC GAGGCGATAT CCCGCGACGC CTTCCGGCAG GCGGCCGAGG ATATGCTGCT GCTGCACGCG CACGAACCGA TCGATTTCAC GCCGGACGGG CCGGCCGACC TGGCCGTGGT GATGGTCGCG CACAACCGGT TCAGCCTGAC GATGCAGGCC CTGGCCGCGC TGCGCACCGG AGGGCCCGGA AACATGCAGG TCATCATTGC CGATTCCGGA TCACATGACG AGACGCGGCA TCTGGAGCGG TACGTCGCGG GTGCCCGAAT CATCCGCTTC GCGCGCAATG TCGGCTATAT CGAGGCCTGC AACGCGGCGC TGCGGATGGT GACGGCACCC TGCACATTGT ACCTGAACAA CGACCTGATC GTGGAGTACG GAGCCATCGC GCGGGCGCTG CGCCGACTGC ATGCCGCGCC CGACATCGGG GCGGTCGGTG CGAAGATTGT CCGCAGCAAC GGCATTCTTC AGGAAGCGGG GTCGATCCTG TGGCGGGACG GCACCACCAG CGGCTATCTG CGCGACCGCG ATCCCGCGAC GCCCGAGGCG AACTTCGTGC GCGAGGCCGA TTACTGTTCC 
GGCGCGTTCC TGCTGGCGCG GACCGGGTTG CTGCATCAGC TTGACGGTTT CGATCCGGCC TTTTCCCCGG CCTATTACGA GGAAGTCGAT CTGTGCGTGC GGATGCGCAA GGCCGGGTAC CGGGTCGTCT ATGATCCGTC GGTCATGGTG CGGCATCTGG AATACGGATC GTCCGACACC GATCATTCCC GTGTCCTGAT GCATCGCAAT CATCGCGTCT TCAGCGACAG GCATCGCGAT ATCCTGCGCT ATTGCCAGCC CCGCGCCGCA GGGAACGCCA TCTTCGCGCG CTCGCCCCGC GGCGCCCGGC GACGCATCCT GTATATCGAG GACCGGCCGC CGATCCGCCG CCATGGCGCG GGCTATGCAC GGTCCAACGA CATCGTGCGC CTGATGGTGG AAATGGATTA TCAGGTCACC ATCTTTCCGA TCCTGATGAC TGACACGCCC CTGCTGGACA TCTACGGCGC CCTGCCCGAC AGCGTGGAAA TCCTGCATGA CCGCCATATC GGCATGCTGG CCGACCTGAT CCGCGAACGG CCGGGATATT ACGATCTGGT CTGGGTGGGC CGGACCCATA ATCTGGCGCA GATCCTGCCG ATCCTGGCCG CGTCACCGGC CGCCCTGCCG GTCGAGGGCT TCATCCTGGA CACCGAGTGC ATCGCCGCCC CCCGGACGGC GGAGCGCGCG CGGGTGCTGG GCCTGGCATC CCCGCCGAAA CTGGATCAGG CGGTGCGCGA CGAACTGGCC TGCGCCTATT TCTGCCAGCA GATCGTGGCA GTCAGCGACC ATGACGCAGC ACTGGTGCGA TCGGCCGGTT ACGACAATGT CGCCGTGCTG GGGCACATGC TGGAACCGGC CCCCACGCCC TCGGGCTGGG CGGAGCGGAG CGGCATCCTG TTTTTGGGCG CCCTGCACGA CATGGAGTCC CCCAATTACG ACAGCATCGC GTGGTTCATC ACGCAGGTCA TGCCGCGCAT GCCGGCGGAG ATGCATCTGA CGATCGCGGG CCATGTCGAT CCGTCGGTTC ATTTCAGCGC CCTGGCCGGC CATGGGCGCG TCACCTTCCT GGGCGCGGTC GATGATCCGC GGCCGCTCTA TGACCGGCAC CGTGTGTTCG TCGCCCCCAC CCGCTTCGCC GGCGGCCTGC CCTACAAGGT TCACGAGGCA GCCGCCCATG GCCTGCCGGT GGTGGCCAGC ACGGTACTGT GCCGGCAGGT CGGCTGGGAT GTGGGCACGG ATATCCTGTG CGGCGGATCG GACGACCCGC AATGCTTTGC CGATGCGATC ATGGCGCTGT ACGAGGATGC CGGATTGTGG CGTACGGTGC GCGACGGTGC GATCGGACGC ATTGCGCGGG AGAATGATCC GCATGCGTAT CGCCGCCGGT TGGCGGACAT TTTGGAAAAA CTGTTATCCA TGGGATAA
|
Protein sequence | MPTPPGDTVV GLLLDIACLL LSNGADAPAI ENTVGAFASC LGHDAGLTIT YRVDAFLLDI GTSSGEQARH AVPIATMRVA PSVIEPLLAL AGHASCTQDD IHAARARALR RDAAADPGHQ PAWYRFDPDW YRAAYPFVAE QMVFLGCDDV VAYFRDFGIG LGHSPNPFFD EAWYRTAHPD IARLIADGVV QNGFVHYLTT GFADRSPHWL FDSGLYRRAH PDLSPEGLAI RGYRNLYDHY LEVGDPSGLR GQWLFDPSGR FRQVAASLPH VAPTLSLSPC FDAIWYLKTY PEVAALIAAG AYSCALHHYL ANPTPTRFCA TPWFSEDYYR VFYEDVDSAL RNGTFRTGYE HFLEFGLREW RRPHPDVDLR AFRDRLARTN PEMRLEPDPF RFWLAVPADL RIAPAPPRID EAISRDAFRQ AAEDMLLLHA HEPIDFTPDG PADLAVVMVA HNRFSLTMQA LAALRTGGPG NMQVIIADSG SHDETRHLER YVAGARIIRF ARNVGYIEAC NAALRMVTAP CTLYLNNDLI VEYGAIARAL RRLHAAPDIG AVGAKIVRSN GILQEAGSIL WRDGTTSGYL RDRDPATPEA NFVREADYCS GAFLLARTGL LHQLDGFDPA FSPAYYEEVD LCVRMRKAGY RVVYDPSVMV RHLEYGSSDT DHSRVLMHRN HRVFSDRHRD ILRYCQPRAA GNAIFARSPR GARRRILYIE DRPPIRRHGA GYARSNDIVR LMVEMDYQVT IFPILMTDTP LLDIYGALPD SVEILHDRHI GMLADLIRER PGYYDLVWVG RTHNLAQILP ILAASPAALP VEGFILDTEC IAAPRTAERA RVLGLASPPK LDQAVRDELA CAYFCQQIVA VSDHDAALVR SAGYDNVAVL GHMLEPAPTP SGWAERSGIL FLGALHDMES PNYDSIAWFI TQVMPRMPAE MHLTIAGHVD PSVHFSALAG HGRVTFLGAV DDPRPLYDRH RVFVAPTRFA GGLPYKVHEA AAHGLPVVAS TVLCRQVGWD VGTDILCGGS DDPQCFADAI MALYEDAGLW RTVRDGAIGR IARENDPHAY RRRLADILEK LLSMG
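The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how one might strip that grouping and compute a GC fraction before further analysis follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence string."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence shown above.
example = "ATGCCCACCC CGCCCGGCGA CACGGTCGTC"
dna = clean_sequence(example)
print(len(dna), round(gc_fraction(dna), 2))
```

Applied to the full gene sequence, the cleaned length would be expected to match the listed Gene Length of 3228 bp, and the GC fraction to be close to the listed 66%, if the record is internally consistent.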