Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2886 |
Symbol | |
ID | 6976318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 3161358 |
End bp | 3164384 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 643392394 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002277232 |
Protein GI | 209545003 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0431865 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGCGGT TCCGGCCGCA GGGGGTGACG GGTGCCGATC GGGCCGCGTG GCAGGCGTGG GCCGATGCGC GTGCCCAGGC GGCCTTTCGC CGTGCGCGCC TGCTGGAGGA TGCGGGCGAC CCGGACGGGG CCTGGCGATG GTACGAGCGG GCGAACCGCC TGGCTCCCGA CAGCCCGAAC GTGATGTTCC CGCTGGCGGT GGCGCGGCTG CGCGGCGGGG ACACCGCCGG CAGCGTGCGT CTGCTGCGTG AACTGACCCG GCGTTTCGAT TTCCCCGAGG GCTGGGCGGC CCTGTCCGGC GCCCTGCTGG CGGCGGGCGA GGACGCGGAC GCCATCGCGG CGGCGCAGCA CACCCTGTCC CGCCACGCGC CGCCGGCCGG CATGGATGCG CTGTTCGGGC GACTGGCGGC CCGGGCGGGC CGGCCCGGCT GGTGCGGCCT GTCGGCATCG GGCCGGCTGC ATGGCGGTGG TGTCCGCGAC CCCGATATAT TCTTCGATGG GCAGCCGGTC CGATCGCGGG CCGGGGCGGA CGGGCCGGTG CTGCCGCCCT GCTGGCGCCT GGCCCGGCAG GTCGAGGTGC TGGCGGGCGG CGTGCCGCTG CTGGGCAGCC CGCTGGACCC GGCGGCGGTG CAGCGGACCG AGGGATTCGT CGAATCCGGG CCACGGGGCC TGTCGGGCTG GCTCTGGCAC CCGGCGGACC CGGACCATGT GCCCGAACTG CGCCTGCTGG ACGGGCGGAC CGGGCGCCTG CTGCTGCGCC AGGCGGTGCC CGAACCGGCG ACCGAGGTCA GCAGTGCGGT GCCGCTGGCC CGGCCGCGCG GCTTTGCCAT TCCGGCCGCC TGCCTGCCGG CCGGCCCGGT GCGCGTGCTG GGGTCCGACG GGCGCGACCT GCTGGGCAGC CCGCTGGCCA CGGCCTGCGA ACCGTGCGGG GGCGGCTGCC CGGGCGAAGC CGATTGCCCG AAACGCCTGC CTACCGGCCG GCCGCCCTGC CGCCCGCCGG ATGCGCCGCC GGCCCGTGGA CGCCCGGCGA TCGTGATTCC GGTCCATGGC GACCGCGCGG CCACGCTGGC CTGCCTGGAA TCGGTGCGCG ACAGCGTGGG GCCGGACGTG CCGGTGGTCG TGGTCGATGA CGCGACGCCG GTCGCGGCCC TGGCCGCCGA CCTGGACCGG CTGGCGGCGG CGGGGACGAT CCGGCTGGTC CGGCACGCGC GCACGCTGGG CTTCCCGGCC TCGGCCAATG ACGGGATGCG CGCGATGGCG GGGCACGACG TCGTGCTGCT GAACAGCGAT ACGCTGGTGG CGGGCGGCTG GCTGGCGGAA CTGGCCGACG TGGCCTATGG CGCGGCGGAT ATCGGCACCG TGACCCCGCT GTCGAACCAG GCCAGCATCT TCTCGTGGCC CGGACCGGAC CGGGCGCCGG ACGCGGAATT CTGCCCGGCG TTCGACCTGG CCCGGGTGCG GCACGTGATG GCGGCGGCGC GGGCGGCCAA TCGCGGGGTG GCGGTCGATG TGCCGACCGG GCACGGATTC TGCCTGTTCA TCCGCCATGA CTGCCTGGAT GCGTTCTGGC GGCTGCAGCC CGGCGCGCCG GATTCGGGCC CCTTGCGGGC CGACCTGTTC GCCCAGGGCT ATGGCGAGGA AAACGATTTC TGCCTGCGCG CCGCCGCCGG GGGGTGGCGG CATGTCGCCG CGCCCGGGGC GTTCGTCGGC CATGTGGGCG GGGCCTCGTT CGGTGCGGCG CGGCCGGACC TGCTGCGGCG GAACCTGGAC ATCGTGAACG CCCTGCACCC GGGCTATGAC GCGCGGGTCG CCGCCCATGT CGCCGCCGAC CCGCTGTTCG CGGCACGGCG GCGGCTGGAC CTGGTGTGCT GGCGGCGCGG CGGGGCCGGG TGGGATGGGG CGGTGCTGCT GGTCACGCAC GACCAGGGCG GCGGGGTGGA ACGCGTGGTG CGCGCCCGCG CCGCCGCCTG GCGCGCACAG GGCGCGCGGC CGGTGGTGCT GCGCCCCGCC CCCGGCGGCT GTCGGCTGGA GGACGTGCCG GACACGGCGT CCGGGCCGTC CGCCGCGGCG TTCGCCAGCC CGCGATTCCG CCTGCCCGGG GAAGGGGACA TGCTGCTGGC CGTGCTGCGC GCGTCGGGGG TCGATTCGGT GGAATGGCAT CATTCGCTGG GCCACGGGAT CGATGTCCGC ACGCTGGCCG CCTCGCTGGG CGTGCCGTAC GAGGTGCATG TCCATGACTA TGTGTGGTTC TGCCCCCGGG TGGCGCTGGT CGGGGCGTCG GGCCGCTATT GCGGCGAACC GGGACCGGCG GGATGCGCCG CCTGCCTCGC CCCCGGCGGC AGCCTGCTGG AGGACGGGGA GGGGACGATC CCCGATCGGC TGGCGCGCCA CCTCGCGCGG TCCCGCGCGG ACCTGCTGGG CGCGCGCGCC GTCATGGCGC CGTCGGATGA CGCGGCCCGG CGCATCGGGC GGCATTTTCC CGAAATCGCG GTCCGGGTGA CGCCGCTGGA GGACGACCGG CCCGGGATGG ACCTGGCATC CTTCGTCCGG ATCTATGGCG GGGTGGCTTG TGGCGGGGCG GCGCCGGCGG GACGCGCGCG GCCCGCCGGC CGGGTGCGCG TGGCGGTGCC CGGCGCGATC GGGGTCGAGA AGGGTTATGG CATCCTGCTG GAGGCCGCGC GGGACGCGGC GGCGCGGGAC CTGAATCTGG AATTCGTGGT GGTGGGCCAC ACGCCCGACG ACGACGCGCT GATCGGCACC GGCCGGGTGT TCGTCACCGG CCCATACCGC GAATCCGATT CCGTCGCCCT GCTGGCGGCG CAGGGGGCGG ACCTGGCGCT GCTGCCCTCG GTGTGGCCGG AGACCTGGTG TTTCGCGCTG GGGCTGGCGT GGCGGGCCGG CCTGCGGGCC GTGGTCTTCG ACCTGGGGGC CATGGCCGAA CGGGTGCGGC GCACGGGCCG GGGCCGGGTC CTGCCATTGG GTATGCCGAT TCACGAATTG AATACGTTTT TGCTCTCCTA TGCCTGA
|
Protein sequence | MGRFRPQGVT GADRAAWQAW ADARAQAAFR RARLLEDAGD PDGAWRWYER ANRLAPDSPN VMFPLAVARL RGGDTAGSVR LLRELTRRFD FPEGWAALSG ALLAAGEDAD AIAAAQHTLS RHAPPAGMDA LFGRLAARAG RPGWCGLSAS GRLHGGGVRD PDIFFDGQPV RSRAGADGPV LPPCWRLARQ VEVLAGGVPL LGSPLDPAAV QRTEGFVESG PRGLSGWLWH PADPDHVPEL RLLDGRTGRL LLRQAVPEPA TEVSSAVPLA RPRGFAIPAA CLPAGPVRVL GSDGRDLLGS PLATACEPCG GGCPGEADCP KRLPTGRPPC RPPDAPPARG RPAIVIPVHG DRAATLACLE SVRDSVGPDV PVVVVDDATP VAALAADLDR LAAAGTIRLV RHARTLGFPA SANDGMRAMA GHDVVLLNSD TLVAGGWLAE LADVAYGAAD IGTVTPLSNQ ASIFSWPGPD RAPDAEFCPA FDLARVRHVM AAARAANRGV AVDVPTGHGF CLFIRHDCLD AFWRLQPGAP DSGPLRADLF AQGYGEENDF CLRAAAGGWR HVAAPGAFVG HVGGASFGAA RPDLLRRNLD IVNALHPGYD ARVAAHVAAD PLFAARRRLD LVCWRRGGAG WDGAVLLVTH DQGGGVERVV RARAAAWRAQ GARPVVLRPA PGGCRLEDVP DTASGPSAAA FASPRFRLPG EGDMLLAVLR ASGVDSVEWH HSLGHGIDVR TLAASLGVPY EVHVHDYVWF CPRVALVGAS GRYCGEPGPA GCAACLAPGG SLLEDGEGTI PDRLARHLAR SRADLLGARA VMAPSDDAAR RIGRHFPEIA VRVTPLEDDR PGMDLASFVR IYGGVACGGA APAGRARPAG RVRVAVPGAI GVEKGYGILL EAARDAAARD LNLEFVVVGH TPDDDALIGT GRVFVTGPYR ESDSVALLAA QGADLALLPS VWPETWCFAL GLAWRAGLRA VVFDLGAMAE RVRRTGRGRV LPLGMPIHEL NTFLLSYA
|
| |