Gene Information |
Locus tag | Gdia_2999 |
Symbol | |
ID | 6976433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 3275377 |
End bp | 3278343 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643392507 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002277344 |
Protein GI | 209545115 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
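
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table.
start_bp = 3275377
end_bp = 3278343
gene_length_bp = 2967
protein_length_aa = 988


def span_length(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def residues_from_cds_length(cds_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both reported lengths agree with the reported coordinates.
assert span_length(start_bp, end_bp) == gene_length_bp
assert residues_from_cds_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 3278343 - 3275377 + 1 gives the 2967 bp gene length, and 2967 / 3 - 1 gives the 988 aa protein length.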
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACACAT CTGTTCCATG TGTTGGTCCG GCCTCCTTCT GGGCACCACA GCATATCCTG GAATCGGCAT GGCTGGAGCA CGCACCGTTC GCGTTCTGGC TGATGGAGAA GGTTCAGCCG GCCGTCTTCG TCGAACTGGG GACGCATGCC GGGTTCTCGT TCCTCGCGTT CTGCCAGGCG GTCCAGCGGC TCAGGCTGCC GACACGGTGC TATGCGGTCG ATACCTGGGC GGGGGACGAG CACGCCGGCT TTTACGGCGA ACAGGTATTC AACACCCTGT CCGGCCTTCA GGGGCAGCAT TATGCGGGAT TCTCGCGACT CATCCGCGCG TATTTCCACG ATGCGCTCGT CCATTTCACG GACGGCGAGA TCGATCTCCT GCACATCGAC GGACGACACC GCTACGAAGA TGTCCTCGAG GACTATACGA CCTGGCTGCC GAAAATGTCC GAACACGGCG TGGTGCTGTT CCACGACATC AACGTTCGGG AAGGCGATTT CGGGGTCTGG CGCCTGTGGG AGGAACTGCG CGCCAGACAC CCGTCATTTG AATTCTTTCA CGGGCACGGG CTGGGCGTAC TCTGTCCGGG TTCGCGCGTG CCGCCCGGAT TGCGCCCGCT GCTGGAGAGC GGCACCGAGG CCCGCGTGGC GATACGCGAA GCCTACTCAC GGCTGGGCGC CGCCGTTTCG ACCCAGTACG TGCTGGAACG GGCCCACAGG GAACTGAATG CCGAAATCGG CGCCTTGCAT CAGCAACTGG CGACCCGTGC CCAGGACGAA GAGCGGCTGC GCGCGGATCT GTCGTCGATC CAGCACGAAC GCGACAGCAT CCAGAAGGAT CTTTCCGCGA ATATCGACGA CCTGAACCAG CAGTTGGATG CCCGTGACCA GGAGGACGAG CGCCTGCGTA CCGACCTGTC GTCGGTTCAA GGCGAACTGT CCATTACCCA ACGGGATCTT TCCAGCGTCC AGAACGTGCT GGCGCTGACG AATACCGAGG TCGCGCAGAT CCGGGCCAGC ACGACCTGGC GCGCCACGAA GGTCGCCCGG GACGTGGGGT CGCGCCTTTC GCCCCGTGTG CGTCACCAGT TGCGGCGCGG TGCCAAGGCG GTGTGGTGGG CCATGACGCC CCATCGGATG CCCGCGCGAC TCCGTTTCCG CCGCGCGCGC GCGGTACAGC ACGCCGTCTT CAGCGAGACG CCAGGCGACC CGTCCTGCCC GCACATCGCC TATCAGCCCT GGCCGGCATC ATCCGGCACC GCACTGCCGA ACGGGATGCC GCAAGGCACC TATCGGCTCG CCTCCGACCC GTCGGGCTAC GTCTTCGTAC CCCGGCGGCG CCCCGACAAT CTGGACGCGC AGATCGCGCG CCTGGCGAAG CGCCCCGCAT TCTCGATCGT CGTGCCGCTG TACAACACGC CGGACGATCT GTTCCAGCGA ATGGTGGGCT CCGTCCTGGC GCAATGGTAT CCGCACTGGG AACTGATCCT CGTCGACGAC AAGAGTCCGC AGCAATCCGT GCGCGACAAT GCGTCGAAGC TCGTCGATCC CCGGATCAGG ACCATTCTGC TGGAAAGCAA CATGGGAATA TCGGGTGCGA CCAATCGGGG GCTGGCCGAA GCCGGCGGCG ACTACATCGT GTTCCTCGAT CACGACGACG AACTGACCGA CGACTGCCTG TTCGAACTGG CGAAATGCAT CGACGCGGAA GATCCCGACT ACGTCTACAG CGACGAGGAC AAGATCGAGC CGGACGGGCG TTTCAGCCAG CCGTTCTTCA AGCCCGACTG GTCGCCCGAT 
ACGCTGATGA GCACGATGTA CACGTGTCAC GTGTCGTGCG TGCGCCGCGC GCTGCTCGAA ACGGTGGGCG ATCTGCGATC GGAATTCGAC GGAAGTCAGG ATTGGGATTT CGTCCTGCGC GTGACGGAAG CGGCCAAACG CATCTCTCAC GTGCCGAAGG TGCTCTATCA CTGGCGTATC ATCCCGCAAT CAGTGGCTTC AGATTTGAAT GCGAAGCCTT ATGCGGTCGA CGCGGGCCGT CGCGCCCGCA TGGCGGCGCT CGAACGTCGT GGACTGAAGG GAACGATCGA AGCCGTCCCG CAACTGGCGG GATATTTCCG CGTCAATTAT GACGTGCAGG GAACCCCGCT GGTATCCATC ATCATTCCCA CCAAGAACAA CGGGACCGTA TTGAAAAACT GTCTCGATTC CATTTTTGGA CATTCGCATT ACCGAAACTT CGAAATCGTC CTGCTTGATA ATGGTTCAAC CGACGCGGCA ACGGTGAATT ACCTGGACTC CCTTCACGCC AACCCGAACG TGCGGGTGAT AAGGCATGAC GCACCGTTCA ACTATTCCGA GATCAACAAT ATCGGCGTCG GCGATGCGAA GGGCAGCCTA TTGCTGTTCC TGAACGACGA CACTCAGGTG ATCTCACCTG ACTGGATCGG ACGGATGGCC GGATATGCGC AACTTACGCA TGTCGGCGCT GTCGGTGCCA AGCTGTTGTA TCCAGACAGC CGGAAAATCC AGCATAGTGG CGTCCTGAAT CTGGCGGATG GTCCCAATCA CGCCTTTTGG TCTGCCGACG CATACACTCC GGGCTATTTC GCCCGCAATT TGCTCGAATA TGACTGGATC GCCGTGACCG GAGCCTGCCT CATGATCGAG CGAACGAAAT TTGATGCTGT CGGCGGGTTC GACGAGTCCT TCCCGATCGC CTATAACGAC GTCGATCTGT GTTTCCGTCT GGTAGAACAC GGCTTCTATA ATGTGGTGTG TCCTGGCGCT GAGCTTTTCC ATTATGAATC TCTAAGCCGA GGAAATGACA ACAAAAACAA AGAAAAACGG CGACGCCTCG ATCAGGAAAA AAATCGGCTT TATTATAAAA ATCCACATTT CCTGATGCAC GATCCTTTCT ACAATCCCAA TTTGGGACCC AATGATTTTT ATTTTTCGCT TTCATGA
|
Protein sequence | MDTSVPCVGP ASFWAPQHIL ESAWLEHAPF AFWLMEKVQP AVFVELGTHA GFSFLAFCQA VQRLRLPTRC YAVDTWAGDE HAGFYGEQVF NTLSGLQGQH YAGFSRLIRA YFHDALVHFT DGEIDLLHID GRHRYEDVLE DYTTWLPKMS EHGVVLFHDI NVREGDFGVW RLWEELRARH PSFEFFHGHG LGVLCPGSRV PPGLRPLLES GTEARVAIRE AYSRLGAAVS TQYVLERAHR ELNAEIGALH QQLATRAQDE ERLRADLSSI QHERDSIQKD LSANIDDLNQ QLDARDQEDE RLRTDLSSVQ GELSITQRDL SSVQNVLALT NTEVAQIRAS TTWRATKVAR DVGSRLSPRV RHQLRRGAKA VWWAMTPHRM PARLRFRRAR AVQHAVFSET PGDPSCPHIA YQPWPASSGT ALPNGMPQGT YRLASDPSGY VFVPRRRPDN LDAQIARLAK RPAFSIVVPL YNTPDDLFQR MVGSVLAQWY PHWELILVDD KSPQQSVRDN ASKLVDPRIR TILLESNMGI SGATNRGLAE AGGDYIVFLD HDDELTDDCL FELAKCIDAE DPDYVYSDED KIEPDGRFSQ PFFKPDWSPD TLMSTMYTCH VSCVRRALLE TVGDLRSEFD GSQDWDFVLR VTEAAKRISH VPKVLYHWRI IPQSVASDLN AKPYAVDAGR RARMAALERR GLKGTIEAVP QLAGYFRVNY DVQGTPLVSI IIPTKNNGTV LKNCLDSIFG HSHYRNFEIV LLDNGSTDAA TVNYLDSLHA NPNVRVIRHD APFNYSEINN IGVGDAKGSL LLFLNDDTQV ISPDWIGRMA GYAQLTHVGA VGAKLLYPDS RKIQHSGVLN LADGPNHAFW SADAYTPGYF ARNLLEYDWI AVTGACLMIE RTKFDAVGGF DESFPIAYND VDLCFRLVEH GFYNVVCPGA ELFHYESLSR GNDNKNKEKR RRLDQEKNRL YYKNPHFLMH DPFYNPNLGP NDFYFSLS
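
The sequences in this record are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The sketch below is a hedged illustration: it strips the grouping whitespace, computes GC content, and checks that a sequence looks like a complete CDS. The helper names are hypothetical. The stop codon set is the standard one for translation table 11, and the toy input is the first two blocks of the gene sequence above. The 60% GC value in the table is presumably rounded, so an exact match should not be expected when the full sequence is used.

```python
def clean_sequence(raw):
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Stop codons used by translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq):
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input: the first two blocks of the gene sequence in this record.
fragment = clean_sequence("ATGGACACAT CTGTTCCATG")
print(len(fragment), gc_percent(fragment))
```

The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand of the replicon. Pasting the full gene sequence into `clean_sequence` and passing the result to `looks_like_complete_cds` is one way to confirm this.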