Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_34680 |
Symbol | ggpS |
ID | 7762363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3541105 |
End bp | 3543357 |
Gene Length | 2253 bp |
Protein Length | 750 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643806334 |
Product | glucosylglycerol-phosphate synthase |
Protein accession | YP_002800592 |
Protein GI | 226945519 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0380] Trehalose-6-phosphate synthase |
TIGRFAM ID | [TIGR01484] HAD-superfamily hydrolase, subfamily IIB [TIGR01485] sucrose-6F-phosphate phosphohydrolase [TIGR02398] glucosylglycerol-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.966141 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTACTTG CCACCGATCT CGATGGAACC TTTCTCGCCG GCGATCCGCA GGATCGCCTG AGTCTCTATC AGACCATCAC CGCCCACCCC GACATCCAGT TGGCCTATGT CACCGGCCGC AGCCTGGAGG CCGTTCTGCC GCTGCTGGCC GATCCCACCC TGCCGCAGCC GGACTACATC GTCGCCGACG TCGGCGCCAG CCTCTATCAC GGCGAAACCC TGCAGCCGAT CCAGCCCCTG CAGCACGACA TCGACGCCCG CTGGCCCGGC GAAAGCCAGA TCGCCGGCGC TCTCGCCGGC CTTCCCGACC TGCAGCGCCA GGACGTGCCC CAGGCGCGCC GCTGCTCCTA CTTCTGCACC CCGGAACGCG CCGCCGACCC GGCCCTCGAG GTCATCGCCG AACGCCTCGG CTGCGACCTG CTGTACTCGG CCGGACGCTA TCTGGACTTC CTGCCGCGCG GGGTGAACAA GGGCAGCAGC CTGCTGCGCC TGGTCGAGCA CCTCGGTCTC GATCCCGAGC AGGTGCTGGT CGCCGGCGAC ACCCTGAACG ACCTCAGCAT GCTCACCTGC GGCCTCAAGG GCGTCTGCGT CGGGCAGGCG GAGGAAAGCC TGCTCGAACA TACGCGCCAC TGCACCCGCG TCCTGCACGC CGACAGCCCG GGCTGCGGCG GCATCATCCA GGCCTTCGCC CACTTCGGCT TCCTCGGCGT GCACGGCTTC GCCGCCGAAA CCCGCAAGGC GACCGAGCCG GGACACGCCG AGCTGGTGAT CGTCTACCAC CGCCTGCCCT ACGAGGAATA CCGCGAGCAC GGCCAGGTGC AGCGCCGCCG GCCAAGCTCG CCCAACGGCA TCATCCCCAC CCTGCTCAGC TTCTTCGCCG ACGGGCGCAA GGGCTCCTGG GTCGCCTGGG CCGAGCACGA GGAAGGCCAA GGCCTCTTCG AGAGCCACAC CACGGTGGAC GCCGAGCGCT ACCCGCGCCT GACCGCCGCC CGCGTGCCGC TCTGCAAGGA AGACATCGAT ACCTTCTACA AGCGTTTCTC CAAGGAGGCC TTCTGGCCGA CCTTGCACAC CTTCTGGGAG CGCGCCGTCT TCAACGAGGA CGACTGGCAG GTGTTCCTGC GGGTCAACCG CGCCTTCGCC GAGCGCACCG CCCTGGAGGC CGCCGAGGGC GCCACCGTCT GGCTGCACGA CTACAACCTG TGGATGGTTC CGGCCTATCT GCGCGAACTG CGCCCGGACC TGAAGCTGGC CTTCTTCCAC CACACCCACT TCCCCTCGGC GGATGTGTTC AACGTGGTGC CCTGGCGCCG GCAGATCGTC GGCAGCCTGC TGCAATGCGA CTACATCGGC TTCCACATCC CGCGCCAGGT GGAGAATTTC GTCGACGTCG CCCGCGGCGT GGCCCCGCTG AAGGTGCTCG GCCGGCAGAA CTGCGCGCCG CGCTTCGTCA CCTACGGCTG CGCGGTCGGC CTGGAGCGCG TGACCACCGC CATCGATACC GGCATGCGCC AGGTCCGCCT CGGTGCCCAT CCGGTCGGGC TCGACCTCGA CCGGGTGCGC AACGCCCTGG CCAGCCCTTC CGTGCAGAAT CAGATGGAAC AGTTGCGTCG CGAGTTGAAC GGCGTGCGCC TGGTCCTCGC CGTGGAGCGC CTGGATTACA CCAAGGGCGT GCTGGAAAAG CTCAAGGCCT TCGAGCGGCT GCTGGCGGAC AACCCCGAGT TGCAGGGCAG AATCACCCTG GCCACCATCT GCGTCCCGGC GGCGCGGGAG ATGACCGTCT ACGACGAGTT GCAGGGGCAG ATCGAACAGG CCGTGGGACG CATCAACGGT CGCTTCGCCC GGGTCGGCTG GACGCCGGTG CAGTTCTTCT TCCGCAGCCT GCCGTTCGAG GAAGTGGTGG CCTGGTACGC CATGGCCGAC GTCATGTGGA TCACCCCGCT GCGCGACGGC CTCAACCTGG TGGCCAAGGA GTTCGTCGCC ACCCAGGGCC TGCTGGACGG CAGCGGCGTG CTGGTGCTCT CCGAATTCGC CGGCGCCGCC GCCGAACTCA AGGGCGCCCT GCTGACCAAT CCCCACGACA TCGCCGATCT AGTGCAGAAC TGCCATCTGG CGCTGAACCT GCCCAAGAGC GAGGCCCGGG CGCGGCTGCG CGAGCTGTTC GACATCGTCG CCTTCAACGA CATCCGCCGC TGGGGCGACG AGTTCCTCGG CGCGCTGGAG GAGCCCCGGA CGGACATCCG CGCCATCGCC TGA
|
Protein sequence | MLLATDLDGT FLAGDPQDRL SLYQTITAHP DIQLAYVTGR SLEAVLPLLA DPTLPQPDYI VADVGASLYH GETLQPIQPL QHDIDARWPG ESQIAGALAG LPDLQRQDVP QARRCSYFCT PERAADPALE VIAERLGCDL LYSAGRYLDF LPRGVNKGSS LLRLVEHLGL DPEQVLVAGD TLNDLSMLTC GLKGVCVGQA EESLLEHTRH CTRVLHADSP GCGGIIQAFA HFGFLGVHGF AAETRKATEP GHAELVIVYH RLPYEEYREH GQVQRRRPSS PNGIIPTLLS FFADGRKGSW VAWAEHEEGQ GLFESHTTVD AERYPRLTAA RVPLCKEDID TFYKRFSKEA FWPTLHTFWE RAVFNEDDWQ VFLRVNRAFA ERTALEAAEG ATVWLHDYNL WMVPAYLREL RPDLKLAFFH HTHFPSADVF NVVPWRRQIV GSLLQCDYIG FHIPRQVENF VDVARGVAPL KVLGRQNCAP RFVTYGCAVG LERVTTAIDT GMRQVRLGAH PVGLDLDRVR NALASPSVQN QMEQLRRELN GVRLVLAVER LDYTKGVLEK LKAFERLLAD NPELQGRITL ATICVPAARE MTVYDELQGQ IEQAVGRING RFARVGWTPV QFFFRSLPFE EVVAWYAMAD VMWITPLRDG LNLVAKEFVA TQGLLDGSGV LVLSEFAGAA AELKGALLTN PHDIADLVQN CHLALNLPKS EARARLRELF DIVAFNDIRR WGDEFLGALE EPRTDIRAIA
|
| |