Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_2074 |
Symbol | |
ID | 7316464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 2202690 |
End bp | 2204867 |
Gene Length | 2178 bp |
Protein Length | 725 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643616967 |
Product | glycogen branching enzyme |
Protein accession | YP_002514141 |
Protein GI | 220935242 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR01515] alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.296286 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTTC AGGAAAAGTC CATGATCCCA TCAGAGATCC GCCGGATTCT TGATGCCCGT CATCATGACC CCTTTTCGGT CCTGGGATAC CACGAGATCG GCGGCGAGTT CCTGGTGCGC AGCTTCCTGC CCCAGGCCGC CGAGGCCTGG GTGGTGGAGG CCGGCGACGC CCCCATGGAA CGCCTGGAGG GCACGGACCT GTTCGAATGG CGCGGCGATG CACTGCCCCG CCCCTATCGA ATCCGCTGGA GCGACGGCCA CGGCCACGAA CACGTGGCCT GGGACCCCTA CGGCTTCGAG CCCCTGCTCT CCGACTTCGA CATCCACCTG TTCAACGAGG GTCGTCACTG GCATGCCTGG CGCCTCATGG GGGCCCATGT TCAGTCCGTG GGCGAGGTGG ACGGCGTGCG CTTCACCGTC TGGGCCCCCG GTGCGGAACG GGTCAGCGTG GTGGGCGACT TCAACCGCTG GGACGGACGC TGCCACCCCA TGCGGGTGCG CGGCGGCAGC GGCATCTGGG AGCTGTTCAT CCCCGGGCTC GAACCCGGCT GCCTGTACAA GTACGAGATC CGCAACCGCG ACAGTGGCGA GATCATGGTC AAGACCGACC CCTACGGCCA GAACTTCGAA CTGCGCCCCA AGACCGCCGG CGTGGTGCCC GCGCCCGCCG ACTACGACTG GCAGGACGGC GAGTGGCTTG CCCGGCGCCA CGGCGACGCC TGGCTGCACC GCCCCATGTC CATCTACGAG GTGCACCTGG GTTCCTGGCG GCGGGGACCG GAGGGGGAGT TTCTCGGCTA CCGTGAACTG GCCCATGCCC TTGTGGACTA CGTGAAGCAG CTCGGGTTCA CCCACATCGA ACTGCTGCCC GTCACCGAGC ACCCCTTCGA CGCCTCCTGG GGCTATCAGA CCACCGGCTA CTACGCCCCC ACCAGCCGCT TCGGCACCCC CGAAGACTTC CGCTATTTCG TGGATCACTG TCACGCCAAT GACATTGGCG TGATCCTGGA CTGGGCCCCC GGCCACTTCC CCAAGGACCG CCACGCCCTG GCCCGCTTCG ACGGCACGGC CCTGTACGAA CACGAGGACC CGCGCAAGGG CGAGCACCGG GACTGGGGCA CCCTGATCTT CAACTACGGC CGCAACGAGG TGCGCAACTT CCTGGTGTCC AGCGCCCTGT ACTGGGTGGA GGAATTCCAC ATCGACGGCC TGCGGGTGGA CGCGGTCGCC TCCATGCTCT ACCTCGACTA CTCGCGCGAA CCCGGCGACT GGGAACCCAA CCGCTACGGC GGCAACGAGA ACCTGGAGGC CATCGACTTC ATCCGGGAAC TCAACAACGT GGTCCAGGGC CAGCACCCCG GCGCCCTGAT CATCGCCGAG GAATCCACCT CCTGGCCCCA GGTCACCCGC CCCACCTGGC TGGGCGGCCT GGGTTTCTCC ATGAAGTGGA ACATGGGCTG GATGCACGAC ACCCTGGAGT ACTTCAAGAA GGACCCCATC CACCGCCACT ACCACCACGA CCAGCTCACC TTCGGCCTGC TGTACGCCTT CACGGAGAAC TTCGTCCTGC CCTTCTCCCA CGACGAGGTG GTGCACGGCA AACGCTCCCT GCTCTACCGC ATGCCCGGCG ACGAGTGGCA GCGCTTCGCC AACCTGCGCC TGCTGTACAC CTACATGTGG GCCTACCCCG GCAAGAAGCT GCTGTTCATG GGCAGCGAGT TCGGCCAGGG GGATGAATGG GATGCCGCGA ACCAGCTGGA CTGGTACGTG CTCGACTACC CCCTGCACCA GGGCATGCAG CACCTGGTGG GCGACCTCAA CCGTCTGTAC CGGGACCACC CCGCCCTGCA CCAGCACGAC TTCGACTGGC AGGGCTTCGA GTGGATCGAC TGCCACGACG CCGCCCAGTC GGTGCTGAGC TTCCTGCGCA AGGACGACGA TGAACTGATG GTGGTGGCCC TGAGCTTCAC CCCCGTGCCC CGGGAGGGCT ACCGCCTGGG CGTGCCCCGC CCCGGCCGCT ACCAGGTGGT CCTCAACTCC GACTCCAGCC ACTACGGCGG CAGCAACCTG GGCCAGCCCG CCGCCCAGAG CGAGGACATC CCCTGGATGG GCCACCCCCA CTCCATCGTC CTGACCCTGC CGCCCCTGGG GGGGCTGATC CTGCGGCACA GCGGCTGA
|
Protein sequence | MKVQEKSMIP SEIRRILDAR HHDPFSVLGY HEIGGEFLVR SFLPQAAEAW VVEAGDAPME RLEGTDLFEW RGDALPRPYR IRWSDGHGHE HVAWDPYGFE PLLSDFDIHL FNEGRHWHAW RLMGAHVQSV GEVDGVRFTV WAPGAERVSV VGDFNRWDGR CHPMRVRGGS GIWELFIPGL EPGCLYKYEI RNRDSGEIMV KTDPYGQNFE LRPKTAGVVP APADYDWQDG EWLARRHGDA WLHRPMSIYE VHLGSWRRGP EGEFLGYREL AHALVDYVKQ LGFTHIELLP VTEHPFDASW GYQTTGYYAP TSRFGTPEDF RYFVDHCHAN DIGVILDWAP GHFPKDRHAL ARFDGTALYE HEDPRKGEHR DWGTLIFNYG RNEVRNFLVS SALYWVEEFH IDGLRVDAVA SMLYLDYSRE PGDWEPNRYG GNENLEAIDF IRELNNVVQG QHPGALIIAE ESTSWPQVTR PTWLGGLGFS MKWNMGWMHD TLEYFKKDPI HRHYHHDQLT FGLLYAFTEN FVLPFSHDEV VHGKRSLLYR MPGDEWQRFA NLRLLYTYMW AYPGKKLLFM GSEFGQGDEW DAANQLDWYV LDYPLHQGMQ HLVGDLNRLY RDHPALHQHD FDWQGFEWID CHDAAQSVLS FLRKDDDELM VVALSFTPVP REGYRLGVPR PGRYQVVLNS DSSHYGGSNL GQPAAQSEDI PWMGHPHSIV LTLPPLGGLI LRHSG
|
| |