Gene Information |
Locus tag | Tgr7_1754 |
Symbol | |
ID | 7315776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1855366 |
End bp | 1856409 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643616645 |
Product | beta-N-acetylhexosaminidase |
Protein accession | YP_002513823 |
Protein GI | 220934924 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
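The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the CDS ends with one stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values copied from the record for Tgr7_1754.
gene_len = feature_length(1855366, 1856409)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)
```

With the values from this record the two results agree with the listed Gene Length (1044 bp) and Protein Length (347 aa).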
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACTGG GCCCCGTCAT GCTGGACCTG GAGGGCACCG CGCTCACCGA GACGGAACGC CGCCTGCTGA CCCATCCCCG GGCCGGTGGG GTGATCCTGT TCACCCGCAA CTTCGAGTCC CTGGGACAAC TCACGGAACT GCTGCGCGAG ATCCATGCCC TGCGCACGCC TCGGCTGCTG GTGGCCGTGG ACCACGAGGG TGGACGGGTG CAGCGCTTTC GCGAGGGCTT CACGCGCCTG CCAGCCGCAG CCCGCTTCGG CGAGCAGTAC GACCGCAACC ATGCCCGTGG CAGGGAACTG GCGCGCATGG CCGGCTGGCT GATGGCCGCC GAGCTGCGCG CCGTGGGCGT GGATTTCAGC TTTGCCCCGG TGCTGGACCT GGCCCACGGG GTCAGCGGCG TGATCGGCGA CCGGGCCTTC CACCGGAATC CCGAGGTGGT GGCAGACCTG GCGCATCACT ATATGAGCGG CATGCAGCAC GCGGGCATGG CCGCCGTGGG CAAGCACTTT CCAGGGCATG GCGGGGTGCG CGAGGACTCC CATCTCGCCC TGCCGGTGGA CCGGCGCACA CCGGCGGATC TCTACACGGA CATCCTGCCG TTTGAACGCA TGGTGCGTTT CGGCCTGGCG GGGATCATGC CCGCCCACGT GGTCTACGAA CGCTGCGACC CCCTGCCTGC GGGCTTTTCC AGCTACTGGC TGCGCGGCGA ACTGCGCGAC CGGCTGGGCT TTGAAGGCGT GATCTTCAGC GACGACCTGA GCATGGCCGG CGCCGAGTGC ATGGGAGATT ACCCGGACCG GGCCCGCGCG GCACTCAAGG CCGGCTGCGA CATGGTGCTG GTCTGCAACC ATCCGGAACA GGCTGCCCGG GTGCTGGACG CCCTGGACGA CGAACCAGAC CCGGTCTCCA TCGCCCGGCT GGCCCGCATG CACGGACGCA AGGGCATGAC CTGGGGGGAG CTGACGGACT CCGATGAGTG GCAGAAGGCC AGGGCGGTGA TCAACGCCCT GGACGATTCG CCCCTGATGG AACTGGATGT CTGA
|
Protein sequence | MSLGPVMLDL EGTALTETER RLLTHPRAGG VILFTRNFES LGQLTELLRE IHALRTPRLL VAVDHEGGRV QRFREGFTRL PAAARFGEQY DRNHARGREL ARMAGWLMAA ELRAVGVDFS FAPVLDLAHG VSGVIGDRAF HRNPEVVADL AHHYMSGMQH AGMAAVGKHF PGHGGVREDS HLALPVDRRT PADLYTDILP FERMVRFGLA GIMPAHVVYE RCDPLPAGFS SYWLRGELRD RLGFEGVIFS DDLSMAGAEC MGDYPDRARA ALKAGCDMVL VCNHPEQAAR VLDALDDEPD PVSIARLARM HGRKGMTWGE LTDSDEWQKA RAVINALDDS PLMELDV
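As a hedged illustration of how the two sequence fields relate, the sketch below translates a nucleotide string using the codon assignments of translation table 11 and computes GC content. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA, so no reverse complement is applied even though the gene lies on the minus strand) and that the space-separated 10-character blocks are display formatting only. The helper names are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 assigns the same
# amino acids as the standard code; it differs in the permitted start codons,
# which this sketch does not model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(raw.split()).upper()


def translate(raw: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from the record.
print(translate("ATGTCACTGG GCCCCGTCAT GCTGGACCTG"))
```

Applied to the first three blocks of the gene sequence, this yields the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.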