Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1261 |
Symbol | |
ID | 4710726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1368955 |
End bp | 1369995 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639855734 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001002838 |
Protein GI | 121998051 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGGCC CGCTCATGAT TGGAATCGCG GGGGGTGAAC TCCCAGCGCA GGCGCGAGAG CTCCTGCGGC ATCCGGCCGT CGGTGGGGTC GTTCTCTTCA CGCGTAACTT TAAAAATACG CAGCAGCTTC ATGATCTGAC ACAGTCCATT CACGCCGTGC GTGAGCCGCC CCTGCTGGTC GCTGTCGATC AGGAGGGCGG CCGCGTGCAG CGCTTTCGGG AGGGCTTCAC CCCGCTGCCC GCCGCTTCCT GGTTCGGTCG ATTGTACGAT CTCGATCCCG ATCAGGCCCG GCAGAGTGCG CACCGGGCGG GGTGGGTGAT GGCCGCGGAA CTGCGCGCCT GCGGGGTGGA TCTGAGTTTC GCGCCGGTTT TGGATCTCGA TGGGGGGGTG AGCCACGTGA TCGGTTCTCG CGCCCTGCAC CGTGAGCCGG CCGCGGTGGC GCGGCTCGGT CACGCCTGGA TGCGCGGCAT GCGTCAGGCG GGGATGGCGG CGGTAGGCAA GCACTTCCCG GGGCACGGTT CGGTCGAGGC CGACAGCCAC GTGGCCTTGC CCTGCGATGA CCGTTCACTG GCAGAGATCG CGCGTCGGGA TCTGGTACCC TTTCAGCGCT TGGCGGCCTC GGGACTCCCA GGGGTCATGG CTGCGCACCT GGTCGTCCCC GATGTCGACG ACCGGCCGGC CGGTTTCTCC CCACGCTGGA TCGGTGACAT CCTGCGCCGC CGGGTCGGCT TCCTCGGTGC GGTCTTCAGT GATGACCTCG GCATGCGCGG GGCGGAGACC GCCGGTACCA TGTTCGACCG GGTGGATGCG TGCCTGGGGG CGGGCTGCGA TGTGGCGCTG GTCTGCGATC CGGCCGAGGC CGAGGCGCTG CTCGGCGAGG CGTCCGGGGA TCGCTGGCTG GATCCCACGT CGGCGCTGCG TCTGGTACGC ATGCACGGCC GACCGGCGCC CGGGTGGCCC GAGTTACGGG CGGACGGCGC CTACCAGACA GCGGTCTGCG AATTGACCGA GGGCGTTCCC GGGTTCGGGG CGCCGGGGTG A
|
Protein sequence | MLGPLMIGIA GGELPAQARE LLRHPAVGGV VLFTRNFKNT QQLHDLTQSI HAVREPPLLV AVDQEGGRVQ RFREGFTPLP AASWFGRLYD LDPDQARQSA HRAGWVMAAE LRACGVDLSF APVLDLDGGV SHVIGSRALH REPAAVARLG HAWMRGMRQA GMAAVGKHFP GHGSVEADSH VALPCDDRSL AEIARRDLVP FQRLAASGLP GVMAAHLVVP DVDDRPAGFS PRWIGDILRR RVGFLGAVFS DDLGMRGAET AGTMFDRVDA CLGAGCDVAL VCDPAEAEAL LGEASGDRWL DPTSALRLVR MHGRPAPGWP ELRADGAYQT AVCELTEGVP GFGAPG
|
| |