Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2440 |
Symbol | |
ID | 6376135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2602647 |
End bp | 2603768 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642684918 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001960816 |
Protein GI | 189501346 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAAA CCATAGTTCT CGTTTTCTCT CTCCTCCTGC TCACCGTGAA CGCATGTGCT GCGGATCTGC CCGAAGAGGA TGAGCTCAGC AGGAAAATAG GCCAGATGAT TATGGTCGGA TTCAGGGGAA CCTCTCTTCA GGAAGCCCCT GGGCTCATGC AGGATATAAC AAAACGTCAC CTTGGAGGCG TGGTTCTGTT CGACTATGAC GTTCCATCGA AATCTACCGG CAGAAACATC ACCTCCCGTG AACAACTCCA AAAACTGACG ACCGGGTTAC AGCAGACATC GGCAGTTCCT CTTTTTATCG CTATCGACCA GGAAGGCGGC AGAGTTTCCA GATTAAAGAC AACCTGCGGC TTCCCGGCGA GCGTCACGGC TGCCGGGCTC GGCAAGCTGA ACAACACTGA CAGCACCTTT CAGTCTTCGC TTGCAACAGC GAAAACCCTG CATAACAGCG GCATCAATGT CAATTTCGCT CCCGTTGTCG ATCTCAACAG CAATCCTGAA AACCCCGTTA TCGGTTCTCT TGAAAGAAGT TTTTCAGCGG ACGCTGCAAT TGTGTACAAA CATGCACGTG CAACAGTAGA GGCGTTTCAC ACACAAAACA TTATTGCCGC TCTCAAACAC TTTCCCGGCC ATGGCAGCTC AACTACCGAC ACCCATAAGG ATTTTACCGA TATTACCGGA AGCTGGAAAA AAAACGAACT TGACCCCTAC AGGCGTCTGA TTGAAAACGG CTATACCGAT CTTGTCATGA CTGCTCATGT CTACAACGCC AATCTTGACA ACCGTTACCC TGCGACACTC TCAAAGCAGA TCATATCAGG CCTGTTGCGC GATTCCCTTG GATTTAACGG CCCTGTCATA AGCGATGACA TGCAGATGCA GGCGTTAGCC GCGCATTACG ACCTTCGAAC CGCCATTACC CTGGCCCTCG AAGCCGGCGT AGACATTCTT CTCTTTGCCA ATAACTCGGT CTATGATCCA GATATTGCCG AAAAAGCCGT ATCGATCATC CGTTCACTGG TCGAAGAGGG AACGCTGAAC CCGAATCGTA TCGACGCCTC CTACAAGCGG ATCATGAAAC TGAAAACGCA CTACCTGAAA ACATCGACAT GA
|
Protein sequence | MKQTIVLVFS LLLLTVNACA ADLPEEDELS RKIGQMIMVG FRGTSLQEAP GLMQDITKRH LGGVVLFDYD VPSKSTGRNI TSREQLQKLT TGLQQTSAVP LFIAIDQEGG RVSRLKTTCG FPASVTAAGL GKLNNTDSTF QSSLATAKTL HNSGINVNFA PVVDLNSNPE NPVIGSLERS FSADAAIVYK HARATVEAFH TQNIIAALKH FPGHGSSTTD THKDFTDITG SWKKNELDPY RRLIENGYTD LVMTAHVYNA NLDNRYPATL SKQIISGLLR DSLGFNGPVI SDDMQMQALA AHYDLRTAIT LALEAGVDIL LFANNSVYDP DIAEKAVSII RSLVEEGTLN PNRIDASYKR IMKLKTHYLK TST
|
| |