Gene Information |
Locus tag | Cpin_4994 |
Symbol | |
ID | 8361170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | - |
Start bp | 6243620 |
End bp | 6245917 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644967143 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_003124628 |
Protein GI | 256423975 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
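The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 6243620
end_bp = 6245917

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# One codon is 3 bp. Assumption: the last codon is a stop codon,
# so it does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2298 0 765
```

Under these assumptions the result agrees with the listed Gene Length (2298 bp) and Protein Length (765 aa).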
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00424832 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.110436 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGCGG TCTTATTCAG CGTATTGCTG AGTATATCTG TAGCCCGTGC GCAGGAGGCG GACCCTGCAC GTTATGCAAT TGTCCCTTAC CCTCAGCAAC TGGTGCCCGC TGCAGGCGAC TTTGTGATTA CTTCAAAGAC CAGACTGGTA TTACCTTCCG ACAAAACATT TTTCAGCAAG GAAGCCGCTC AGTTACAGGC ACTGATCAGA CAGGGGCTCG GACAGGCGTT ACCTGTTGCC GCAAAAGCAG GCAATGGTAC CATCGTGCTG AAACAGGACA CTCAGCTGGC AGGAGAAGAA GATTATACAT TAGCGGTAAC GCCACAGCAG GTGACCATCG CCGCGAAAAC ACCGACAGGT ATGTTCCGTG CGGTGCAGAC GCTACGTCAG CTGATGCCTG TCGCTATTGA AGGGGTGAAA AACGCAAAGC TGGCGAAGAT GGCGATACCT GCAGTAAAGA TCACAGATCA TCCTGTATAC GGATGGCGTG GTATGCACCT GGATGTGTCC AGACATTTTT TCTCTGTAGA TTATCTGAAA AAGTTCATCG ACCTGCTCGC CCTGTACAAA ATGAACAAGT TTCATATTCA TCTTACGGAT GACCAGGGCT GGCGTATCGA GATTAAAAAA TATCCTTTAC TGACAGAACA GGGCGCCTGG AGAGAATTTA ATAACCAGGA CTCTGTATGT ATGGAAAAGG CGAAGACCAA TCCGGATATG GCGATTGATC AATCCCATAT CATCCGGAAA AATGGCAAAA CGTTGTATGG CGGTTTCTAT ACCCAGGCAC AGATGAAAGA GGTGATCGCC TATGCAGCGG CACGTCACAT CGATATGATC CCTGAGATCG ATATGCCGGG TCATATGATG GCGGCTATCA AAGCATACCC TTATCTGAGC TGCGAAGGCG GTTCTACCTG GGGGCCATTA TTTACCACAC CGATCTGCCC ATGTAATGAG AAGACTTTTG AATTTGCAGA GAATGTTTAT TCAGAGATCT TCGCTTTATT CCCTTCTGAA TACGTTCACC TGGGCGCTGA TGAAGTAGAA AAAAGCAGTT GGGCAAAATC TCCGCTTTGT GAAGGTGTGA TGAAGGCTAA TAACCTGAAG ACTGTTGAAG AACTGCAAAG CTATTTTGTC AAGAGAATGG AGAAATTTTT CAATTCAAAA GGCAAAAAGC TGATCGGCTG GGATGAGATC CTGGAAGGTG GTATCAGTGA AACCGCTATA CTTATGTACT GGCGTAGCTG GGTACCTGAT GCGCCTGTAA AGGCAGCGAA AAACGGGAAC AAGGTGATTA TGACACCAGG TAATCCATTG TATTTCGATG GTATTCCTGA CCGCAATTCC ATCTCTAATG TATATCACTT CGACCCGGTA CCAAAAGGAC TGACAGCAGA TGAAGCAGCG CATATCATCG GTGCACAGGC AAATACCTGG ACGGAATGGA TTCCTTCTGA AAAACGTGCC GACTTTATGA TCTTCCCTCG TATGACTGCA CTGGCAGAAG TACTCTGGAC ACATCGTCAG GATTACGATA ACTATCTGTT GCGTCTGCAT CAACAATACA AGCGCCTGGA TCAGCTGAAG GTGCATTACC GTATGCCTGA CCTGGATGGT TTTACGGATG ATAACGTACT GGTAGGTAAA ACGGTATTGA AATTACAAAA GCCGGCTCCG GATATCAGTA TCCGTTATAC GACGGATGGA ACCGCGCCGA CGATGTCTTC ACCTGAATTA CCGGAAGCGT TTATCGTACC GGGGCCGGGT ACCATTAAAC TGGCTTCTTT CAGTCCGTCC 
GGCAGCAGCA GCGATATTTA TACGCTGAAC TACCGTAAGC AGTCTTTCTT CCCTCCGGTG AATGTGGAGG GTTTACAGAG CGGCTTACAG GTACAGTATT TTGGTAGCTC CTATAAAGGC GTCACCAAAC TGCCTGATAC TGCAGACAGC ATCGTTCATG TCAGCAATGC GATCATTCCG GAAGGAATGG GTAAAGGCGG TAAAGCTTTT GGTGCGAAAC TGGTAGGGTA TATTGAAGTG CCTGAAACAG CGATTTACAG CTTCTTCCTG ACAGCAGACG ATGGTGCAAA CCTCTATATC GAAGGAGATA AAGTAGTAGA TAATGACGGC TGGCACGCGC CTGTACAGAA GAGCGGACAG GTAGCCTTGC AGAAAGGGTT GCATCCGTTT GAACTGAAGT TTGTAGAAGG TGGCGGCGGA TACACGTTGA AACTCGAATA CAGGGTGAAT GGTGGTAAAA TCCAGGCAGT GCCGGATAGC TGGTTCAAGA GAAAATAA
|
Protein sequence | MGAVLFSVLL SISVARAQEA DPARYAIVPY PQQLVPAAGD FVITSKTRLV LPSDKTFFSK EAAQLQALIR QGLGQALPVA AKAGNGTIVL KQDTQLAGEE DYTLAVTPQQ VTIAAKTPTG MFRAVQTLRQ LMPVAIEGVK NAKLAKMAIP AVKITDHPVY GWRGMHLDVS RHFFSVDYLK KFIDLLALYK MNKFHIHLTD DQGWRIEIKK YPLLTEQGAW REFNNQDSVC MEKAKTNPDM AIDQSHIIRK NGKTLYGGFY TQAQMKEVIA YAAARHIDMI PEIDMPGHMM AAIKAYPYLS CEGGSTWGPL FTTPICPCNE KTFEFAENVY SEIFALFPSE YVHLGADEVE KSSWAKSPLC EGVMKANNLK TVEELQSYFV KRMEKFFNSK GKKLIGWDEI LEGGISETAI LMYWRSWVPD APVKAAKNGN KVIMTPGNPL YFDGIPDRNS ISNVYHFDPV PKGLTADEAA HIIGAQANTW TEWIPSEKRA DFMIFPRMTA LAEVLWTHRQ DYDNYLLRLH QQYKRLDQLK VHYRMPDLDG FTDDNVLVGK TVLKLQKPAP DISIRYTTDG TAPTMSSPEL PEAFIVPGPG TIKLASFSPS GSSSDIYTLN YRKQSFFPPV NVEGLQSGLQ VQYFGSSYKG VTKLPDTADS IVHVSNAIIP EGMGKGGKAF GAKLVGYIEV PETAIYSFFL TADDGANLYI EGDKVVDNDG WHAPVQKSGQ VALQKGLHPF ELKFVEGGGG YTLKLEYRVN GGKIQAVPDS WFKRK
|
| |
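The gene and protein sequences above can be spot-checked against each other. The sketch below is illustrative only: it assumes the gene sequence is listed in coding orientation (it begins with ATG even though the strand is "-"), and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code. The helper `translate` and the constant names are hypothetical, and only the first three and last two 10-base blocks of the gene sequence are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(spaced_dna: str) -> str:
    """Translate DNA written in space-separated blocks, reading frame 0."""
    dna = "".join(spaced_dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three and last two blocks of the gene sequence in this record.
first_blocks = "ATGGGTGCGG TCTTATTCAG CGTATTGCTG"
last_blocks = "TGGTTCAAGA GAAAATAA"

print(translate(first_blocks))  # MGAVLFSVLL
print(translate(last_blocks))   # WFKRK*
```

The two results match the start (MGAVLFSVLL) and end (WFKRK) of the listed protein sequence, with the trailing `*` corresponding to the TAA stop codon.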