Gene Cpin_4994 details

Gene Information

Locus tag: Cpin_4994
Symbol:
ID: 8361170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 6243620
End bp: 6245917
Gene Length: 2298 bp
Protein Length: 765 aa
Translation table: 11
GC content: 48%
IMG OID: 644967143
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_003124628
Protein GI: 256423975
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
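
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The sketch below checks that relationship, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and one stop codon at the end of the CDS.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinate range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(6243620, 6245917)
print(gene_len, protein_len)  # 2298 765
```

The result agrees with the listed Gene Length (2298 bp) and Protein Length (765 aa). The calculation does not depend on the strand.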


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00424832
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.110436
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGCGG TCTTATTCAG CGTATTGCTG AGTATATCTG TAGCCCGTGC GCAGGAGGCG 
GACCCTGCAC GTTATGCAAT TGTCCCTTAC CCTCAGCAAC TGGTGCCCGC TGCAGGCGAC
TTTGTGATTA CTTCAAAGAC CAGACTGGTA TTACCTTCCG ACAAAACATT TTTCAGCAAG
GAAGCCGCTC AGTTACAGGC ACTGATCAGA CAGGGGCTCG GACAGGCGTT ACCTGTTGCC
GCAAAAGCAG GCAATGGTAC CATCGTGCTG AAACAGGACA CTCAGCTGGC AGGAGAAGAA
GATTATACAT TAGCGGTAAC GCCACAGCAG GTGACCATCG CCGCGAAAAC ACCGACAGGT
ATGTTCCGTG CGGTGCAGAC GCTACGTCAG CTGATGCCTG TCGCTATTGA AGGGGTGAAA
AACGCAAAGC TGGCGAAGAT GGCGATACCT GCAGTAAAGA TCACAGATCA TCCTGTATAC
GGATGGCGTG GTATGCACCT GGATGTGTCC AGACATTTTT TCTCTGTAGA TTATCTGAAA
AAGTTCATCG ACCTGCTCGC CCTGTACAAA ATGAACAAGT TTCATATTCA TCTTACGGAT
GACCAGGGCT GGCGTATCGA GATTAAAAAA TATCCTTTAC TGACAGAACA GGGCGCCTGG
AGAGAATTTA ATAACCAGGA CTCTGTATGT ATGGAAAAGG CGAAGACCAA TCCGGATATG
GCGATTGATC AATCCCATAT CATCCGGAAA AATGGCAAAA CGTTGTATGG CGGTTTCTAT
ACCCAGGCAC AGATGAAAGA GGTGATCGCC TATGCAGCGG CACGTCACAT CGATATGATC
CCTGAGATCG ATATGCCGGG TCATATGATG GCGGCTATCA AAGCATACCC TTATCTGAGC
TGCGAAGGCG GTTCTACCTG GGGGCCATTA TTTACCACAC CGATCTGCCC ATGTAATGAG
AAGACTTTTG AATTTGCAGA GAATGTTTAT TCAGAGATCT TCGCTTTATT CCCTTCTGAA
TACGTTCACC TGGGCGCTGA TGAAGTAGAA AAAAGCAGTT GGGCAAAATC TCCGCTTTGT
GAAGGTGTGA TGAAGGCTAA TAACCTGAAG ACTGTTGAAG AACTGCAAAG CTATTTTGTC
AAGAGAATGG AGAAATTTTT CAATTCAAAA GGCAAAAAGC TGATCGGCTG GGATGAGATC
CTGGAAGGTG GTATCAGTGA AACCGCTATA CTTATGTACT GGCGTAGCTG GGTACCTGAT
GCGCCTGTAA AGGCAGCGAA AAACGGGAAC AAGGTGATTA TGACACCAGG TAATCCATTG
TATTTCGATG GTATTCCTGA CCGCAATTCC ATCTCTAATG TATATCACTT CGACCCGGTA
CCAAAAGGAC TGACAGCAGA TGAAGCAGCG CATATCATCG GTGCACAGGC AAATACCTGG
ACGGAATGGA TTCCTTCTGA AAAACGTGCC GACTTTATGA TCTTCCCTCG TATGACTGCA
CTGGCAGAAG TACTCTGGAC ACATCGTCAG GATTACGATA ACTATCTGTT GCGTCTGCAT
CAACAATACA AGCGCCTGGA TCAGCTGAAG GTGCATTACC GTATGCCTGA CCTGGATGGT
TTTACGGATG ATAACGTACT GGTAGGTAAA ACGGTATTGA AATTACAAAA GCCGGCTCCG
GATATCAGTA TCCGTTATAC GACGGATGGA ACCGCGCCGA CGATGTCTTC ACCTGAATTA
CCGGAAGCGT TTATCGTACC GGGGCCGGGT ACCATTAAAC TGGCTTCTTT CAGTCCGTCC
GGCAGCAGCA GCGATATTTA TACGCTGAAC TACCGTAAGC AGTCTTTCTT CCCTCCGGTG
AATGTGGAGG GTTTACAGAG CGGCTTACAG GTACAGTATT TTGGTAGCTC CTATAAAGGC
GTCACCAAAC TGCCTGATAC TGCAGACAGC ATCGTTCATG TCAGCAATGC GATCATTCCG
GAAGGAATGG GTAAAGGCGG TAAAGCTTTT GGTGCGAAAC TGGTAGGGTA TATTGAAGTG
CCTGAAACAG CGATTTACAG CTTCTTCCTG ACAGCAGACG ATGGTGCAAA CCTCTATATC
GAAGGAGATA AAGTAGTAGA TAATGACGGC TGGCACGCGC CTGTACAGAA GAGCGGACAG
GTAGCCTTGC AGAAAGGGTT GCATCCGTTT GAACTGAAGT TTGTAGAAGG TGGCGGCGGA
TACACGTTGA AACTCGAATA CAGGGTGAAT GGTGGTAAAA TCCAGGCAGT GCCGGATAGC
TGGTTCAAGA GAAAATAA
 
Protein sequence
MGAVLFSVLL SISVARAQEA DPARYAIVPY PQQLVPAAGD FVITSKTRLV LPSDKTFFSK 
EAAQLQALIR QGLGQALPVA AKAGNGTIVL KQDTQLAGEE DYTLAVTPQQ VTIAAKTPTG
MFRAVQTLRQ LMPVAIEGVK NAKLAKMAIP AVKITDHPVY GWRGMHLDVS RHFFSVDYLK
KFIDLLALYK MNKFHIHLTD DQGWRIEIKK YPLLTEQGAW REFNNQDSVC MEKAKTNPDM
AIDQSHIIRK NGKTLYGGFY TQAQMKEVIA YAAARHIDMI PEIDMPGHMM AAIKAYPYLS
CEGGSTWGPL FTTPICPCNE KTFEFAENVY SEIFALFPSE YVHLGADEVE KSSWAKSPLC
EGVMKANNLK TVEELQSYFV KRMEKFFNSK GKKLIGWDEI LEGGISETAI LMYWRSWVPD
APVKAAKNGN KVIMTPGNPL YFDGIPDRNS ISNVYHFDPV PKGLTADEAA HIIGAQANTW
TEWIPSEKRA DFMIFPRMTA LAEVLWTHRQ DYDNYLLRLH QQYKRLDQLK VHYRMPDLDG
FTDDNVLVGK TVLKLQKPAP DISIRYTTDG TAPTMSSPEL PEAFIVPGPG TIKLASFSPS
GSSSDIYTLN YRKQSFFPPV NVEGLQSGLQ VQYFGSSYKG VTKLPDTADS IVHVSNAIIP
EGMGKGGKAF GAKLVGYIEV PETAIYSFFL TADDGANLYI EGDKVVDNDG WHAPVQKSGQ
VALQKGLHPF ELKFVEGGGG YTLKLEYRVN GGKIQAVPDS WFKRK
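
As a consistency check between the two sequence blocks, the first 30 bases of the gene sequence can be translated and compared with the first 10 residues of the protein sequence. This is a minimal sketch, not the database's own method. The `CODONS` dictionary is a deliberately partial codon table covering only the ten codons in this prefix (their assignments are the same in translation table 11 as in the standard code), and `translate_prefix` is a hypothetical helper name.

```python
# Partial codon table: only the codons present in the 30-base prefix below.
CODONS = {
    "ATG": "M", "GGT": "G", "GCG": "A", "GTC": "V", "TTA": "L",
    "TTC": "F", "AGC": "S", "GTA": "V", "TTG": "L", "CTG": "L",
}


def translate_prefix(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring the spaces
    used to group the sequence into blocks of ten bases."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence above.
gene_prefix = "ATGGGTGCGG TCTTATTCAG CGTATTGCTG"
protein_prefix = translate_prefix(gene_prefix)
print(protein_prefix)  # MGAVLFSVLL
```

The translated prefix matches the start of the listed protein sequence. Translating the full gene would need a complete table 11 codon map, which is omitted here.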