Gene Cpin_5867 details


Gene Information

Locus tag: Cpin_5867
Symbol:
ID: 8362047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 7458330
End bp: 7459523
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 49%
IMG OID: 644968005
Product: Alpha-N-acetylgalactosaminidase
Protein accession: YP_003125486
Protein GI: 256424833
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
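
The start, end, gene length, and protein length fields are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 7458330
end_bp = 7459523

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1194 0 397
```

Under these assumptions the computed values match the listed Gene Length (1194 bp) and Protein Length (397 aa).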


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.326374
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.716064
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATACAA CCAGTAGAAG GCATTTTTTG AAGAATGCCG CCATTTTATC CCCTGTACTT 
ACCTTTCTGC CATCATATCT GACGGCAGGG ACAGGTTCCC GCTTACGTAC CGGTTTTATT
GGTCTTGGTC CCTGGGGACA ACAATATCTT GCGCATGCCC TGCAGCATAA AGATGTTGAC
GTCAGGGCTA TCTGTGACAC AGATCCGGCA GCTATCCGTC AAAGCCTGCA CTTGTTCAGC
GAAGCAGGTT ATAGCCGTCC GGAGATCTTT AAAAACAGCT ACAGCGACTT ATTATCCCGA
AAAGACATTG ATGCTGTGAT CATAGCGGCC CCCTGGCAAC TGCACTACGA AATAGCCAAG
GCAGCGATGC TGGCAGGCAA ACACGTAGCT TGTGGCGCCA TTATGGGTAC GACACTGGAA
GAGCACAAGG ACATTGTCCG CATCAGTGAG CAGACCGGCT GTCAGTACTT CACTTTGGAA
GAAGACAGCT ATCGTTCTGA TCTGCAGGCT GCGGCCAATA TGGCGAAAGC TGGTCTTTTC
GGTGAATTAC AGGCTGTTCG CGCAGGCGCA CGTTACGATG TACTGCCAGC TGTCCATGAA
GGCGAAGCTG CTCCTTATCC TGTTTATCCC GCACTGGCAG CCACCAGGAT GCTGGGCGTG
GGAAAAGACA ATCCTTTTGT GTCATTGGAG ATAGTAAAGC AAACGCAGAC GTTTGCCGTT
AACAAGCCAC ATGCAAAGAC AGGAGAAGCG AGGCTGATGT ACCAGTCAGG TGAGGTCAGT
ACCATCCGTT TAACGACTAA ACAGGGACAG GTTTTATCCT TACAGATGGA CCGCGGTCAG
CGGCAGCCAG TTTCAACGGG ATTCAGGATA ACGGGAACGG CAGGTTCCTG GGTAGATACA
TTCAATAGTA TTTATTTAAA AAATCAGCTA ACTTGCAACC AACTTTGGAA TGCAGGGAAA
CCCTATATCA AGGAGTACGC ACAGCCAACT GATACACGAC GTTACAAACG TCTGACAGCC
AATAAGGAAT GTGCCCTGGC CCTACAGGAG TTTATCGACC TGGCAAATCA ACCTGCCGAA
GCGTTATCCG TCTATACCGC TGCTACCAAT AGTATGATCG GCCCTTTAGC CGCACTTTCT
GCGCAACATA ACCACACGAT GAACTTTCCC GAGTTCAGCA ATTCAACTAT TTAA
 
Protein sequence
MHTTSRRHFL KNAAILSPVL TFLPSYLTAG TGSRLRTGFI GLGPWGQQYL AHALQHKDVD 
VRAICDTDPA AIRQSLHLFS EAGYSRPEIF KNSYSDLLSR KDIDAVIIAA PWQLHYEIAK
AAMLAGKHVA CGAIMGTTLE EHKDIVRISE QTGCQYFTLE EDSYRSDLQA AANMAKAGLF
GELQAVRAGA RYDVLPAVHE GEAAPYPVYP ALAATRMLGV GKDNPFVSLE IVKQTQTFAV
NKPHAKTGEA RLMYQSGEVS TIRLTTKQGQ VLSLQMDRGQ RQPVSTGFRI TGTAGSWVDT
FNSIYLKNQL TCNQLWNAGK PYIKEYAQPT DTRRYKRLTA NKECALALQE FIDLANQPAE
ALSVYTAATN SMIGPLAALS AQHNHTMNFP EFSNSTI
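
As a spot check that the protein sequence is the translation of the gene sequence, the sketch below translates the first 30 and the last 15 nucleotides of the gene. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons; the `translate` helper and the variable names are hypothetical, written for this illustration rather than taken from any particular library.

```python
from itertools import product

# Standard codon assignments, with codons ordered by base as T, C, A, G.
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

# First 30 nt and last 15 nt of the gene sequence listed in this record.
first_30 = "ATGCATACAA CCAGTAGAAG GCATTTTTTG"
last_15 = "AATTCAACTATTTAA"

print(translate(first_30))  # prints: MHTTSRRHFL
print(translate(last_15))   # prints: NSTI*
```

Both fragments agree with the start (`MHTTSRRHFL`) and end (`NSTI`) of the listed protein sequence, with the final `TAA` codon read as a stop.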