Gene Cpin_3372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3372 
Symbol 
ID8359538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4134654 
End bp4135664 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content49% 
IMG OID644965545 
Productmetalloendopeptidase, glycoprotease family 
Protein accessionYP_003123040 
Protein GI256422387 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.115677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.185322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTAA AAATACTGGC CATAGAGTCA TCGTGCGACG AGACAAGTGC TTCCGTACTG 
GCCGATGGAA AGATCCTGTC TAATTTTATT GCCAATCAAA CTATTCATGA GCAATACGGT
GGCGTAGTGC CTGAACTGGC CTCCCGTGCT CACCAGGAAA ATATTGTCCC TGTCGTGGAC
CAGGCCCTGA AAGTGGCGGG TGTACGTAAG GAGGAACTGA ACGCCATTGC CTTTACCCAG
GCGCCGGGTC TGATCGGTTC CCTGCTGGTA GGTAGTTGTT TTGCTAAATC CATGGCGCTG
GCCCTGGACG TTCCGCTGAT AGCTGTACAC CACATGCAGG CGCACGTACT GGCTAATTTC
ATTGGTGAGG ATAAGCCTTC TTTCCCTTTC CTCTGTCTGA CAGTGTCTGG TGGTCATACC
CAGATCGTAC GTTGTGACAG TCCTTTGCAG ATGAAGGTAA TCGGTGAAAC ATTAGACGAT
GCCGCTGGTG AAGCGTTTGA TAAAAGTGCC AAATTACTGG GCCTGCCATA TCCTGGTGGT
CCGCTGATAG ATAAATACGC CCGTGAGGGC AATCCGGACA GGTTCAAATT CCCTGAACCA
CAGATCCCGG GACTGAACTT CAGCTTTAGC GGTCTGAAGA CCTCCATTCT CTACTTCCTC
CAGGAACAGC AACAGAAAGA TCCGCAGTTC GCCGAAAATA ACATGGCGGA TATCTGTGCT
TCTATCCAGC ATCGTATCGT CAGTATCCTG ATGAACAAAC TGGTAAAAGC ATCAAAGGAA
ACGGGTATCA AAGAGATCGG TATAGCAGGT GGCGTGAGCG CTAATTCCGG TTTACGTAAC
GCTTTACAAC AGTATGGTGA AAAGTATGGC TGGAAAACCT ACATACCGAA ATTTGAATAC
TGTACAGATA ATGCTGCGAT GATTGCCATG ACCGCCTGGT ATAAATATCA GGCAGGGGAG
TTTGTAGGAC TGGATGCCGT TCCAGGCGCC AGAGCAGGTT TTGAACATTA G
 
Protein sequence
MSVKILAIES SCDETSASVL ADGKILSNFI ANQTIHEQYG GVVPELASRA HQENIVPVVD 
QALKVAGVRK EELNAIAFTQ APGLIGSLLV GSCFAKSMAL ALDVPLIAVH HMQAHVLANF
IGEDKPSFPF LCLTVSGGHT QIVRCDSPLQ MKVIGETLDD AAGEAFDKSA KLLGLPYPGG
PLIDKYAREG NPDRFKFPEP QIPGLNFSFS GLKTSILYFL QEQQQKDPQF AENNMADICA
SIQHRIVSIL MNKLVKASKE TGIKEIGIAG GVSANSGLRN ALQQYGEKYG WKTYIPKFEY
CTDNAAMIAM TAWYKYQAGE FVGLDAVPGA RAGFEH