Gene Cpin_1934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_1934 
Symbol 
ID8358085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp2360677 
End bp2361771 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content49% 
IMG OID644964122 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003121631 
Protein GI256420978 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.82848 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAT TTTCATTGAT ACGTGATATT GAACGCTACC TGGAGGGCGA AATGAACGAG 
CAGGAAAAAA CTGCCTTCGA AGCGCTCCGC CAGTCCGACC CGGCGGTCAA TGAGCAGGTG
CTGGCACATC AGCAGCTCAT TACACAGCTG GCGTATTTTG GACGCAGGTC AGCGTTGCAG
GAAAAAATGG ATAAGATCCA TGCTGGTCTG GCTAATAAAC ATGTATCAAT TGCACCTTCG
TCAACGCCGG AACCAAAGAA AGTATTCTCC ATCAGCAGAA GATTGCTCCT CAACATGGCT
GCAGCAGCCG GTATTGCCCT GCTGACCTCT GTGTCTACCA TTGCATTTAT GCAAAGAGCG
AGCAGACAAA AGACTACTGC CGAATACGAG GACGTAAGAC GTGTACTGAA TCACATTCAG
CGCTCCCAGA ACGCCCTTAT TAATGATATT AACAGCTCCA AAAAAGCACC TGCCAACCCC
GGCACTTATG GTGGCACCGG GTTTGCCGTA TCTAATAACG GTTATGTAGT GACTAACTAC
CACGTCATTG CAGGGGCTGA CTCTATTTAT ATACAGAATA CCAAAGGAGA AGCCTTTAAA
GCAGCCAGCG TATTTGAGGA TATTACCGCC GATCTGGCCA TCCTCAAAAT CACAGACTCT
ACCTTTAAAG GTCAGCCACT GCCTTACTCT CTGAAACCAC AGCGCGCCAT GCGTGGTGAA
CAGGTCTTTA CCCTGGGTTA CCCAAGAGAT GAGATCGTTT ACGGAGAAGG TTACATCAGC
GCTCAGACCG GCTTCAACGG TGACAGCGCC GCCTACCAGG TGTCTATCCC GGTTAACCCC
GGTAACAGCG GTGCTCCGCT GATGGACAAC AAAGGTGACG TAGTAGGTAT CGTAACCGGC
AAACAGACGA CTGCTGACGG CATCGCCTTC GCGGTAAAAT CAGCACACCT GAAAAGACTG
CTGGAACAAA TGCCCAAAGA CAAACTGCCT AAAAAAGAAT GGAACCATAA AAACAATAAA
CTCGAAGGAC TGAGCCGCGT AGAGCAGGTG AAGAAACTGG AAGACTTTGT TTACATGGTG
AAAGTCTATA ATTGA
 
Protein sequence
MNEFSLIRDI ERYLEGEMNE QEKTAFEALR QSDPAVNEQV LAHQQLITQL AYFGRRSALQ 
EKMDKIHAGL ANKHVSIAPS STPEPKKVFS ISRRLLLNMA AAAGIALLTS VSTIAFMQRA
SRQKTTAEYE DVRRVLNHIQ RSQNALINDI NSSKKAPANP GTYGGTGFAV SNNGYVVTNY
HVIAGADSIY IQNTKGEAFK AASVFEDITA DLAILKITDS TFKGQPLPYS LKPQRAMRGE
QVFTLGYPRD EIVYGEGYIS AQTGFNGDSA AYQVSIPVNP GNSGAPLMDN KGDVVGIVTG
KQTTADGIAF AVKSAHLKRL LEQMPKDKLP KKEWNHKNNK LEGLSRVEQV KKLEDFVYMV
KVYN