Gene Cpin_4028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4028 
Symbol 
ID8360201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5011644 
End bp5013842 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content48% 
IMG OID644966201 
Productpeptidase S9B dipeptidylpeptidase IV domain protein 
Protein accessionYP_003123690 
Protein GI256423037 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.210313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.469025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTAC GTTATTTCAG CTGCCCTGGA AGCAGGGTAC TCACTTTTTT AGTCTGTATG 
GTCGCAGGTA GTGCCAGTGC ACAGCCATGG AAACATGTGA AATGGTCCGG CGACGGAAAG
ACTTTTTACC AGACGGAAGA CAATGGCGGT ATTGCCGCGT ATAGCGCCAA AGACGGTCAG
TCTACCCAGA AGATCGCTCC GGCCCTCCTG ACGCCCAAAG GGCAGTCAGT ACTTGAAATA
GAAGATTTCT CCTATACACC GGATGAAAAA AAGTGGCTGA TTTATACAAA AGCGCAGAAA
GTATGGCGCT ATAAAACCCG TGGCGACTAC TGGGTGCTGG ATATTGCCAG CGGTAAACTG
GTACAACTGG GTAAAGGATT ACCCGCCTCT TCCCTAATGT TTGCGAAATT TTCTCCCGAT
GGACAGAAAG TTGCTTACGT CAGCGAACAT AATCTCTATG TAGAAGAACT CGCTACTCAT
AAAATCAAGG CACTGACAAA AGATGGTACC CGTCGTCTTA TCAACGGTAC TTTTGACTGG
GCATATGAAG AGGAGTTTGA CTGTCGCGAT GGTTTCCGCT GGAGCCCGGA CAGCCGTGCA
ATCGCTTACT GGCAGATTGA TGCCCGTAAG ATCAGGGACT TCCTGATGAT CGATAATACC
GATTCCCTGT ATTCCTACAC CGTTCCGGTA GAATATCCTA AAGCAGGTGA AAGTCCGTCT
GCCTGCCGCG TAGGTGTGGT AGATATTACA ACCGCCAAAA CGATCTGGTT GCAGGTTCCC
GGTGATGCGC AGCAGCATTA TATTACCCGT ATGGAGTGGA ATCCTGCCCG TACCGGTCTG
ATATTGCAGC AGCTGAACAG AAAACAGAAC CAGAGTATTC TTTATACGGC GAATCCTACT
ACCGGTAAAA CTACAGAACT CTATAAAGAA AGCGATTCTG CCTGGATAGA CATCCGTTCC
CGCTGGAATG ATGAACTGGC AGGATGGGAC TGGACCAATG GCGGTAAATC CTTCATATGG
GTCAGTGAGA AAGATGGCTG GAGACACCTC TACAGCATCG ATATGAATGG TAAAGAAACC
CTGATCACAC CAGGTCAATA TGATATTATC AACCTGCTGC GTATCGACGA AGCGCATAAC
CTTGCTTATG TGCTGGCGTC TCCTGACAAT GCAACCCAGC AGTATCTGTA TAGAGTGTCC
CTGGATGGTA AAGGTCAGCC GGAAAGAGTT TCTCCGCTGG CCGAATCAGG TACGCATGAA
TATGAGATTT CTCCGACTGC GGACTATGCC CTGCACAGTT TCTCTAATCA CTACTATCAG
CCACATTCAG AACTGGTATA TCTGCCGGAG CATAAAGATG CGACCAGCAG CCGTATCATC
CGTGACTTAC AGACTTCCCG TTTTGCAATC CGCCAGGAAT TCTTCCAGGT GACTACTGCT
GATGGTGTGA CCATGGATGG CTGGATGGCC CGTCCGGCTA ATTTCGATTC TACGAAGAAA
TACCCGGTGG TATTCTATGT ATACGGCGAA CCGGCAGCTG CTACTGCTAA AGACGAATTT
GGCGCAGGAC GCAATTTCAT CTACAATGGC GATATGGCGG CGGACGGTTA TATCTATATT
TCTATGGATA ACCGTGGTAC ACCTTTACCG AAAGGCCGTG CCTGGCGTAA AGCGATCTAC
CGTAAGGTGG GGCAGGTGAA TATGCAGGAT CAGGCGATGG CTGCAACGGA GCTCTTTAAA
CGTCACGCTT ACCTGGACAC CTCCCGCGTA GCGGTATGGG GCTGGAGTGG TGGCGGTGGT
ATGACGCTAA ACCTGTTGTT CCGTTATCCG CAGATCTACA AAACAGGTAT TGCTGTTGCA
GCGGTAGGTA GTTTGTTTAC TTATGATAAT ATTTACCAGG AGCGTTATAT GGGATTACCA
CAGGAAAACC GCGAAGACTA TGTAAAAGGT TCTCCGGTGA CCTACACCAA AGGACTAGTG
GGTAATTTGC TCTATATACA CGGTACCGGT GATGATAACG TACATTTCCA GAATGCAGAG
TTGCTGCAGA ATGAACTGAT CCGTAATGGC AAGCAATTCC AGTTTATGTC TTATCCTAAC
CGTACGCATA GTATCAGTGA AGGAGCAGGT ACTTTCCAGC ACCTTTCTGC ATTATATACC
AATTACCTGA AAGAGCATTG TCCTCCGGGA GCGAGATAA
 
Protein sequence
MRLRYFSCPG SRVLTFLVCM VAGSASAQPW KHVKWSGDGK TFYQTEDNGG IAAYSAKDGQ 
STQKIAPALL TPKGQSVLEI EDFSYTPDEK KWLIYTKAQK VWRYKTRGDY WVLDIASGKL
VQLGKGLPAS SLMFAKFSPD GQKVAYVSEH NLYVEELATH KIKALTKDGT RRLINGTFDW
AYEEEFDCRD GFRWSPDSRA IAYWQIDARK IRDFLMIDNT DSLYSYTVPV EYPKAGESPS
ACRVGVVDIT TAKTIWLQVP GDAQQHYITR MEWNPARTGL ILQQLNRKQN QSILYTANPT
TGKTTELYKE SDSAWIDIRS RWNDELAGWD WTNGGKSFIW VSEKDGWRHL YSIDMNGKET
LITPGQYDII NLLRIDEAHN LAYVLASPDN ATQQYLYRVS LDGKGQPERV SPLAESGTHE
YEISPTADYA LHSFSNHYYQ PHSELVYLPE HKDATSSRII RDLQTSRFAI RQEFFQVTTA
DGVTMDGWMA RPANFDSTKK YPVVFYVYGE PAAATAKDEF GAGRNFIYNG DMAADGYIYI
SMDNRGTPLP KGRAWRKAIY RKVGQVNMQD QAMAATELFK RHAYLDTSRV AVWGWSGGGG
MTLNLLFRYP QIYKTGIAVA AVGSLFTYDN IYQERYMGLP QENREDYVKG SPVTYTKGLV
GNLLYIHGTG DDNVHFQNAE LLQNELIRNG KQFQFMSYPN RTHSISEGAG TFQHLSALYT
NYLKEHCPPG AR