Gene Cpin_5159 details


Gene Information

Locus tag: Cpin_5159
Symbol:
ID: 8361336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 6439084
End bp: 6441084
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 44%
IMG OID: 644967308
Product: peptidase S9 prolyl oligopeptidase active site domain protein
Protein accession: YP_003124792
Protein GI: 256424139
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
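
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6439084
end_bp = 6441084

# Inclusive coordinates: both end points count as bases of the gene.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop
# codon, so it contributes no amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.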


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.017012
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.000228688
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACAATCA GTAGTGCTCA GATATCCCAC CAGGTGGGAA CAACCAGGCC GGGTAATCCG 
GCCCTGGTTT CTTCTGAGAG CGCATTACAG GAATTGATGG ATGCGTCAGC GGGTAACTAC
AAATATAGTG TAGAAGACTA TTTTGCACAT CCGCAGTCCG CTGATTTTCA GTTATCGCCA
AATGGCAGCT ATATTTCCTT CAGAGAACGT GACGAACATG GTAAAAGCAA TGTGATGGTC
AGAGAAGTGA GTACGGGTAA AACTACCTGT GCGCTGAAAG AAACAGAGAA CATTATCATC
GGATACGGCT GGGCATCAGA TAGCCGGCTG TTATATATGA TGGACGATGG CGGTAATGAG
AACTATCACT TATATGCCGT GAATACAGAT GGTTCAGGTA ATATCGATCT GACACCCTAC
GAAGGCGTAC GGGCCACTAT ACTGAAACGG CTGCCGGAGC ATAGAGAATT TATTATCGTC
TCTATGAATA GAGACAATCC GCAGAATTTC GAACCTTACA AAATCAATGT TAATACTGGA
GCAATGGTCA GATTGTATGA GAATAGTGAT CTGACCAATC CTGTGAATCT ATATGATTTT
GATAAGGACG GCAATCTTCG GGCATTCTCC AGAATGAATA ATAGAAAGGA GATGCAGTAC
TTCTATAAAT CGAAAGATGC AAAGGAATAT TCGCTGATAA AGACTATTCC CTGGTATAAT
AAATTCACCA TTTTATCTTT TAATTATGCA TCGGCTAATC CGGATGAAGC ATATGTACTG
ACGAATCTTG AGGCAGATAA AGCACGTATT GTACTGTATG ACCTGAAAGC AGGAAAGATC
ATCAGGGAGA TCTATTCAAA CAAAGACTAT GATGTATCCA ATCTTTCCTT GTCGCGAAAG
AGGAACTGGG AAATTGATTA TATAGATTAT GAAGGAGAAA AGCATATCAT TAAACCGGTC
AGCAAACATT TCAGCAGTAT CTATAAGCAA TTGAAGAAGC AGTTTAAAGG CTACCAGTTC
CAGATTGCTG CACAAACCGA GAACGAAGAG CAATACCTGG TGAAAGTAAG CAGTGATCGT
TTATACGGCA GGTTTTACCA CTACGACAGG AAGACGGGTA AAACAGCCTT GTTGTGTGAT
CTGATGCCAC AACTAAGGGA GGCGGATATG GCCGTTATGC GCCCGATCAC CTTTAAATCG
CTGGATGGAT TAACGATACA TGGATATATC ACTTTACCTG CCAACGCCTC AAAAAGAAAG
AAGGTACCGC TGATTGTCGA TCCGCATGGT GGTCCTCATG GTATCCGTGA TACCTGGGGT
TTTAATCCGG AAGCCCAGCT GTTTGCCAGC CGGGGCTATG CCACTTTGCA TATTAACTTC
CGTATCTCTG ATGGATATGG CCTGGACTTT TTCCGTGCAG GTTTCAAACA AACCGGTCGT
AAGATCATGG ATGACCTGGA AGACGGTGTG CATTATGTCA TTGACCAGGG TTGGGCCGAT
CGCGAAAATA TAGGTATTTA TGGCGGCAGT CATGGGGGAT ATGCCACGCT CATGGGACTG
ATCAAAACGC CTTACCTGTA TAAAGCAGGC GTGGATTATG TAGGTATCTC TAACATATTT
ACTTTTTTTG ATGCCTTACC GCCGTACTGG AAGCCCTTAA AAAATATGCT GAAAGACATC
TGGTATGACC TGGATGATCC GGAGGAAGCT AAGATCGCGA AGGAAGTTTC TCCGATTTAT
CATACGTATA AGATCAATGC GGCCTTATTT GTTGTACAGG GGGCCAATGA TCCCAGGGTA
AATATCCTGG AGTCGGACCG TATCGTTGCC GCTGTACGTA AAAAGGGCGT AGAGGTGCCT
TATATGGTGA AGTATGATGA AGGGCATGGT TTTCAGAAAG AAGCAAATCA GCTTGCTTTT
TATAAGGCAA TGCTGGGCTT TTTCAGCCTG CACTTCAACA AGCCGGCTCC GATTATTTTG
GATGATTGGG ATGTTTATTA A
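
The gene sequence above is laid out in space-separated groups of 10 bases, so it needs cleaning before any computation. The sketch below shows one hedged way to strip that layout and compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input, so the result describes that fragment rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGACAATCA GTAGTGCTCA GATATCCCAC CAGGTGGGAA CAACCAGGCC GGGTAATCCG"
seq = clean_sequence(first_line)

print(len(seq), "bases in fragment")
print(f"GC fraction of fragment: {gc_fraction(seq):.3f}")
```

To compare against the GC content field of the record, pass the full gene sequence text to `clean_sequence` instead of the single sample line.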
 
Protein sequence
MTISSAQISH QVGTTRPGNP ALVSSESALQ ELMDASAGNY KYSVEDYFAH PQSADFQLSP 
NGSYISFRER DEHGKSNVMV REVSTGKTTC ALKETENIII GYGWASDSRL LYMMDDGGNE
NYHLYAVNTD GSGNIDLTPY EGVRATILKR LPEHREFIIV SMNRDNPQNF EPYKINVNTG
AMVRLYENSD LTNPVNLYDF DKDGNLRAFS RMNNRKEMQY FYKSKDAKEY SLIKTIPWYN
KFTILSFNYA SANPDEAYVL TNLEADKARI VLYDLKAGKI IREIYSNKDY DVSNLSLSRK
RNWEIDYIDY EGEKHIIKPV SKHFSSIYKQ LKKQFKGYQF QIAAQTENEE QYLVKVSSDR
LYGRFYHYDR KTGKTALLCD LMPQLREADM AVMRPITFKS LDGLTIHGYI TLPANASKRK
KVPLIVDPHG GPHGIRDTWG FNPEAQLFAS RGYATLHINF RISDGYGLDF FRAGFKQTGR
KIMDDLEDGV HYVIDQGWAD RENIGIYGGS HGGYATLMGL IKTPYLYKAG VDYVGISNIF
TFFDALPPYW KPLKNMLKDI WYDLDDPEEA KIAKEVSPIY HTYKINAALF VVQGANDPRV
NILESDRIVA AVRKKGVEVP YMVKYDEGHG FQKEANQLAF YKAMLGFFSL HFNKPAPIIL
DDWDVY
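
As a consistency check between the two sequences, the start of the gene can be translated and compared with the start of the protein. This is a minimal sketch, not a full implementation of translation table 11: the `CODONS` dictionary is deliberately partial and lists only the codons found in the first 30 bases, whose amino acid assignments are the same in the standard code and in table 11. The function name `translate` is an illustrative choice.

```python
# Partial codon table: only the codons present in the first 30 bases.
# A complete table would be needed to translate the entire gene.
CODONS = {
    "ATG": "M", "ACA": "T", "ATC": "I", "AGT": "S", "GCT": "A",
    "CAG": "Q", "ATA": "I", "TCC": "S", "CAC": "H",
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, ignoring layout whitespace."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base groups of the gene sequence in the record.
gene_start = "ATGACAATCA GTAGTGCTCA GATATCCCAC"
# First 10 residues of the protein sequence in the record.
protein_start = "MTISSAQISH"

print(translate(gene_start))
print(translate(gene_start) == protein_start)
```

A codon missing from the partial table raises `KeyError`, which is intentional here: it signals that the sketch has been applied beyond the fragment it was written for.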