Gene Cpin_4041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4041 
Symbol 
ID8360214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5027338 
End bp5028969 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content47% 
IMG OID644966214 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003123703 
Protein GI256423050 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.457031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.172562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAAGA GACTATTCTG CATACCGTTT ATCGTATTAG TGGCTGCCTG CGGACGTCCC 
GCTTCTTCCG GCAAACAGGT GTTCCGGTAT AATGTGCCGG AGGGAATTTC CTCCCTGGAT
CCGGCATTTG CCAAAAACCA GGCAATCATC TGGCCGGTCC GACAGCTGTA TAATACCCTG
GTTGAACCGG ATGAACAACT GAATATCCGT CCTTCGCTGG CAAAACGCTG GGATGTCTCT
GCGGATCATA AAACCTTTGT CTTCCATCTA CGTACAGACG TTCATTTCCA TGATAATGAG
ATCTTCCCGG AGGGGAAAGG GCGTTTGATG ACAGCTGCAG ACGTGGTGTA CAGTCTGCGG
CGTATTATGG ATCCCGCCAC CGCATCTCCC GGCGCCTGGA TCTTCAATGG TAAGGTAGAC
CTGGTTAAAG GCTTTCAGGC GGTCAATGAT TCCACCTTTC AGCTGAACTT ATTACAACCT
TTCCATCCTA TACTGGGTAT TCTGAGTATG CAGTACTGCT CCGTTATCCC ACATGAGGCA
GTCGAAAAGT ATGGGAAGGA CTTCCGTAAA CATCCCTGCG GAACGGGTCC TTTCAGTTTC
TCTTTCTGGG AAGAAGGACA AGCCCTCGTA CTGCATCGTA ATCCTCATTA TTTTGAGAGA
GACAGTGCCG GACATGCATT GCCCTACCTG GATGCGGTGA AAGTCAGTTT CCTGGACAGC
AAGGCGACGG AATTCCTATT ATTCCGTCAG GGGCAGCTGG ACTTTATGAA TGATATAGAT
GCGTCGTTTA AAGATGAAGT GCTGACGAAA AAGGGAAAAC TGAAGAAGGA ATGGGAGGGG
AAACTGATCC TTGACAAGAG CCCTTATCTG AATATTGAAT ACTTCGGGTT TTTGCTGGAT
ACCAGCAAAG TAAACGTAAA ACATTCACCC TCAGCAATAA AGAAAATCAG ACAAGCCATC
AACTACAGTA TCGACAGAAT GCGGATGATC ACATATCTGC GTAACGGTAT TGGCTACCCG
GCTACATCCG GATTTGTACC TATGGGATTA CCTTCCTTCG ATACGACCAA GGTAAAAGGC
TTCCGTTATG ATCCTGAAAG GGCGAGATCC TTGCTGAAGG AAGCCGGATT CCCGGAGGGG
AAAGGACTGG CTTCGATCAG GTTACTCTCT ATTCCTGTTT ATGAGGATTA CGCCAATTAT
GTAGCTAATC AGTTGCAGCA GGTCGGTATC CCTGTACAGG TGGAAGTGAT GCAGAAAGCA
CTCCTTCTGG AACAGACCGC AAAGTCGGAA GCGCTGTTTT TCAGAGGTAG CTGGATGGCT
GATTATGCAG ATGCCGAAAA CTACCTCGCT GTGTTTTACA GTAAAAATCC GGCTCCCCCA
AACTATACCC GTTATGTCAA TCCGGCATTC GATAAGCTCT ATGAAAAGTC GCTGAGCGAA
AACAACGATT CGCTTCGTTC GCTCCTTTAT CAGGAGATGG ACCGGATGAT TATTGACGAT
GCTCCTGTCG TTCCATTATT CTATGACGAA GTCATACATC TGGTACAACC CAACGTTGAA
GGGTTTACAA GTAATGCCCT GAATTTGCTT GAACTCCGGA AAGTAAAGAT CCGGCTTTCT
CAACATCCCT GA
 
Protein sequence
MYKRLFCIPF IVLVAACGRP ASSGKQVFRY NVPEGISSLD PAFAKNQAII WPVRQLYNTL 
VEPDEQLNIR PSLAKRWDVS ADHKTFVFHL RTDVHFHDNE IFPEGKGRLM TAADVVYSLR
RIMDPATASP GAWIFNGKVD LVKGFQAVND STFQLNLLQP FHPILGILSM QYCSVIPHEA
VEKYGKDFRK HPCGTGPFSF SFWEEGQALV LHRNPHYFER DSAGHALPYL DAVKVSFLDS
KATEFLLFRQ GQLDFMNDID ASFKDEVLTK KGKLKKEWEG KLILDKSPYL NIEYFGFLLD
TSKVNVKHSP SAIKKIRQAI NYSIDRMRMI TYLRNGIGYP ATSGFVPMGL PSFDTTKVKG
FRYDPERARS LLKEAGFPEG KGLASIRLLS IPVYEDYANY VANQLQQVGI PVQVEVMQKA
LLLEQTAKSE ALFFRGSWMA DYADAENYLA VFYSKNPAPP NYTRYVNPAF DKLYEKSLSE
NNDSLRSLLY QEMDRMIIDD APVVPLFYDE VIHLVQPNVE GFTSNALNLL ELRKVKIRLS
QHP