Gene Cpin_5150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5150 
Symbol 
ID8361327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6429054 
End bp6430754 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content48% 
IMG OID644967299 
Productglycoside hydrolase family 39 
Protein accessionYP_003124783 
Protein GI256424130 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3664] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0964315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00110099 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAGA TCTTACTGTT CAGCCTTTGT ATAGCCCTGG GTGATACTTT ATCCGCCCAA 
TCCACCCCTA CTACTTTAAC AGATAATATA CCTGCGTCCA TTGAGGTACG TCTGGGGCAG
ATAACTGGTC CGATGAAACC TGTATGGGCC TGGTTTGGTT ACGATGAACC GAATTATACC
TATATGAAGG ACGGGAAAAA GTTACTCAGT GAAATTGCGG CACTCAGTCC GGTACCGGTA
TATGTCAGGA CGCATAGTCT GCTGGTAAGC GGAGACGGTG TTGCCGCCCT CAAGTGGGGT
TCTACCAATG TATATACCGA AGATGCAAAC GGCCAGCCAG TTTACAACTG GACAATCATC
GACAGCATCT TCGACACCTA TATCAACCGC GGCATGAAAC CGCTGGCACA GATCGGCTTT
ATGCCGCAGG CACTCTCCAC CCATCCTGAG CCCTACCGGC ACTACTGGAA ACCCGGTGAT
CCATATACAG ATATTATTAC CGGCTGGGCA TATCCGCCTA AAGACTATGA CAAATGGCGG
GAGCTTGCCT ATCAGTGGGC TAAGCACTCC GTAGAGCGTT ACGGCCAAAA GGAAGTGGAA
AGCTGGTATT GGGAAGTCTG GAATGAGCCA AACGGTCACT ATTGGAAAGG TACGCGGGAA
GAGTTCTTCA AACTATACGA CTATGCCGCT GATGGCATTA AAAAGGCCCT GTCCACTGCC
AGGATCGGCG GTATCAATAT AGCTGGTACA AGTAGTAAAA CGGCTACTGA ATGGACCACA
CAATTTATTG AGCACTGTAT TTCCGGGACC AATTACGCTA CCGGTAAAAC CGGCGCTCCG
CTGGATGCCT TGTTATTTCA TGCCAAAGGA AATCCTAAAC TGGTCAATGG TATCGTTAGG
ATGAATATGT CGCCTCAACT GCGTGATATA GCAGCAGGTT TCCGTATTGC CGCATCCTAT
CCCCAGACGC GTAATCTGCC ATTGATCATC GGGGAATCAG ATCCTGAAGG TTGTGCTGCC
TGTGGTATGG CTACCAATCC TGAAAATGCC TATCGCAACG GTACACTATA TTCCAGCTAT
ACAGCGGCCT CCTTTGCCCG TAAATACCTG CTGGCCGATC AATACGCGAT CAATTTCCTG
GGCGCCGTAT CCTGGTCATT TGAATTTGAA AACCAACCCT GGTTTTACGG ATTCAGGGAC
CTGGCTACCA ATGGTGTAGA TAAACCGGTG CTCAATGTAT TCCGGATGTT TGGTATGATG
CGCGGCGACA GGGTCAACGT TTCCTCCAGC CGCATGTATC CATTGGAAAC GGTCCTGGAT
TCCAGTATCA GGGGGCAGCA GACCGAAATC GGCGCATTGG CTTCCAAAGC AGCTCACACA
GCAGCGGCAA TGGTATGGAA TTATCATGAT GAAGATAAAA AAGGCCCTGC TGAGCTGGTG
AATCTGACTT TTAAAGACGT ACCGGCCCAA AAGGTAATCA TAAAAACCTA TCTTATAGAT
AGTGATCACA GCAATTCCTA CGAAGTATGG AAAAAGATGG GATCTCCACA GCATCCGACT
AAAAAGCAGA TCAGTACACT GGAAAAAGCA GGAAAGCTAC AGATTGTACA GACAATACAA
AAAGCAAGCA TGAACGGAGA GGTGCAGCTG CCCCTTCGTT TGCAGCGTCA GGCAGTTGCA
CTAGTGACAC TCAGCTGGTA A
 
Protein sequence
MKKILLFSLC IALGDTLSAQ STPTTLTDNI PASIEVRLGQ ITGPMKPVWA WFGYDEPNYT 
YMKDGKKLLS EIAALSPVPV YVRTHSLLVS GDGVAALKWG STNVYTEDAN GQPVYNWTII
DSIFDTYINR GMKPLAQIGF MPQALSTHPE PYRHYWKPGD PYTDIITGWA YPPKDYDKWR
ELAYQWAKHS VERYGQKEVE SWYWEVWNEP NGHYWKGTRE EFFKLYDYAA DGIKKALSTA
RIGGINIAGT SSKTATEWTT QFIEHCISGT NYATGKTGAP LDALLFHAKG NPKLVNGIVR
MNMSPQLRDI AAGFRIAASY PQTRNLPLII GESDPEGCAA CGMATNPENA YRNGTLYSSY
TAASFARKYL LADQYAINFL GAVSWSFEFE NQPWFYGFRD LATNGVDKPV LNVFRMFGMM
RGDRVNVSSS RMYPLETVLD SSIRGQQTEI GALASKAAHT AAAMVWNYHD EDKKGPAELV
NLTFKDVPAQ KVIIKTYLID SDHSNSYEVW KKMGSPQHPT KKQISTLEKA GKLQIVQTIQ
KASMNGEVQL PLRLQRQAVA LVTLSW