Gene Cpin_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4149 
Symbol 
ID8360322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5175865 
End bp5177637 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content47% 
IMG OID644966320 
Productglycoside hydrolase family 5 
Protein accessionYP_003123809 
Protein GI256423156 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.533031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.39445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTAT TTAAAAGCGT CACGTTATTA TTACTATCAT CCATTTCCCT CAAACCATTA 
CAGGGATGGA GCCAGGGATT TCTTGCAGCC CAGGGCGCCA AAATTGTAAA TGAAAAGGGA
GAAAATGTAC TGTTGCGGGG TATCGGTATG GGAGGCTGGA TGCTTCAGGA AGGGTATATG
CTCCGGGTAA ATGGTGAAGG CCAGCAATAT AAGATCCGCG CTGGCATTGA AAAGCTGATT
GGTCCGGAGA AAACACAGGA GTTCTACGAC GCCTGGCTGA ACAATCACAC GACGAAAGCG
GACATTGATT CTCTGAAAGC CTGGGGATTT AATTCCGTTC GTTTACCCAT GCACTTTAAT
CTGTATACCC TGCCGGTTGA CAAAGAGCCT GTCGCCGGAC AAAACACCTG GCTGGAAAAG
GGATTTACGA TGACGGACAG CCTGCTGTCC TGGTGTAAGG CCAATCATAT GTACCTGATC
CTGGATCTGC ACGCAGCGCC AGGCGGACAG GGGAATGACC TGAATATCTC CGACAGGGAT
GGTTCCAAAC CTTCTTTATG GGAAAGTGAG CCTGACCGTC TGAAGACCAT TGCGCTCTGG
AAAAAACTGG CCGAACGTTA CCATAATGAA CCTTATATCG GTGCATACGA TATCCTGAAT
GAGCCTAATT ATGGCTTTGA AGATCCGGAC AGTGACAAAA ACGGTCTGAA GGAGCAAAAC
AACGTTCCTC TGAAAAAACT TATGCAGGAT ATTACCCGCG CTATCCGGGA AGTAGATGAC
CGCCATATCA TCATTATTGA GGGAAATGGC TGGGGAAACA ATTACAACGG CATCCTGCCG
CCATGGGATA AGAACATGGT GCTGAGTTTT CATAAATACT GGAATCTGAA CGACACGAAA
TCTATACAGC ATATCCTTGA CTTCCGCCAG AAATACAATG TGCCGGTATG GTTGGGTGAA
ACCGGTGAAA ACTCTAATGT ATGGTTCACA GAAGCGATCC GTTTATTTGA AAAGAACAAT
ATAGGCTGGT CCTGGTGGCC GCTGAAGAAA CTAGGCGCTA ATAACCCACT GGAGATCATG
CCAAATGACA ACTACATGCG CGTAATAGAC TATGTTAGCG GTAAAGGCAG CGCACCGTCT
GCAGATGTAG CCTACGAAGG ACTGATGGAA GTAGCCAGAG GTGCAAATAT CAACCGGAAC
ATCCTGCACC GTGATGTGAT TGATGCGATG ATCCGCCAGC CGTTCAGTGA TAAAACGCTT
CCATTTAAAG CCAATAATAT CAATAACAAA GGCGAAATCA AAGCGGTAGA TTATGATCTG
GGTGCATTAC GGGCAGCTTA TTTTGATACA GATACGGCTA ACTATCGCAT TTCCAACCCG
AAATGGACTG GCGGCAATCG CGGCAGAACA TATCGTAACG ACGGTATTGA CATCGTTGCT
GATTCGACAG AAAAGGATAC TTATTATGTA ACGGATATTG TGACCGGTGA GTGGATGAAG
TATACGGTGA CTGTACTGCT GAAAGGCAGA TATAATATTT CATTTGTGGC GGCTCCTGGT
AAAACTCCGG GTAAACTGGC GCTGAGTATT GACGGTAAAC CTATTGGAAA AGTAACGGTT
ATTCCGGCGA AAGAGAACAC CAATGGCGCA TGGCAGACCT ATACCCTCAA TAACATTCCG
TTAAAGAATG GCCGCCAAAC CCTCCGCTTA TCCGCCGAAT CGGGAGGATT TGACCTGAAG
GCCATCCGTT TTGAGAAGTC CCCTGTCCCG TAA
 
Protein sequence
MKLFKSVTLL LLSSISLKPL QGWSQGFLAA QGAKIVNEKG ENVLLRGIGM GGWMLQEGYM 
LRVNGEGQQY KIRAGIEKLI GPEKTQEFYD AWLNNHTTKA DIDSLKAWGF NSVRLPMHFN
LYTLPVDKEP VAGQNTWLEK GFTMTDSLLS WCKANHMYLI LDLHAAPGGQ GNDLNISDRD
GSKPSLWESE PDRLKTIALW KKLAERYHNE PYIGAYDILN EPNYGFEDPD SDKNGLKEQN
NVPLKKLMQD ITRAIREVDD RHIIIIEGNG WGNNYNGILP PWDKNMVLSF HKYWNLNDTK
SIQHILDFRQ KYNVPVWLGE TGENSNVWFT EAIRLFEKNN IGWSWWPLKK LGANNPLEIM
PNDNYMRVID YVSGKGSAPS ADVAYEGLME VARGANINRN ILHRDVIDAM IRQPFSDKTL
PFKANNINNK GEIKAVDYDL GALRAAYFDT DTANYRISNP KWTGGNRGRT YRNDGIDIVA
DSTEKDTYYV TDIVTGEWMK YTVTVLLKGR YNISFVAAPG KTPGKLALSI DGKPIGKVTV
IPAKENTNGA WQTYTLNNIP LKNGRQTLRL SAESGGFDLK AIRFEKSPVP