Gene Cpin_1838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_1838 
Symbol 
ID8357989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp2242611 
End bp2243966 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content49% 
IMG OID644964026 
Productbeta-galactosidase 
Protein accessionYP_003121535 
Protein GI256420882 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAAT TCAGCGCACC TTTCAGCCGT CATGATTTTG GAGAGAATTT CCTTTGGGGA 
GTGACGATTT CCGCATTTCA GAATGAAGGG GCCTGCGATG CCGATGGTAA AGGGAAGTCT
ATCTGGGACG AGTTTACCAC CAAGAAAGGA AAGATTAAAG ATGGTACCAA TGCACTTGTG
ACAGTCGATT TCTATAACCG TTACCGGGAA GATATTCAGC TGGTCAAACA AATGGGTTTT
GATGTGTTCC GTTTTTCTAT TTCCTGGCCG CGTATTCTGC CGGAAGGTAC CGGCAAGGTG
AATCAGGCGG GCATCGATTA TTATCACCGG GTGATAGACG CCTGTCTGGA AGCCGGACTG
ATACCATACG TTACTTTATA TCACTGGGAT TTACCGCATG CAATTGAACA CAAGGGAGGA
TGGTGCCACC GTGGCGTTAT TTTCGCTTTT GAAGAATATG TGCGTATATG TGTACAGGCG
TTTGGAGATA AGGTGAAGAA CTGGATTGTG ATGAACGAGC CTTTTGGTTT TACATCATTA
GGCTACATGT TGGGCGTCCA TGCACCCGGT AAATTTGGCG TATCCTATTT TCTTCCGGCG
GTACATCATG TGCTGCTTGC ACAGGCAGTA GGTGCGAAAG TCATCCGGGA GACCGTCAAA
GAAGCCAATA TAGGGACTTG TCTGTCCTGT TCCTATATTT ACCCTTACAC ACAGCAAGAG
GGCGATATAC AGGCTGCAAA GAAGGCGGAT GCGCTATTCA ATCGCCTGTT CCTGGAGCCG
GTACTGGGCA TGGGGTATCC GGTGGATGAT TTCCCCTTAT TACGCCGGAT AGAAAGACGC
TATGCCTTGT GGCGGGATTG GGACCAGCTG TCATTCGATT TTGATTTTAT CGGCGTCCAG
AACTATTTCC CGTTGGTGGT ACGCAGGAAT GCGTTTATGC CGGTGGTCGG CATATCCGAA
GTAAAACCGA AGTCCCGCCG GGTGCCAATG ACGGCGTTAG GCTGGGAGAT CAGCGGGGAA
GGGATGTATG CCATCCTGAA ACAGTTTGGT GCTTATAAAG GCATTAAACG GCTGATGGTA
ACGGAAAGCG GGGCTGCCTT TGCTGATATT ATGAGTGGGG GTGCTATCGA TGATCAGGAC
CGTATTGGTT ACTTTAAGGA ATATTTGGCT GGTGTACTGA AGGCGAAGAG GGAGGGGATA
CCGATAGACG GGTATTTTGC CTGGACGCTG ACAGATAATT TCGAATGGGC GGAAGGATAT
AAGGCCCGCT TTGGGTTAGT GCATGTAGAT CGTGATACGC AGGTGCGGAC GATGAAAAGC
TCCGGACAGT GGTTCAGCGA GTTGTTAAAA AAATAA
 
Protein sequence
MEQFSAPFSR HDFGENFLWG VTISAFQNEG ACDADGKGKS IWDEFTTKKG KIKDGTNALV 
TVDFYNRYRE DIQLVKQMGF DVFRFSISWP RILPEGTGKV NQAGIDYYHR VIDACLEAGL
IPYVTLYHWD LPHAIEHKGG WCHRGVIFAF EEYVRICVQA FGDKVKNWIV MNEPFGFTSL
GYMLGVHAPG KFGVSYFLPA VHHVLLAQAV GAKVIRETVK EANIGTCLSC SYIYPYTQQE
GDIQAAKKAD ALFNRLFLEP VLGMGYPVDD FPLLRRIERR YALWRDWDQL SFDFDFIGVQ
NYFPLVVRRN AFMPVVGISE VKPKSRRVPM TALGWEISGE GMYAILKQFG AYKGIKRLMV
TESGAAFADI MSGGAIDDQD RIGYFKEYLA GVLKAKREGI PIDGYFAWTL TDNFEWAEGY
KARFGLVHVD RDTQVRTMKS SGQWFSELLK K