Gene Cpin_6044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_6044 
Symbol 
ID8362226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp7642530 
End bp7644035 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content48% 
IMG OID644968178 
Productglycoside hydrolase family 43 
Protein accessionYP_003125657 
Protein GI256425004 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.845022 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0554344 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCGGA CTAAAAAGTA TCTTGCTCTA CTGCTTTCGT TTGTTTGTCT GTCAATGTGT 
GCCACGCTCC ACGCTCAGTC CACAACAGGC CCTGTTATCG CAGGCGACCT GGCAGATCCA
TCCATCATAA AAGTCGACAG CGTCTACTAT GCAACAGGTA CCTCCTCAGA ATGGGCGCCT
TATTATCCTG TCTACAAGTC GTCTAACCTG AAAGACTGGC GACAGACAGG CTACGTGTTT
GACAAAGCAC CGGATTGGAC AGTAGGCTCC TTCTGGGCGC CGGAATATTA TCAAATAGGC
GACACCTATT ACATGTATTA TACCGCCAGA CGAAAGTCGG ACAACCAATC CTTTATCGGT
GTCGCAACAT CCCGCTACCC CGATCATGGA TTTATTGACC ATGGCGTCAT CATCGAACAT
GGAAAAGAAG CCATCGATGC TTTTATCTAC GATGACAATG GCCAACGATA TATCACATTT
AAGGCATACG GACTGGAAAA CAGACCCATT GAAATACTTG GGTATAAATT GTCTGCCGAC
GGACTGAAAA CGGAAGGCGA AGCATTCACC TTACTGAAAG ATGATAACCG CGCAGGCATG
GAAGGACAAA GCATCCTGAA AAAAGATAAT TATTATTATC TCTTTTACTC TGCCGGCAAT
TGCTGCGGCG GTGGATGTAC CTATTCTGTA AACGTCGCCC GCTCCACCAG CTTCAAAGGC
CCTTATGAAT ACTTTACAGG CAACCCTGTC CTCAGTGAAA ACGACAGCTG GAAATGTATG
GGACACGGTA CCTTCGTTAC CGCTGATGAT AATCAGACCT ACTACCTGCA CCATGCGTAC
AATAAGAAAA GCACCGTGTT CACAGGGCGA GAAGCACTCC TCTCCCGGTT ATCCTGGCAA
ACACCTTCTG GCTGGCCCGC ACTGAAAACA GTCGATATCA GCACAACAAC ACCGGTAGAT
CTTTACGATC CGTTTGATGG AAAAAAAACA GAGAAATACT GGCAATGGGA CTTCCGCCAC
TCTACCCCTT CCATACAACA ACAGAAAGGA ACGCTTCGCT TATCAGGTGT AGCAACAAAA
GAAAATCCAG CCGGTATCGT ACTGACAGTA AGACCAACTG CCGATAACTT TGAAATGTCC
ACCAGCGTAA CGAATCACAA CAAAGCCCTC AAGGGACTGG TCATCTATGG GGACGCAAAC
GCCGCCATTG GTATCGGTGT AGAAGGAGAC AGTGTAAAAG TCTGGAAAAC TGAAAACAAA
CAACGTATCA CCATAAAGGC AGCCGCTGTG CCGTCCTCCG CTATCGGACT GAAAATAGCC
ATGTCTGGCG GCAGCAACTG CGAGTTCTTT TATCAAACGG ACGACGCTAC CTGGATACCG
CTGGCTACCG GCTTAGCAAC AGGATCTTTA GCGCAATGGG ACAGAAGTCC ACGACTGGGT
CTGCAATACA GCGGCAATAA AAACGAAAAC GCGCAGTTTG CCTTCTTCAG ATTGCACAAC
AAATAA
 
Protein sequence
MYRTKKYLAL LLSFVCLSMC ATLHAQSTTG PVIAGDLADP SIIKVDSVYY ATGTSSEWAP 
YYPVYKSSNL KDWRQTGYVF DKAPDWTVGS FWAPEYYQIG DTYYMYYTAR RKSDNQSFIG
VATSRYPDHG FIDHGVIIEH GKEAIDAFIY DDNGQRYITF KAYGLENRPI EILGYKLSAD
GLKTEGEAFT LLKDDNRAGM EGQSILKKDN YYYLFYSAGN CCGGGCTYSV NVARSTSFKG
PYEYFTGNPV LSENDSWKCM GHGTFVTADD NQTYYLHHAY NKKSTVFTGR EALLSRLSWQ
TPSGWPALKT VDISTTTPVD LYDPFDGKKT EKYWQWDFRH STPSIQQQKG TLRLSGVATK
ENPAGIVLTV RPTADNFEMS TSVTNHNKAL KGLVIYGDAN AAIGIGVEGD SVKVWKTENK
QRITIKAAAV PSSAIGLKIA MSGGSNCEFF YQTDDATWIP LATGLATGSL AQWDRSPRLG
LQYSGNKNEN AQFAFFRLHN K