Gene Cpin_5519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5519 
Symbol 
ID8361696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp7040561 
End bp7041703 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content47% 
IMG OID644967665 
Productglycoside hydrolase family 76 
Protein accessionYP_003125149 
Protein GI256424496 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4833] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000740708 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0122661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAACAT ATACGACTAT TATGAAGCGT CTCACCTGTC TGCTGACGGC TACGGTTTTG 
CTGATAAGCA TTTCCTGTAG CAAGAGCAAC CCACGGGGGC CGGTTAATGA AGAGCCACCT
GTGCTGCCGC CTGTATCTTT TACATCGAAA GATGCTACGG CTGCATTTAA CACGTTCAAT
CAGTACTTCT ATAGTACGAC GGACAAACTG TATTATTCCA ATACAGAAAA AAAAGATATC
GGCGCCATCT GGACACAGGC GGTGTACTGG GATCTGATCA TGGATACTTA CAAGCGTACG
GGAGATGCGG CACATCGCAG GATGATCGAT GATCTGTACC AGGGAGGTTA CAACCGTTAT
GACAAGTATA ACTGGCGTAA CAAAGTGGTA TGGTTTATTT ATGATGATAT GATGTGGTGG
ATCATTTCAC TGGCACACGC ACATCAGATT ACCGGTAACC AGGAGTATCT GACCCGTTCT
ATAGAAGGAT TCCAGTATGT ATACCAGGAG TCTTATGACA AGGAACATGG TGGTATGTGG
TGGGATTTCA ATCATACAGG TAAGAATTCC TGTATTAATT TCCCGACAGT TATTGCTGCG
ATGACTTTGT ATAATATTAC GAAGGAGACT GCATATCTTG ACAAGGCGAA AGACCTGTAT
AGCTGGTCGC TGGCGAATCT GACTGACAGT ACAACCGGCC GCGCCGCTGA TAACAATATC
AGGGGTAACA AAGGCTGGTC TGATTATACC TATAACCAGG GTACTTTCAT CGGCGCTGCG
GTGATGCTGT ATAAGGCGAC CGGTCAGCAA TCTTATCTCG ATAATGCGAA GAAAGGAGCT
GACTATACGC AGCAGAAAAT GTCTGACGCT GATGGCATTC TGCCGGCAGA GGGAGACTGG
AATGAACAGG GCGTATTAAA GGCGATCCTG GCACATTATC TGCTGACATT GGTGAAAGAC
GCCAATCAGC CGCAATATCT GCCATGGATC AGAAAGAATA TTAATATGGC ATGGGGTAAC
AGGGATGCTG CGAGGGGGAT CATGCACAGG AACTATAAGA TACCTTGTCC GACAGGGATC
GTACAGTCCT ACGAAGCCAG CAGCGCTGTT GAGTTCATGC AGGTTTGTCC ACCAGAGAAG
TAA
 
Protein sequence
MLTYTTIMKR LTCLLTATVL LISISCSKSN PRGPVNEEPP VLPPVSFTSK DATAAFNTFN 
QYFYSTTDKL YYSNTEKKDI GAIWTQAVYW DLIMDTYKRT GDAAHRRMID DLYQGGYNRY
DKYNWRNKVV WFIYDDMMWW IISLAHAHQI TGNQEYLTRS IEGFQYVYQE SYDKEHGGMW
WDFNHTGKNS CINFPTVIAA MTLYNITKET AYLDKAKDLY SWSLANLTDS TTGRAADNNI
RGNKGWSDYT YNQGTFIGAA VMLYKATGQQ SYLDNAKKGA DYTQQKMSDA DGILPAEGDW
NEQGVLKAIL AHYLLTLVKD ANQPQYLPWI RKNINMAWGN RDAARGIMHR NYKIPCPTGI
VQSYEASSAV EFMQVCPPEK