Gene Cpin_5903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5903 
Symbol 
ID8362084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp7497298 
End bp7498863 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content49% 
IMG OID644968042 
Productglycoside hydrolase family 28 
Protein accessionYP_003125522 
Protein GI256424869 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.672596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGT TACTGACTGC CACCCTGCTA TGCTTCAGCC TGTCGGCCGC TGCCCAGTTG 
AACGTCCATG TCCCCACCAT CGATGAAGTA GGCGCCACCA GGCTTCCCGC TAACATAGCT
CCCGTGCATG CCCCTTTTTC AGTACCGGCT TTTAAAAAAC CCATATTCCC GAAATACACG
ATCACCATCA AGGGACAGGG AAGCGCACAA ACAAAAGAGA TCCAGCAGGC CATTGACGCC
GTTAGCAAAA AAGGCGGCGG AACAGTTATC ATACCTGCGG GCAACTGGCA TTCAGGCCGT
ATCGCGTTGA AATCCAACGT CAATCTCCAC CTGGAAGAAA ACGCGGTACT GGAGTTCGGT
GGAGAAATAA GAGATTATCT CCCTGTGGTC TTCACCCGTA CGGAAGGGGT CGAGGTCATG
TCTCTCGGCG CCTGTATCTA TGCGAACGGA CAACATAATA TTGCCGTAAC CGGCAAAGGG
AAACTGGTTG GTCCGCCGGC CAATTGTCCT GTCAGAAAAC AGGTCATGCG CCAGGATGTG
ATAGAAAATG TCGTAGCTGC CAATAAACCG GTCTCGCAGC GGATATATGA TGGCCATGAT
GGCGGTCCTG TGTACCTGCC AATGTTCGTT TCTGCCGTCA ACTGTAAAAA TGTTTATTTA
GAGGGCTTGC AACTGGAAAA TACCCCTTTC TGGAACATTG TCCCTATCTA TTGCGATAAC
GTCATTATAC GGGGTATAAC CGTCAATTCT GTCGGTATTC CCAGCGGTGA CGGTATTGAC
ATTGAATCCA GCAAAAATGT ACTGATAGAA TATTGTACGC TGAACTGCGG CGATGACTGC
TTTACATTAA AAGCCGGTCG CGGAGAGGAC GGTTTACGTA TCGGCAAACC AACAGAAAAC
GTCGTTATCC GCTATTCACT GGCACGGCAG GGACACGGTG GCATCACCGT TGGCAGCGAA
ACAGCTGCCA TGATCCGGAA CCTGTATGTA CATGATGTAG TTTTTGACGA TACAGAAGTT
GGTCTCCGTT TTAAAACAAG ACGTCCGCGC GGCGGTGGTG GTGAAAACCT GCACTATGAA
CGTATCCGTA TGCGCCTGCG GCTCGATGCT TTCAGATGGG ATATGCTGGG CGCAAGAATG
TATGTAGGCG CGCTGGCTGA TCGCCTGCCC GCCTTACCTG TCAATAAACT GACGCCGGTA
TACAGGAACA TTTACGCTAA AGACATTGTG GTAGACAGCG CGAGAGCGCT GGTAAGAGTG
GATGGTATTC CGGAATCACC TATGACAGGC TTTCACCTGC AAAATGTAGA AGCGCATTGT
ACGAAGTTCT TACAGAGTAT AGACGCCAAT GTTATCAGTA TCTCCAACGC AAACATATAT
ACAACAGATT CCGCTGTAAC ACTGACCGAT AGCAGGAATA TTACTTTTGA TAAGGTACAC
GTTATCAACC CCGCCAATAA AGTCGTGGTG AATATTTCCG GAGAACTGAC CGATAATATA
CGCTTTAGTA ATTCTGTACC GGAGAAACCC GAAGGCTGGG AAACCGCTAC CTGGAAGAAG
AATTAA
 
Protein sequence
MKKLLTATLL CFSLSAAAQL NVHVPTIDEV GATRLPANIA PVHAPFSVPA FKKPIFPKYT 
ITIKGQGSAQ TKEIQQAIDA VSKKGGGTVI IPAGNWHSGR IALKSNVNLH LEENAVLEFG
GEIRDYLPVV FTRTEGVEVM SLGACIYANG QHNIAVTGKG KLVGPPANCP VRKQVMRQDV
IENVVAANKP VSQRIYDGHD GGPVYLPMFV SAVNCKNVYL EGLQLENTPF WNIVPIYCDN
VIIRGITVNS VGIPSGDGID IESSKNVLIE YCTLNCGDDC FTLKAGRGED GLRIGKPTEN
VVIRYSLARQ GHGGITVGSE TAAMIRNLYV HDVVFDDTEV GLRFKTRRPR GGGGENLHYE
RIRMRLRLDA FRWDMLGARM YVGALADRLP ALPVNKLTPV YRNIYAKDIV VDSARALVRV
DGIPESPMTG FHLQNVEAHC TKFLQSIDAN VISISNANIY TTDSAVTLTD SRNITFDKVH
VINPANKVVV NISGELTDNI RFSNSVPEKP EGWETATWKK N