Gene Cpin_2720 details


Gene Information

Locus tag: Cpin_2720
Symbol:
ID: 8358880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 3320046
End bp: 3323207
Gene Length: 3162 bp
Protein Length: 1053 aa
Translation table: 11
GC content: 46%
IMG OID: 644964900
Product: glycoside hydrolase family 2 TIM barrel
Protein accession: YP_003122401
Protein GI: 256421748
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
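
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is the reading that reproduces the listed gene length) and that the final codon is a stop codon that does not count toward the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 3320046
end_bp = 3323207

# Both endpoints belong to the gene, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 3162 bp
print(protein_length, "aa")   # 1053 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.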


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTTT CATTATTAAG TCTATTGATT GCCGTCAGCA TGACCGGGTC GTCTTACGCA 
TTCCAGCAAA TACAGCTACA GCACGGCCCA TGGCACAGCA AAACTGACAC CGCGACTGTG
CCCAAAGAAA TTGAAGATCC GGAATGCCTG GGTATCAACA AAGAACCTGC CCATGCCACC
CTGATGCCTT ACGCAGATCT GAAAGAAGCA CTCAACGCAA ATCGCTATGC CTCTTCTTTT
TCAAAGTCAC TCAACGGCAC CTGGAAGTTC AATTACGTAC CCTGGCCGCA ACAAAGACCG
GTGGACTTTT ACAAACCAGA TTTTTCAGTG GAAAAATGGG CAGATATCAA AGTACCGTCC
TGCTGGCAGG TAGAAGGTTA TGGCACACCA TACTATAGTA ATTTCAATTA TATCTTTCAG
AAGGATTTTC CCCGGGTAAT GAGTACTCCG CCAGTCAATT TTACAGCCTA TAAAGAACGT
AATCCGGTAG GCAGCTATCG CCGTAATTTT GATGTACCTG CTGACTGGGA TGGCCGGCGG
ATATTTATAA CCTTTGATGG CGTAGACGCA GGTTTTTTCC TATGGGTGAA TGGTCATAAA
ATTGGCTATA GCGTCAATAG CCGCAATGCT GCGGAGTTTG ACATCACTGA ATTTGTGAAA
CCCGGCGCAA ATATCATTGC TGTGGAAGTG TATCGTTTCA CTACCGGCAG TTATATGGAA
GATCAGGACA TGTTCCGCCT TAGCGGCATT TTCCGCAATG TAACGCTGTG GAGCGCGCCC
CAGGAGCATA TCCGTGACTT CCTTATAAGT ACTGATCTCG ATGCGAACTA TGTGCATGCC
ACCCTGAATG CATCCGGTAA AATAAAAAAC TACGGCACTA CAAAAACACC AGCAAGAAAA
GTATCCGTAG AATTGTATGA TGGTACAAAA CTGATTAAAT CAGGTACAGC GGATGTAAGC
GCCCTTCAGC CAGGCGAGGA AGCATCCTGG AAAGTCGCTT TTCCGGTATA CAATCCCCGT
AAATGGACCG CTGAAACCCC AGCGCTTTAC ACTACTGTAG TTACTGCACT TGACGGAAAA
AAGGTAATAG AAACCATGTC GTCCCGCACC GGTTTCCGTA AAATAGAAAT CAAGGGAAGG
GTATTCATGG TAAATGGTGT TGCAGTAAAA CTGAAAGGTG TAAATCGTCA CGAGAACTGG
CCGGAAACAG GACATACAGT GTCTGAACCC GATATGGTCC GGGACATCCT GCTGATCAAA
CAGGCTAACT GTAACCACGT GCGCACGTCA CACTACTCCA ATGATCCCCA GTGGTATGAG
CTATGCAACG AGTACGGCCT CTACCTGCTC GCAGAGGCGA ATGTGGAATC ACACGGCGCA
TGGGACGAAT TTAATGAAGA TCCCCGCATT AAAGCGGCTA TCATAGACCG GAATATCTCT
AATGTGGAGA ATTTCAAAAA TCATCCTTCA GTCATTATCT GGTCTCTCGG CAACGAATGC
GGTAGCGGAG GCACCAATTT CCGCGCAGCA TTGGCCGCGA TCCAAAAGCT GGATCCCACC
CGCCCTACGC ACTATCAGGG ATTTGGCACA GGTGATAAAA ACCCCAGTGA TATGGACAGC
GAGATGTATA CGCAATTACA CGAGGTACGT CGCCACGCTA CTGAACCAGG CCTGACCAAA
CCATTCTATC TCTGCGAATA CGCGCACGCG ATGTTTAACT CGATGGGCAG CGTAGAGGAA
TATAGCGACT TGTTTGATGA ATATCCGGCA TTGTTAGGCG GTTGTATCTG GGAATGGGAA
GACCAGGGTA TCTGGAACCG TATTGATCCT CAACACCCCA TCATTGCTTA CGGCGGAGGT
TTTGGTGAAT GGCCAAACGA CCGTTTCTTT ATTCATAAAG GCGTAGTATT TTCTGACCGC
AGTGTCAAGC CTCATTACCC GGAACTAAAA CATGCATATC AGTGGATTAA AGTTAAACCA
GTTGATATTG CCCGGGGTAA ATTACGAATA TTGAACAAAT ACCAATTTAT CACGCTGAAT
GGCTATAATG CCAGCTGGAC GTTGATGAAA AATGGTGTTA TTACCGATAG CGGAAAAATA
ATGCTTCCGC CTGTTGCAGC AGGCGATTCT GCTGACGTAC AATTACCCAG ATTCACTAAG
CGCACATCAA ATGCTGAGTA TATTGTGCGA CTTTCTTTCA AATTGACAAA TGAAGAAAGC
TGGGCGCATA AAGGTTTTGA AGTAGCCTCA CAGCAACTGG AAATACCTGG CGGGAATGGC
GACTTCACAG GCATTATGCA AAAAGGCCCA CTTGCAGTAA AAGAAGCTGA CAATGATATC
GAGATTGATG GCAGTAATTT TTCACTTGTT TTTAATAAGC AGAAAGGAAC CTTTACCTCG
ATAAAAAGCA AGGGACAGGA AATGCTGGAA ACAAACGGTG GCCCCCTACT CCATTTATGG
CGTGCCCCTC ATCGCAATGA CGACATGTGG GCGGATGATG AATGGGAGCG CTATGGTCTT
AAAAGAATGG AGTGGAGCGC AAGCGATATA GCAATAAAAA ATGAGGGTGA TATAACTATT
ACTGCAACAC TAAATGGAAA AGGCCGCAAC GGCTTCACCG TAAATCACCA GGTTTCCTAT
GTAATAAGCG GTGATGGTGT GGTAAAAGTA AATAATAACC TCAGCTTTGG AACTGTGAAT
ATTCCACTCG CAAGAGTTGG TGTAAGATTA CTACTAAAAA GTAATCTGGA TAAATTTCAG
TATTATGGCC GTGGCCCTGA AGAGAACTAT GCTGATCGTA AATCAGGCGA AGATGTAGGT
ATCTGGGGAA GCAGTGTAAA AGCGCAACTC ACTCCTTACG AAAAACCCAT GGAATGCGGT
AACCATGAAG ATGTGCGCTG GGCGACTGTG CGTAGCAATG AGGCTTCACT GTCTGTTGTA
AAAACTAAAG ACCTCCTGCA AGTATCTGCC CTGCCGTATC GCGACGAGGA TCTGCAGCAT
GTTGAGTATC GCATAGACTT GCCGGAGAGC AAACTTACGG CGCTTTGCAT ATCCACACAT
ACTTTGGGTG TAGGCTCAAA CTCCTGTGGT CCCCGTCCTC TTGACCGCTT TGTGCCGCGT
GCTGTGTCAC AATCTTTTTC TTATGTACTA AAAATACAAT AA
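
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. As a minimal sketch, the hypothetical helper `gc_content` below strips the whitespace and computes the G+C fraction. How the database rounds its GC content field is not stated, so the result for the full sequence may differ slightly from the listed 46%.

```python
def gc_content(blocked_sequence: str) -> float:
    """Return the G+C fraction of a DNA sequence written in spaced blocks."""
    # Remove spaces and line breaks, then normalise case.
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used here as a small sample.
first_line = "ATGAAGTTTT CATTATTAAG TCTATTGATT GCCGTCAGCA TGACCGGGTC GTCTTACGCA"
print(f"{gc_content(first_line):.1%}")
```

Passing the whole multi-line sequence as one string works the same way, because `str.split()` with no arguments splits on any run of whitespace.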
 
Protein sequence
MKFSLLSLLI AVSMTGSSYA FQQIQLQHGP WHSKTDTATV PKEIEDPECL GINKEPAHAT 
LMPYADLKEA LNANRYASSF SKSLNGTWKF NYVPWPQQRP VDFYKPDFSV EKWADIKVPS
CWQVEGYGTP YYSNFNYIFQ KDFPRVMSTP PVNFTAYKER NPVGSYRRNF DVPADWDGRR
IFITFDGVDA GFFLWVNGHK IGYSVNSRNA AEFDITEFVK PGANIIAVEV YRFTTGSYME
DQDMFRLSGI FRNVTLWSAP QEHIRDFLIS TDLDANYVHA TLNASGKIKN YGTTKTPARK
VSVELYDGTK LIKSGTADVS ALQPGEEASW KVAFPVYNPR KWTAETPALY TTVVTALDGK
KVIETMSSRT GFRKIEIKGR VFMVNGVAVK LKGVNRHENW PETGHTVSEP DMVRDILLIK
QANCNHVRTS HYSNDPQWYE LCNEYGLYLL AEANVESHGA WDEFNEDPRI KAAIIDRNIS
NVENFKNHPS VIIWSLGNEC GSGGTNFRAA LAAIQKLDPT RPTHYQGFGT GDKNPSDMDS
EMYTQLHEVR RHATEPGLTK PFYLCEYAHA MFNSMGSVEE YSDLFDEYPA LLGGCIWEWE
DQGIWNRIDP QHPIIAYGGG FGEWPNDRFF IHKGVVFSDR SVKPHYPELK HAYQWIKVKP
VDIARGKLRI LNKYQFITLN GYNASWTLMK NGVITDSGKI MLPPVAAGDS ADVQLPRFTK
RTSNAEYIVR LSFKLTNEES WAHKGFEVAS QQLEIPGGNG DFTGIMQKGP LAVKEADNDI
EIDGSNFSLV FNKQKGTFTS IKSKGQEMLE TNGGPLLHLW RAPHRNDDMW ADDEWERYGL
KRMEWSASDI AIKNEGDITI TATLNGKGRN GFTVNHQVSY VISGDGVVKV NNNLSFGTVN
IPLARVGVRL LLKSNLDKFQ YYGRGPEENY ADRKSGEDVG IWGSSVKAQL TPYEKPMECG
NHEDVRWATV RSNEASLSVV KTKDLLQVSA LPYRDEDLQH VEYRIDLPES KLTALCISTH
TLGVGSNSCG PRPLDRFVPR AVSQSFSYVL KIQ
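
The protein sequence above can be related to the gene sequence by translating it codon by codon. The following is a sketch rather than the database's own method. It builds the codon table from the standard NCBI amino acid string, which translation table 11 shares with the standard code for all 64 codons (table 11 differs only in the alternative start codons it permits, and these are not handled here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA sequence in spaced blocks, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAAGTTTT CATTATTAAG TCTATTGATT"))  # MKFSLLSLLI
```

The output matches the first block of the protein sequence, `MKFSLLSLLI`.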