Gene Cpin_5243 details


Gene Information

Locus tag: Cpin_5243
Symbol:
ID: 8361420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 6631702
End bp: 6634947
Gene Length: 3246 bp
Protein Length: 1081 aa
Translation table: 11
GC content: 47%
IMG OID: 644967391
Product: hypothetical protein
Protein accession: YP_003124875
Protein GI: 256424222
COG category:
COG ID:
TIGRFAM ID:
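
The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS that ends with one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(6631702, 6634947)
print(length_bp)                  # 3246
print(protein_length(length_bp))  # 1081
```

Both results match the reported Gene Length (3246 bp) and Protein Length (1081 aa).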


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00071392
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000387225
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATAATTA ATAATTCTAC CCCTTACCCG GTTTTCAAAA AGGGGCAGCA GTTGAAGAGT 
AGCAGCCTGA AAGGGATCGT TGATTTCGCG GAAGGAGAGA TCATGGATAC CCGCATGTTC
CTGGAAGGTT CAGGTATATT TTACGGACTG GATATTGAGA TCAGTGAGCA GGCAGGTACG
CTCCGTCTGT CGCCTGGGGC CGCGACCACC TCTGACGGGA AATTGTTTAC TCTGGAACAG
GAGGTGGTGT ACAATGGATT CAGTGCAGAG ACACCGATCA CCCTGTTGAA GAAGACGCCT
GTTGTTGCTG TACTGAGCAC CAACAACGAG AATCATAATG AATTTGCTTA TCGTATCACA
GGTAAGGATC CCGGTAACCC GAACAGAACG CCAAATACCA CACCATACAT CGTCCTGCTG
ATCCTGACGG AGAAGACGAC CAGTAATGAT AGTTGTCTGT ACGGACAGGA TAACAGCGAG
ACATCTCAGA CTAAAACAGT ACAGGCGGTA CTGATCGATA AGAACCTGGT TGACCAGTCA
GATCTTGATA AATGGTTTAT TACGGATAAT ACCACTGATG GAGCTGATGA TGCTGTGATC
AACCGTTTTG GATACAATAC CGGAGATGGT AAGTCTTATA TTTCATTTGA GAACTTTACC
AGCTGGAAGA ATGTCAGCGA TGGCTTTGAT AGCGTATGTA CCGCCGCAGA ACCATTAATC
GGCACTGCTC TCAAACAATT GTATGACCTG GTGAAGGAGA AGCTGGGACT GAACCCAACA
AACCCTTTCA ACAACCTGGA AAGTAATCTT AAACAACTGC GTGAAGCGGT AAAGAACAAA
GGTGGTGTAT CATTTCCCTG GCTTTATGAT TATTACAGGG ACCTGATTGC CGGCTATCAG
GAACTGGTAT CGACGGATCT TTTCAGTTAC CTGTCATTCT TTCCCAAGCG CGACCGTTTT
GAAGAATATA TTGCATTAGG CAGCATCCGT GCGAAGCATG GCGATAAGGA TGCCAATTAC
CGTATGGGAT TATACCGTCC GCCTTTTGCA GACCTCAGCA TTAACGCATT GGAGAAGCCA
TTGCAGCTGA TCAGAAGATT ACAATACCTG GCAGATACTG GACATACGCG CTTCAATACG
CAGGGTTTCC CACCGGCTGA AGTCGTATTT ACACCTGATG CTGGTATCAA TGAGGTCCTT
TCCAGACGTG CGATCCCTTT TTACTATAAT GATCCGCAGG AGTTGTCAAA GAGCTGGAAT
GCCGAACTTA CACGTAACCG CCGGACCTTC ACCATTCCGG GTATTACGGA TGAGAAAGAC
CGTAACCTGT TGCTGGCGAA TATGGACAGT TACGAGTTCT TCCGGATCAA AGGACATACC
GGCGCATTAA TAGAGCCTAC GCTGGATGCG ATTGCAGCAC AACGCAGCGA GTTACATCTG
CCATTTGATG TGAAGGTATT GTACCTGGGA GAGAAAGAAG ATATGATCAA ACTGGTAAAG
GAGCGCTCTG CAGCTTTCAG TGATCTGACG GTGCTGCTGG AGAAGATTGT GTATGATATC
CGTTGTGCGC GCACTTGTTC AGATAATTTT GAAGAAAGCA TCTTTGGCCA GCCATTCGAT
AGGAATGATA TCGGCAGCAT GTTTGAGGCG TTGGTTACCT TATTCGGTCC GCTACCTGTA
GATCTTGATA AGAAGCTGGA AGAACTTTGC AGTCGTGAGG GTACCTGTTA TGATGAAGAC
AAGACTTGTT GCCGTTCGCA TCTGACGGCG CTGTTCGCGG TGTATGAAGA ATACGTACGT
CGTAAGGATG AACTGGCAGG TAACCTGCTG TTCCATCTTT TCGCAGAGAA ACATCCGGGT
ATAGAGCATA ACGGCGGCGT ACCTAAAGGC GGTACACTGA TCCTGGTATG TGGTAAGACT
GATCCGGCGT TTCTGTCTGA AGAAAAGAAA TCAAATCTGC TGAAACTGGC GCTGAGTAAT
GCCGCCGGTG TCAACATTGC GGATGAACTG GAGAACTACA CGGTGATGGC TGATTTTTGT
CTGCCGTATA TCTGTTGTTC CAATAAACCG TCTATTAACC TGATCTTCCA GGAGGCGCCA
CCTGTGGCAC TGTTCTCTAT TGCAGAGCAG GAGCAGCTGC CGGAAGGACA AGGTACTGCT
GTTGTACTGA AGAACGAATC ACTGCGCGCT GATACTTATC ATTGGGAGTT GCAGGATTAC
CAGGGCAATA TGCTGAAAGA AGAATATACT ACGGATATCA ATGAAACGTC GAAGTTTGAA
TTACTGATAG AAAACGGGGT GGTATTTACG CTTTTGCTGA CCGCTTCGCG CGAGGGTATG
AACAGTAAGT ATGCGCTTGA GATAACGATC TGTCCACAGG GTAATGTAAA AGTGACCAGT
AAAGGTCAGT CATCTATCGA CTGGGATATC ACCAAGTCGC CTGAACTGGA GATAGAAGCC
TTTCCTTACG GGGGTAGTTT CGGACTGGTA TTAGAGCAGA AGGAGAAAGG GAACGAAGTG
GAACTGGATC CGTCGGAATA TGATATTACC TGGAAACAGG ATAAGGAACA CCTGACATTG
ACGATACAAC AACCTCAGGC GGGCATTTAC CGTTTGCGTT ATACGTTTGA AGATATCGAG
AATTGTGAGA AAGGATTTGC GGTGCTTCAG ATCAATACGT TTGTGCCTTC TAATCCACAG
CCCAAAGTAG CAGGTACAGA AACTGGTCCG GTGGCCGCAC CTGTGACAGC GTCTGCCAGT
GCACCGGTGG CGGTCAGCCG TGGTTTGTCA GCCGGAGGAA ATGAAGCGGT GTTCAATAAG
CGCATACTGA GTTATCGTAG TGGTATCAAC GCTATGTCTA AGGAAGATGA TACACTGCTG
GAAGACAGTC GCTGGTCAGA TACTAAAACC TTCTTATTGG CCAGTGGTGC ACCAGAAGAA
CTACATGCAG CGTATGAGCG TTTGCAGGGT GTGTTACAGG GTGGATTCGG TAAACTGAAG
GTGGCACAGA AAGCACAGAT GATCAGATTG CTGACTTATG CGACTGCTTA TTATATGGAC
AGACTGATTG CTGCTTCTCC GGATAAAGTA CCGGCGATAG CGAGAAAGCT GATTAAAGTA
GCGGGAGAAA GCATTGCCAT GCAAAAGGAT GGTGTACAGC AATGGTCACA GATATGGAGC
AGCGAAGGTA TTGTGACACC TGAGAATGAA AAGACAGTCT CCGCTTATAA AGCTATGATC
GCGTAA
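
The gene sequence is laid out in blocks of 10 bases, 60 per row. The following sketch shows one way to strip that layout and compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and how the database rounds its reported percentage is not stated in the record, so small differences are possible.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above; paste the full block to check
# the whole gene.
first_row = "ATGATAATTA ATAATTCTAC CCCTTACCCG GTTTTCAAAA AGGGGCAGCA GTTGAAGAGT"
seq = clean_sequence(first_row)
print(len(seq), f"{100 * gc_fraction(seq):.1f}% GC")
```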
 
Protein sequence
MIINNSTPYP VFKKGQQLKS SSLKGIVDFA EGEIMDTRMF LEGSGIFYGL DIEISEQAGT 
LRLSPGAATT SDGKLFTLEQ EVVYNGFSAE TPITLLKKTP VVAVLSTNNE NHNEFAYRIT
GKDPGNPNRT PNTTPYIVLL ILTEKTTSND SCLYGQDNSE TSQTKTVQAV LIDKNLVDQS
DLDKWFITDN TTDGADDAVI NRFGYNTGDG KSYISFENFT SWKNVSDGFD SVCTAAEPLI
GTALKQLYDL VKEKLGLNPT NPFNNLESNL KQLREAVKNK GGVSFPWLYD YYRDLIAGYQ
ELVSTDLFSY LSFFPKRDRF EEYIALGSIR AKHGDKDANY RMGLYRPPFA DLSINALEKP
LQLIRRLQYL ADTGHTRFNT QGFPPAEVVF TPDAGINEVL SRRAIPFYYN DPQELSKSWN
AELTRNRRTF TIPGITDEKD RNLLLANMDS YEFFRIKGHT GALIEPTLDA IAAQRSELHL
PFDVKVLYLG EKEDMIKLVK ERSAAFSDLT VLLEKIVYDI RCARTCSDNF EESIFGQPFD
RNDIGSMFEA LVTLFGPLPV DLDKKLEELC SREGTCYDED KTCCRSHLTA LFAVYEEYVR
RKDELAGNLL FHLFAEKHPG IEHNGGVPKG GTLILVCGKT DPAFLSEEKK SNLLKLALSN
AAGVNIADEL ENYTVMADFC LPYICCSNKP SINLIFQEAP PVALFSIAEQ EQLPEGQGTA
VVLKNESLRA DTYHWELQDY QGNMLKEEYT TDINETSKFE LLIENGVVFT LLLTASREGM
NSKYALEITI CPQGNVKVTS KGQSSIDWDI TKSPELEIEA FPYGGSFGLV LEQKEKGNEV
ELDPSEYDIT WKQDKEHLTL TIQQPQAGIY RLRYTFEDIE NCEKGFAVLQ INTFVPSNPQ
PKVAGTETGP VAAPVTASAS APVAVSRGLS AGGNEAVFNK RILSYRSGIN AMSKEDDTLL
EDSRWSDTKT FLLASGAPEE LHAAYERLQG VLQGGFGKLK VAQKAQMIRL LTYATAYYMD
RLIAASPDKV PAIARKLIKV AGESIAMQKD GVQQWSQIWS SEGIVTPENE KTVSAYKAMI
A
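
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 assigns codons to amino acids in the same way as the standard genetic code (the two differ in which codons may act as start codons). The `translate` function is illustrative, not a library API.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGATAATTA ATAATTCTAC CCCTTACCCG"))  # MIINNSTPYP
```

The output matches the first block of the protein sequence, MIINNSTPYP.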