Gene Cpin_3643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3643 
Symbol 
ID8359810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4573815 
End bp4577075 
Gene Length3261 bp 
Protein Length1086 aa 
Translation table11 
GC content47% 
IMG OID644965812 
ProductTonB-dependent receptor 
Protein accessionYP_003123306 
Protein GI256422653 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.15651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.747331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATG TAGACCTGTT GAAGGTCAGG CATCGTTATG CCTGGCAGTC AGCACTATTG 
CGCGTGAAAT TTACCTTGCT GCTTTTTTTA CCTGCCCTGT ATCCGCTTGT AGCGGCGGCA
GATGGTTATC ACACCTTATT CTCTGTACAA CAGGATGTCA CTTTAAAGGG AAAGATCGTT
GATAAAAATA AAGAACCTGT AATCGGTGCT ACTGTCAAAG TGGTAGGCAC TTCCAAAGGT
ACCACTACGC TTCCGGACGG GACTTTTACT TTAAACGTCG CGAATGGCGC TCAAATCACT
GTTTCTGCAA TCGGTTTCCT ACCGCAAACG CTCACTGTAA ATGGAAATGG GCCGCTCACT
ATTACCCTCG ATGCCGACAC GAAAGGTCTT AGTGAAGTAG TCGTGGTAGG TTACGGCGCA
CAGAAAAGAG AAGCCGTATC GGGTTCTATC GCGACTGTAA AAGGTGCTGA CCTGGTGAAA
TCCCCCGCTA CTAACTTATC CAACTCCATC GCTGGTCGTA TGCCCGGTGT GACCGCTATG
CAAAACAGTG GTGAACCTGG TGCAGACGGT TCCTCTCTGC GTATCCGTGG ATCCAATACC
CTGGGTAATA ATGACCCACT TGTGGTTATC GATGGTGTGG CAGGTCGTAC CGGTGGACTG
GAACGCCTGA ATCCGCAGGA TATTGAAAGC ATCTCCGTAT TGAAAGATGC TTCTGCGGCC
ATCTATGGTG CACGTGCAGC AAATGGCGTT ATCCTCGTGA CCACCAAACG TGGTAAAAGT
GGTAAGCCGC AGCTGAATTA TTCCTTTAAC CAGGGATGGT CTCAGCCGAC GCATATCCCG
GAAATGTCAG ACGCAGCTGA ATATGCACAG GTGCGTGATG AACTGAGCCT CTACCGTGAT
GTTCCCGGCA AGGAATGGTC AGATGCATGG TCCGTATACA AACAGGGGGG GGCTTATACT
TCACCTACTA CCAATAAAAA ATGGGATGTC GCTTTCGGTC CTTCCGAAAT TGACAAGTTC
CGCAACAGCA CTGATCCATG GTTGTATCCT AATACAGACT GGTACAAGTC TACATTTAAA
ACATGGGCGC CACAGTCAAA CCATAGTGTA CAGGTCTCCG GCGGCAACGA GAACATGCGC
TATCTGACGT CCTTCGGTTA TCAGTCACAG GACGCTTATT ATAAGAAATC AGCGACCGGT
TATGACCAGT ATAGTCTGCG GGTTAACCTC GACGCCAAAA TCAATAAATA TGTCAACGTG
AGCGTAGACA TGTTAGGTCG TCAGGAAGAA CGTAATTACC CTACCCGTAG CGCCAGCGAT
ATCTTCCGTA TGCTCATGCG TGGTAAGCCG ACAGAACCTG CTTTCTGGCC GAACGGCCTG
CCTGGTCCGG ATATCGAATA TGGAAACAAC CCGGTAGTCA TCACCACCGA CCAGACCGGC
TACGATAAAG ACAAACGTTA TTACGTACAG ACCAATGCCC GTCTCGACAT CCTTGTACCT
GGCGTAGAAG GATTGAAACT GAGCGGTAAC GTGGCATTTG ATAAATACCT GAAACGTACG
AAGAGATGGA TGACACCATG GTACCTCTAT ACATGGGATA AGAAAACGTA TGAACAGGAT
GGTACGACGC CGCAGCTGGT GAGAAGTAAA CGTGGTACTG ATCAGGCGAC ACTCAACCAG
GGTGATGAAG ATCAGAGCAA CGTATTGCTG CGTGGTTTAT TGACTTACGA CCGCACCTTT
AACAAAAATC ATACACTGAA CTTTGTGTTT GGTATTGAAA GAGAAACTGT ACACTCTGAT
AATTTCAACG CACTCCGCAA ATACTTTATT TCTCCATTGA TCGACCAGAT GTTTGCTGGT
GGCGATGCAG AAAAAGACAA TGGGGGCTCT GCGTGGGACC GTGCACGTCT GAACTATTTT
GGTCGTGTGG CTTACAACTA CAAAGAGAAA TACCTCGCTG AGTTTGTATG GCGTAATGAT
GGTTCCTACA TGTTCCCGTC TACTTCCCGT TTCGGATTCT TCCCGGGTCT GATGCTCGGC
TGGCGCGTGT CAGAAGAGAG TTTCTGGAAG GACAACGTGA AGTTCATGAA TTACCTGAAA
CTGCGCGCAT CCTGGGGTAA ACTGGGTAAT GACCAGGTAT ACTTCGATGA CAACAATGAC
GGTACGCTTA CCTTACAGGA ATATCAGTAC TATTCTACAT ATGGATTCAG CAGTTACATC
CTGGGTGGTA ACGTCCTCAA ATCATTGTAT GAGGCGCGAA TTCCGAATCC GTATATCTCC
TGGGAGGTGG CTAACAACTA CAACGTTGGT CTGGATGGAG CTTTACTCGA CGGTAAGATC
TACTTCGAAC TGGACGCGTT CATGAACAAA CGTTCCGGTA TCCTGGTAAG AAGAAATGCT
TCTGTTCCGC AAACAACCGG TATGACGCTG CCAGCAGAAA ATATCGGTAA AGTAGATAAT
AAAGGTTTCG AATTCCGCAT CGGATACAAC AGTACCGTAG GTACTAAATT CCGCTATGAT
GTGAGTGTGA ATGGCGGGTA CTCTAAAAAT AAGATCGTAT TCTGGGATGA AGCACCTGGT
GCACCGGTAT GGCAACGTTC CACCGGTATG CCAATGAATA CGGCGCTCTA TTATGAATAT
GACGGCGTAT TTAAAGATTA TGCAGAGATC GATAAGAACA CCATTAACTA CTCTGCACTC
ACAAACAATC TCCGCCCGGG TGATATGAAG TTCAAAGATA TCGATGGTAA CGGTAAGATT
GATGGTGATG ATAAGATCCG TTATAATAAA AGTTCCCAGC CTACATTCAC CGGCGGTCTG
AACGTGAATC TGCAATACAG CAACTTCGAC TTCGCCTTGC TCGTACAAGG TGCTACCGGC
GGTGCATTGC ATATCAATAC AGAATCCGGT GAGATCGGCA ACTTTACACA GGACTATTTC
GATCATCGCT GGAGTCCGGA CAATCCAAGC AGCGTTGATC CCCGTGTAAA CGACCGTAAC
GACACTTATT GGGCTACCGG TAATACTTAC TGGGTACGCA GCACTAACTA CGTGCGTCTG
AAAAATCTGG AGATCGGCTA CAGCATTCCT GAGTTTCTGA AGAAGAAAGC AGGTATCACA
AATCTGAGGA TTTACGCCAA CTGTTTAAAC CTGTTCACCA TTGACAACCT GAAGATCCTT
GATCCGGAGT CAGTGAACGG AAATGGCCAG TATTATCCGC AATCAAGGGT ATTGAATGCC
GGTTTCAATT TAACCTTTTA A
 
Protein sequence
MKNVDLLKVR HRYAWQSALL RVKFTLLLFL PALYPLVAAA DGYHTLFSVQ QDVTLKGKIV 
DKNKEPVIGA TVKVVGTSKG TTTLPDGTFT LNVANGAQIT VSAIGFLPQT LTVNGNGPLT
ITLDADTKGL SEVVVVGYGA QKREAVSGSI ATVKGADLVK SPATNLSNSI AGRMPGVTAM
QNSGEPGADG SSLRIRGSNT LGNNDPLVVI DGVAGRTGGL ERLNPQDIES ISVLKDASAA
IYGARAANGV ILVTTKRGKS GKPQLNYSFN QGWSQPTHIP EMSDAAEYAQ VRDELSLYRD
VPGKEWSDAW SVYKQGGAYT SPTTNKKWDV AFGPSEIDKF RNSTDPWLYP NTDWYKSTFK
TWAPQSNHSV QVSGGNENMR YLTSFGYQSQ DAYYKKSATG YDQYSLRVNL DAKINKYVNV
SVDMLGRQEE RNYPTRSASD IFRMLMRGKP TEPAFWPNGL PGPDIEYGNN PVVITTDQTG
YDKDKRYYVQ TNARLDILVP GVEGLKLSGN VAFDKYLKRT KRWMTPWYLY TWDKKTYEQD
GTTPQLVRSK RGTDQATLNQ GDEDQSNVLL RGLLTYDRTF NKNHTLNFVF GIERETVHSD
NFNALRKYFI SPLIDQMFAG GDAEKDNGGS AWDRARLNYF GRVAYNYKEK YLAEFVWRND
GSYMFPSTSR FGFFPGLMLG WRVSEESFWK DNVKFMNYLK LRASWGKLGN DQVYFDDNND
GTLTLQEYQY YSTYGFSSYI LGGNVLKSLY EARIPNPYIS WEVANNYNVG LDGALLDGKI
YFELDAFMNK RSGILVRRNA SVPQTTGMTL PAENIGKVDN KGFEFRIGYN STVGTKFRYD
VSVNGGYSKN KIVFWDEAPG APVWQRSTGM PMNTALYYEY DGVFKDYAEI DKNTINYSAL
TNNLRPGDMK FKDIDGNGKI DGDDKIRYNK SSQPTFTGGL NVNLQYSNFD FALLVQGATG
GALHINTESG EIGNFTQDYF DHRWSPDNPS SVDPRVNDRN DTYWATGNTY WVRSTNYVRL
KNLEIGYSIP EFLKKKAGIT NLRIYANCLN LFTIDNLKIL DPESVNGNGQ YYPQSRVLNA
GFNLTF