Gene Cpin_3749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3749 
Symbol 
ID8359917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4713600 
End bp4717007 
Gene Length3408 bp 
Protein Length1135 aa 
Translation table11 
GC content50% 
IMG OID644965918 
ProductTonB-dependent receptor plug 
Protein accessionYP_003123412 
Protein GI256422759 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0080305 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0015737 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTACATTC CAAACCCGAA AGCAAAGTGG CTATGCCTGC TTCTGTGCAG TGCATTCAGC 
CAGGTGCATG TAAATGCTGC CCCTGTCATT GCAAAAAGCG TTTCGCCCGG CGCCTATGGA
TTATTTCAAC AGCCTGTTAC TGAAACAGTT ACCGGTATTG TAAGAATCTA TGGAAAAAAC
GGTCGCCCCG AAGCCATCCC CGGTATTAAC GTAATAGAAA AAGGTACTGT CAACGGGACT
GTCACAGATG CCGCAGGCGC CTATACGTTA AAGGTAAAAA AGGGTGCTAC GCTCACCTTC
TCCATGATCG GGTATAAATC CCGCGAAATG AAAGCGTCGT CTGCAGAACC ACTCAACGTC
ATCCTGGATG AAGATGTGAG CGCACTGAAA GAAGTCGTTG TAACCGGTTA CCAGACGATC
GACCGTAAAC TTTTCACCGG TGCTGCCGCC AATGTAAAAG CATCCGACAT CAAACGTGAC
GGTATCACCG ATGTCAGCCG TATGCTGGAA GGCCAGGTAG CCGGTGTCAG CGTACAGAAC
GTATCCGGTA CCTTCGGTGC GGCGCCTAAA ATACGTGTAC GCGGCGCCAC CTCCATCACC
GGCGATAATA AACCACTCTG GGTAGTGGAC GGTGTCGTAC TGGAAGACGT CATCAACGTA
TCCAACGAAC AGCTCTCCAC CGGTAACCCT TCTACCCTGC TCGGTTCCTC TGTAGCGGGC
CTGAACCCGG ACGATATCGA AAGCTTTCAG ATCCTGAAAG ATGCTGCCGC CACCGCCCTC
TATGGTGCGC GTGCGATGAA CGGCGTGGTA GTCGTAACGA CGAAAAAAGG TAAGGTCGGT
CAACCCGTAA TTTCCTATAC CGGCAACTTC TCCACCTACC TGAAACCTAC TTACGATCAG
TTCGACATCA TGAAGTCGGA TGATCAGATG GCTTTCTATA ACGAACTGTC CCTGAAAGGA
TGGTTAAACC ATTCTGATGC TACACGTGCC GAAAATGGCG GCGTCTACAC CAAAATGTAT
AATCTGATCG ATAAATACAA CGCCACCAAC GGCTCCTTCG GTCTGTATAA CAGCCCTGAA
AGCAAACGCC AGTTCCTTGA GCGTTACGCC AAAGCGAATA CCGACTGGTT TGATGTATTG
TTCCGCAACT CCTTTATGCA GGAACACTCG CTCAGCGTAT CCTCCGGTAC CGATAAATCT
CAACTCTACG TCTCCACCAG CTTCCTGCAG GACCAGGGCT GGACAATGGC CGATAAAGTA
AAACGCTTCA CCGGTAACGT AAGAGCCAAC TACAATATCA ACGATAAAGT CTCCTTCGGT
TTTATCACCC AGGGTGTCGT ACGTGACCAG CGGGTACCTG GTACGCTGAA CAGGAACAGT
AATGCGGTAA GCGGACAATT CGACCGCGAC TTTGACATCA ATCCCTATAG TTACGCACTC
AATACCAGCC GCGCCTTGAC TGCCTACGAT GAAAAAGGTA ACCTGGAATA CTTCACCCGC
AACTTCGCTC CTTTCAATAT CGTCAATGAA ATGAGCAACA ACTACATCGA CCTGACCCAG
CTCGATATCA AGTTGCAGGG CGACTTCTCC TATAAGATCC TGAAGAACCT GAAGTATACC
TTCCTCGGTA ACCTCCGCTA TGTGAAATCC AGCCAGGATC ACAAGATCAA GGAGAACTCC
AATATGCCGC AGGCATACCG TGCAGCAGGC GATGCCACCA TCCGCTCCCG TAACCGCTTC
CTCTACCGCA ACCCGGATGA TCCGGAAGCA GAACCGGTAG TAGTGCTGCC CTACGGTGGT
ATCTATACAA CGGAAGATGA CTACCTGGTC AGCTACTATT TCCGTAACAT GATCGAATGG
AATAAAAACT GGAATGATAA ACATATGGTG AACTTCCTCG GATCGCAGGA ATTGCGCTAT
GCGAACCGTC AGAACAGGCA GTTTGACGGA TATGGCTATC AGTTCGACAA AGGTGGCGTG
CCCTTCATCG ATCCGAATAT CATCAAGCAG AACGTAGAAA ACAACTTCAA CTACTACAGC
ATCTCACAGA ACTATGACCG CTACCTGGCC TACCTAGCCA ATGCGGCTTA CTCCTACAAA
GGAAAGTATA ACTTCAACGC CAGCATCCGC TACGATGGTT CCAACCTCCT CGGTGAATCC
CGTACCGCAC GCTGGTTACC TACCTGGAAC GTCAGCGGTT CCTGGAACGT AGATACGGAA
GACTTTATGC GCAGACAGGA AACCGTCAAC CGACTGACAC TCCGTGCCAC TTACGGCCTG
ACGGGCAGTA TGGGTAATGC CCGTAATTCC AGTGTCGTAT TGCAGAACCG TAGCGCCAAA
CGCCCTTATC TTTCCGAAAT AGAATCTGTT ATTTATATCC AGAGCCTCGA AAACTCTGAA
TTAACCTGGG AGAAACAATA CGAGACCAAC GTTGGTATCG ATGCGGGCCT GTTCCAGGAC
AGACTGACCT TCACCATCGA CGGTTACCTG CGTAAAGGAT TTGATCTGAT CGGTCCGATC
CGTACCTCCG GTATCGGTGG AGAAAGCGTA AAAATTGCCA ACTACGGCGA TATGAAATCT
CATGGTGTGG AAGCTACTGC TGCCTACAAG ATTGTCGATA ACAAGACATG GGGGCTCCGG
ACGCAACTGA CCGTTGGTTA CAACAAAGGA AAAATTACCA AACTGGAAAA TCTGCCGATT
ATATGGGATC TGGTAGTTGC AGATGGTGGC GCGACTGTTG GTCACCCGGT ACGCGGACTG
TACTCCATTC CGTTTGAAGG GCTGAACCCG AAAGATGGTA CGCCTTTATT CACCAACCAG
GACGGACAAA AAAGTGGTAA CGTATACCTG CAAAGCGATA AGGTAAACAA CCTGGTATAC
AATGGTCCGG TAGATCCTAC CCTGACCGGC GGCTGGTTCA ATACCCTCCG TTACAGCAAC
TTCTCCCTGT CAGCCCTGGT AACCTACAGC ACGGGTAATA AGATCCGTCT GAACCCAGGT
TTCAAACAGC AGTACACCGA TCTGGATGCC AGTTCCAATG ACTTCCTGAA CCGCTGGACA
CTGCCGGGAG ATGAAACCCG TACCGACGTA CCTTCCGTAC TGGATAAACT GGGCAACTCA
CTGCTGGACG GCGCATATCC GTACAACAAC TACAACTACT CCACTGCTCG CGTAGCCAAT
GGCGATTTTG TAAGACTGAA ACAGGTATCC GTCGCTTACA ATCTTCCGCC AAAAATGATC
CGCAGGGCAG GATTTAATAA CCTGTCTGTC AGCCTGGTAG CCAATAACGT ATGGCTGATC
TATGCGGACG ACCGTCTCAA CGGACAGGAT CCTGAATTCT TCAATTCCGG AGGTGTTGCA
CTGCCGATCC CAAGACAGTT CACCCTTTCC CTGAAAGCAG GTTTATAA
 
Protein sequence
MYIPNPKAKW LCLLLCSAFS QVHVNAAPVI AKSVSPGAYG LFQQPVTETV TGIVRIYGKN 
GRPEAIPGIN VIEKGTVNGT VTDAAGAYTL KVKKGATLTF SMIGYKSREM KASSAEPLNV
ILDEDVSALK EVVVTGYQTI DRKLFTGAAA NVKASDIKRD GITDVSRMLE GQVAGVSVQN
VSGTFGAAPK IRVRGATSIT GDNKPLWVVD GVVLEDVINV SNEQLSTGNP STLLGSSVAG
LNPDDIESFQ ILKDAAATAL YGARAMNGVV VVTTKKGKVG QPVISYTGNF STYLKPTYDQ
FDIMKSDDQM AFYNELSLKG WLNHSDATRA ENGGVYTKMY NLIDKYNATN GSFGLYNSPE
SKRQFLERYA KANTDWFDVL FRNSFMQEHS LSVSSGTDKS QLYVSTSFLQ DQGWTMADKV
KRFTGNVRAN YNINDKVSFG FITQGVVRDQ RVPGTLNRNS NAVSGQFDRD FDINPYSYAL
NTSRALTAYD EKGNLEYFTR NFAPFNIVNE MSNNYIDLTQ LDIKLQGDFS YKILKNLKYT
FLGNLRYVKS SQDHKIKENS NMPQAYRAAG DATIRSRNRF LYRNPDDPEA EPVVVLPYGG
IYTTEDDYLV SYYFRNMIEW NKNWNDKHMV NFLGSQELRY ANRQNRQFDG YGYQFDKGGV
PFIDPNIIKQ NVENNFNYYS ISQNYDRYLA YLANAAYSYK GKYNFNASIR YDGSNLLGES
RTARWLPTWN VSGSWNVDTE DFMRRQETVN RLTLRATYGL TGSMGNARNS SVVLQNRSAK
RPYLSEIESV IYIQSLENSE LTWEKQYETN VGIDAGLFQD RLTFTIDGYL RKGFDLIGPI
RTSGIGGESV KIANYGDMKS HGVEATAAYK IVDNKTWGLR TQLTVGYNKG KITKLENLPI
IWDLVVADGG ATVGHPVRGL YSIPFEGLNP KDGTPLFTNQ DGQKSGNVYL QSDKVNNLVY
NGPVDPTLTG GWFNTLRYSN FSLSALVTYS TGNKIRLNPG FKQQYTDLDA SSNDFLNRWT
LPGDETRTDV PSVLDKLGNS LLDGAYPYNN YNYSTARVAN GDFVRLKQVS VAYNLPPKMI
RRAGFNNLSV SLVANNVWLI YADDRLNGQD PEFFNSGGVA LPIPRQFTLS LKAGL