Gene Cpin_3553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3553 
Symbol 
ID8359720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4427541 
End bp4430786 
Gene Length3246 bp 
Protein Length1081 aa 
Translation table11 
GC content49% 
IMG OID644965724 
ProductTonB-dependent receptor plug 
Protein accessionYP_003123218 
Protein GI256422565 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.467854 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00228984 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCATCGGA TCCTGATGCT TGTTTTCCTA TTGGCTGTCT GTAGCAGGCT ATCGGCACAG 
CAGCCCCTGG AAAAGGTACT GACCTTCAAA GGCAGCACCA TCACCGTACA GCAGTTCATT
CAACAGGTAA AAAGCCAGCA ACAGCTGCAG TTTACCTTTG ATGAAGATGT GAATGCCCTG
TTGTCCCGCA CAGTGCATAT CCGTAGAAAC ACCGTCTCAT TAAAAGAAGC CCTCGACTGG
CTGGCCACAG ACGCTTCCAT CCATTGCCGC ATCCTCAATA ATTATGCGAT CCTCAGCGTG
ATGCCTGCCC ATCAGAAAAC GCCTGCTGCC AAGCCCGATC CGGCCTCCCT GGAACTGCCA
CAGCCATCTA CAAAGGCGCT GGGAGAAGTG GTGGTGACCG CCCTGGGTAT CAAAAAAAGC
GAACGTAACC TGGGTTATTC CGTAGCACAG CTGTCTTCCA AGGATGTCAG CGATGTCCCA
CAACCCAATG TACTGAACAG CCTCGCTGCC CGCGTTCCGG GTGTAAATAT CCGCAGTACG
GGTTCCGATC CGGGCTCCAG CGTGATGGTA ACGATCCGCG GACAATCCTC CATCAGCAAA
GACAATCAGC CCCTGTATGT GGTAGACGGT GTACCGGTTG CTCCTGCTAT CCAGAATCCG
GCACAGGCGG TAGATGGCAA ACAGACCATC GATTATGGTA GTCCGATCAG TGATATCAGC
GCTGACGATA TCGCCAGTAT CACCGTATTA AAAGGCGCCA GTGCTGCTGC CCTTTATGGT
AGCCGCGCAG GATCAGGTGT TATCCTCATC ACTACTAAAT CCGGTGCTGC AAAAAAAGGA
TTGGGAGTCA GTTTTAATTC CAGCGCCGTA TTCGACAAAG CCTGGATGTT CCCGCATTTC
CAGCATGAAT ATGGTTCCGG TGACTGGACA GGCAGCGACA ATACCATCAG CACCGGCGCA
TGGGGTCCGA AACTGAATAC CGGTCGTAAG CTGGTACAGT GGAACAGTCC ACTGGATGAA
AATGGCGATC CTTTGCCGCT CGACTGGGTA GCTTATCCTG ACCGTATAAA AGATTTCTTC
CGTACCGGAC ATACCTTTAC CAATAATATC GCGGTATCCA AAAGTGGAGA TGCCGGCAAT
TTCAGACTGT CCTACGGCAA TCTGCAGAAC CGGGGTATCG TACCGAATAC CGACCTGAAA
CGTGATAACC TGACCTTCTC AGGCACCTAT CACCTTAATA AATCCATCCA CCTCAGTACT
AACCTCGCTT ATACCAATAA CCGTAGTGCC AACCGTCCTT CTGCTTATAC AGAAAGCGTG
ACCATGCAGG TCTATAAACT GACGCCCAAC GTCAATATTC ATGCCCTGCG TAACTACTGG
ATACCTGGTA AAGAAGGACT GCAACAATAC AATCCTTACA GCACGGATGA CAACCCTTAC
CTGATCGCAT ACGAAGAAAC CAACAGTTAC ACCCGTAACC GCATGACGGG CAACGTACAG
GCTTCTTTCG ATATCAGACC TGATCTGACA CTGATGCTGC GTACCGGCCT CGATTACTAT
GCACTGAGCG AAGACCAGCG CAGACCATTC AGCGCCAAAA GAAATCCTAA AGGGGCTTAT
GTGCTGGAGA ATGACAACTT CAAAGAGCAG AACAGTGATT TCCTCCTGAC CTATAAACCC
AACCTGAAGA GCGATTTCAA AGTATCTGTT GCTGTCGGCG GTAACCGTAT GGACCAGGAA
AGTTCCTCCA ATCAGCAGTC TACCGGAAGT CTGACACTCC CAGGTGTATA CAATCTGTCT
AACGCAGCAG CGGGCGCGCT GGTCAATACA CAGTCCTTCC GTAAAAAGCG TATCAACAGC
TTATACGGTA TCGGTGAGGT AAGTTATCAT AACTACCTTT TCCTGAACCT GACGGCCCGT
AATGACTGGA GCAGCACCCT GCCCAAAGAG AACAATTCCT ACTTCTACCC TTCCGTATCC
CTGAGTGCGA TTCTCTCTGA TATGCTGAAT ATCAAATGGC AGGACCTGTC TTACCTGAAA
CTACGTGCCA ACTGGTCACA GGTAGGTGCT GATACCGATC CATATCAGCT GTACAACACC
GTACCATTTG ATACCGACTG GGGTAGTGTA AAACGTGCGA CCATCAGTTT CAATCAGAAA
AACAGTCTGC TGAAACCGGA GATCGCTACT TCTTATGAAG CCGGTGTAGA TGCAAGTTTC
TTTAAAGACC GTGTAGGTTT TAATATCACC TGGTATAAAA CCAACAACCG CAACCAGATC
ATCCAGGTAC CTACGACGAT TGCTTCCGGC GCCAGTACGA TGCTGATCAA TGCCGGTAAT
ATCCAGAACA GCGGATGGGA AATTGGTCTG AACCTGGTAC CTGTCAAAAG CACTGCTTTT
ACCTGGAAAT CTGATATCAA CTTCACCCGC AACGTCAATA AAGTCATCGA ACTGACACCA
GCGGTGACCA GTTATATGCT GGGCAGCACA GATGGCAGTA ACATCGAATA CCTGATCAAA
GAGGGGACTA AAATGGGCGA TTTCTATACA AGGAGCTGGG TGAAAGTGAA AGAAGGGGCT
TACGCCGGAC AAGCACTTCT GAACAGCAAC GGACTGCAGC AACAGGATGC TGATTTTGTA
AAGATCGGTA ACTACAATCC GGATTTTATG GTGGGTTTCA ACAACAATTT CCGCTACAAA
AACCTGAGCT GCAATTTCCT GATCGACTGG CGCAAAGGCG GTTCTTACTA TTCCTATGTC
GCGAAAAGCC TGATCCAGGA TGGTCGTACA ACCAATACAT TACGCGGCCG TGATGCGGCA
CATGGCGGTC TTACCTGGAC CGATGCCAAT GGTACTGTAC GCGATGACGG AATGATCATG
GAAGGTGTGA TCGCTAATGG CGACGGTACT TACAAACCGA ATGACGTCAT CATTGATGCG
GCTACCTATT ATGACAATAA ATACTGGAAG TATTACGAAA ACGAAACCTA CAGCGCTACT
TATGTAAAGC TGAAAGAAGT ATCGCTCACT TATGCATTCG GTCGTAGTGT AATGCAGCGT
ATCCCTTTCC TGAGCAACCT GTCCCTGTCC CTGATCGGTA ACAATCTTTA CACCTGGGCA
GCAGCAAAGA ACGGCTATGA TCCTGAGATC ACCATGAGCC TTTCCAACCA GCGTTACCAG
GGTGTTGGTC ACTGGACATT ACCAGGTACC CGCTCGTACG GTGCTAAACT CAGTTGCAAC
TTCTAA
 
Protein sequence
MHRILMLVFL LAVCSRLSAQ QPLEKVLTFK GSTITVQQFI QQVKSQQQLQ FTFDEDVNAL 
LSRTVHIRRN TVSLKEALDW LATDASIHCR ILNNYAILSV MPAHQKTPAA KPDPASLELP
QPSTKALGEV VVTALGIKKS ERNLGYSVAQ LSSKDVSDVP QPNVLNSLAA RVPGVNIRST
GSDPGSSVMV TIRGQSSISK DNQPLYVVDG VPVAPAIQNP AQAVDGKQTI DYGSPISDIS
ADDIASITVL KGASAAALYG SRAGSGVILI TTKSGAAKKG LGVSFNSSAV FDKAWMFPHF
QHEYGSGDWT GSDNTISTGA WGPKLNTGRK LVQWNSPLDE NGDPLPLDWV AYPDRIKDFF
RTGHTFTNNI AVSKSGDAGN FRLSYGNLQN RGIVPNTDLK RDNLTFSGTY HLNKSIHLST
NLAYTNNRSA NRPSAYTESV TMQVYKLTPN VNIHALRNYW IPGKEGLQQY NPYSTDDNPY
LIAYEETNSY TRNRMTGNVQ ASFDIRPDLT LMLRTGLDYY ALSEDQRRPF SAKRNPKGAY
VLENDNFKEQ NSDFLLTYKP NLKSDFKVSV AVGGNRMDQE SSSNQQSTGS LTLPGVYNLS
NAAAGALVNT QSFRKKRINS LYGIGEVSYH NYLFLNLTAR NDWSSTLPKE NNSYFYPSVS
LSAILSDMLN IKWQDLSYLK LRANWSQVGA DTDPYQLYNT VPFDTDWGSV KRATISFNQK
NSLLKPEIAT SYEAGVDASF FKDRVGFNIT WYKTNNRNQI IQVPTTIASG ASTMLINAGN
IQNSGWEIGL NLVPVKSTAF TWKSDINFTR NVNKVIELTP AVTSYMLGST DGSNIEYLIK
EGTKMGDFYT RSWVKVKEGA YAGQALLNSN GLQQQDADFV KIGNYNPDFM VGFNNNFRYK
NLSCNFLIDW RKGGSYYSYV AKSLIQDGRT TNTLRGRDAA HGGLTWTDAN GTVRDDGMIM
EGVIANGDGT YKPNDVIIDA ATYYDNKYWK YYENETYSAT YVKLKEVSLT YAFGRSVMQR
IPFLSNLSLS LIGNNLYTWA AAKNGYDPEI TMSLSNQRYQ GVGHWTLPGT RSYGAKLSCN
F