Gene Cpin_5203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5203 
Symbol 
ID8361380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6577045 
End bp6580059 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content47% 
IMG OID644967351 
ProductTonB-dependent receptor 
Protein accessionYP_003124835 
Protein GI256424182 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00178942 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGC AGTTTCTTTT AATGGTAGCG GTATGGCTTT CGGCCGTTAG CCCTTCCATG 
GCCAACGCGG CAGGTCATTT TGTATATGAT CAGGTAGCCC GTCAGGAACC TATCAAGGGA
GTGGTAGTCG GTTCCGATGG TGCGCCGATA CCAGGCGCTT CGATCAAATC GCTGGCCTCC
AACAAGGGTA CAACGACTAA CGAGCGCGGA GAATTTACCT TGCAGGGAGC TGCAGGCGAC
AAACTGGTGA TCACCTTCAT TGGTTACACC CAACAGGAAG TGGTAGCCAC CAGTACGCCA
ATGCGTATTA CCTTGTCAGG CAGCAACAGC CAGCTGGGCG AGGTAGTAGT AGTAGGTTAT
GGTAGCCAGA CCAAGGCTGA CGTAACCGGT GCACTGACCC AGCTGAAAGC AGACAATATC
AAACAGGGTG TAAACGTCTC TGTTGACAAC ATGCTCCAGG GTAAAGTATC TGGTGTGCGT
ATCGCACAGT CCAGCGGTGA ACCTGGTGCA GGTGTGGATG TATTCATCCG TGGTGTGGGT
TCTATCCGTA GTGGTAGTAC TCCGCTTTTC GTAGTGGATG GTATTCCATT GAGTAATGAT
AACGTGAGTG CAGGAGGTAC AGACTTCGGT CTGGGTTCTT CTGAACCTAA AAACCCGCTG
AACTTCCTGA ACACCAGCGA TATTGAAACC ATCACCGTTC TGAAAGACGC TTCTGCAGCG
GCTATCTACG GTGCAAGGGG TTCTAACGGT GTGGTACTGA TTACTACCAA ACGTGGTAGC
AGAGGAACTT CTACACTGAC TTACGATGCG TACCTCGGTA CTTCTAAAGT GATCAAAAAA
CTGGACGTAC TGAATGCTGA CGAATATCGT AAAGCGATCA AAGATCCGGC TTATGACCAC
AAAGGCAATA CTGACTGGCA GGATGTGATC TATCGCAATG CATTTGTACA GAACCATAAC
CTGTCTTTCG CTAAAACTAC CAACACCGGT AGCTATATGG CGTCTTTCTC CCGTATGGAC
CAGGATGGTA TCGTTGAAAC CAGCTCATTC AAAAGAACAA CTGCAAGACT GAATGCGGAA
GAATCTTTCT TCGACGACAG GCGCCTGGTT GTTAAGCTGA ACCTGACTGC AAGTGATATC
GACGAAACCG GTATCCCTAA CGGTAATACC GCAGGTTCTG ATGGTCAGCT GATCATTCAT
GCACTGATGG CTAACCCTAC CCGTTCAGTA TATGATTCTC TGGGTGGTTA TACCAACTTT
AACATGAACG CTCACTATAA CCCTGCTTAT CTGTTGAGCA TCTATAAAGA TAAAACTAAC
ACCTTCCGTG TACTGGGTAA CGTAGAAGCG TCCCTGAGAC TGTTCAAAGG TCTGAACTAT
CGTATTAACT ACGGCATCGA TAAGTCAACT TCTGAGCGTA ACTCTACCAT CTATCCGAAC
ATTACGGACA GAACGCCACT GGGAGCTTAT GCACAGAACA ACCTGCAGTC ACATACAACC
CTGCTGGATC AGTACCTGAC TTATAACCGT TCTATCGATA AACATTCATT CGAAGTATTG
GGTGGTTTCT CTTATCAGCA GTTTAAATTC GCTGGTACTG CTTTCGGTAT GATCAATATC
GCAAAACAGG GACAGGGTGT AGATCCTGAA TACAACCCGG GTTATTCCGG TACGCCAACC
ATTCCAAGTG GTTACGCACA GGAGAATGAA TTACAGTCTT ACTTCGGACG TGTAAACTAT
AATTACGATA ACCGTTACCT GGTAACAGCC TCCCTCCGTG CAGATGGTTC TACCCGTTTC
GGTGAAAGTA AAAAATATGG TTACTTCCCA TCCTTCGCAC TGGGCTGGAC CCTGTCTCAG
GAAGGTTTCA TGAAAGACAT CAGCGCTATT CAGAACTTAA AGCTGAGAGC TAGCTGGGGT
CAGACTGGTA ACCAGGAAGT ACCAAACAAA ATCACACAGG CAAGCTACTC TCTGGCTACT
TCAGCAGGTT ACTACCTGTA TGACGACCTG AAAGTGGTAA ATGGTGTACT GGTAAACCGT
ACTGCTAACC CGAACCTGAA ATGGGAAATG GTACAACAGT ATAACATCGG TGCTGACTTC
GACCTGTTCA AGGGTAAATT ATACGGTTCC GTGGAGTACT ATAACAAGAC TACCAAAGAT
CCTATCCTGA ACATCCCTTC TGGTCCGCTG AGCCCAACCA CTACCGTATG GAAAAACGTA
GACGCCAGCA TCGTGAACAA AGGATTCGAG TTCACCCTGG GAACTACCCT GATCCGTACA
AAAGATTTCA GCTGGTCACT GGATGTGAAC GGTGCTACTA TCTCTAACGT GATCAAAGAT
CTGCCGGTTT CTGAACTTTA CTCGGGTAGC ATTTCCGGTC CTGGTCTGTC TGGTGTAAAC
GCTAACATCT ACAAGAATGG CTACGAAGCG GGTTCATTCT TCATGCTGAA ACACCTGGGT
TATGACAAGG ATGGTAAAGA TATCTTTGAA GACAAAAACG ACGACGGCGT AATCAACGCT
GCTGACAGAC AGATCTTCGA AGGTGCTATT CCTAATTTCA ACTTTGGTCT GAATAGCCAG
ATGCGTTACA AAAAGTTCGA CCTGTCATTT GCGGTGATCG GACAGACAGG CGGTTACCTG
GTTAACAACA CTGCACTGGA TCTGAACATC AACAGCCTTG CTTCTGACCG TAATGTACTG
AGAAAGTTCT ATGAAGCAAA CGCAAACCCT GCAAATGCGG TACAGCTGTC TACTTTATAT
CTGGAGAAAT CTGACTTCGT TCGTTTAAGC AACCTGCGTC TCGGATACAC TTTACCGCTT
GAGCGCGTAC AGTGGCTGAA ATCCGTTAAC GTATATGTAA GTGCTTCTAA CCTGCTGACC
ATCACTGGTT ACTCTGGTTA CGATCCGCTG GTAAACACTA CCAAAACAGT TGGTGGTAAC
CAGTCTCTGG GTATAGATTA CACTACTTAT CCGGCTGCAA AAACATTCCT CTTCGGCGCA
ACAGTTAAAT TTTAA
 
Protein sequence
MNKQFLLMVA VWLSAVSPSM ANAAGHFVYD QVARQEPIKG VVVGSDGAPI PGASIKSLAS 
NKGTTTNERG EFTLQGAAGD KLVITFIGYT QQEVVATSTP MRITLSGSNS QLGEVVVVGY
GSQTKADVTG ALTQLKADNI KQGVNVSVDN MLQGKVSGVR IAQSSGEPGA GVDVFIRGVG
SIRSGSTPLF VVDGIPLSND NVSAGGTDFG LGSSEPKNPL NFLNTSDIET ITVLKDASAA
AIYGARGSNG VVLITTKRGS RGTSTLTYDA YLGTSKVIKK LDVLNADEYR KAIKDPAYDH
KGNTDWQDVI YRNAFVQNHN LSFAKTTNTG SYMASFSRMD QDGIVETSSF KRTTARLNAE
ESFFDDRRLV VKLNLTASDI DETGIPNGNT AGSDGQLIIH ALMANPTRSV YDSLGGYTNF
NMNAHYNPAY LLSIYKDKTN TFRVLGNVEA SLRLFKGLNY RINYGIDKST SERNSTIYPN
ITDRTPLGAY AQNNLQSHTT LLDQYLTYNR SIDKHSFEVL GGFSYQQFKF AGTAFGMINI
AKQGQGVDPE YNPGYSGTPT IPSGYAQENE LQSYFGRVNY NYDNRYLVTA SLRADGSTRF
GESKKYGYFP SFALGWTLSQ EGFMKDISAI QNLKLRASWG QTGNQEVPNK ITQASYSLAT
SAGYYLYDDL KVVNGVLVNR TANPNLKWEM VQQYNIGADF DLFKGKLYGS VEYYNKTTKD
PILNIPSGPL SPTTTVWKNV DASIVNKGFE FTLGTTLIRT KDFSWSLDVN GATISNVIKD
LPVSELYSGS ISGPGLSGVN ANIYKNGYEA GSFFMLKHLG YDKDGKDIFE DKNDDGVINA
ADRQIFEGAI PNFNFGLNSQ MRYKKFDLSF AVIGQTGGYL VNNTALDLNI NSLASDRNVL
RKFYEANANP ANAVQLSTLY LEKSDFVRLS NLRLGYTLPL ERVQWLKSVN VYVSASNLLT
ITGYSGYDPL VNTTKTVGGN QSLGIDYTTY PAAKTFLFGA TVKF