Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_5203 |
Symbol | |
ID | 8361380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | - |
Start bp | 6577045 |
End bp | 6580059 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644967351 |
Product | TonB-dependent receptor |
Protein accession | YP_003124835 |
Protein GI | 256424182 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00178942 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGC AGTTTCTTTT AATGGTAGCG GTATGGCTTT CGGCCGTTAG CCCTTCCATG GCCAACGCGG CAGGTCATTT TGTATATGAT CAGGTAGCCC GTCAGGAACC TATCAAGGGA GTGGTAGTCG GTTCCGATGG TGCGCCGATA CCAGGCGCTT CGATCAAATC GCTGGCCTCC AACAAGGGTA CAACGACTAA CGAGCGCGGA GAATTTACCT TGCAGGGAGC TGCAGGCGAC AAACTGGTGA TCACCTTCAT TGGTTACACC CAACAGGAAG TGGTAGCCAC CAGTACGCCA ATGCGTATTA CCTTGTCAGG CAGCAACAGC CAGCTGGGCG AGGTAGTAGT AGTAGGTTAT GGTAGCCAGA CCAAGGCTGA CGTAACCGGT GCACTGACCC AGCTGAAAGC AGACAATATC AAACAGGGTG TAAACGTCTC TGTTGACAAC ATGCTCCAGG GTAAAGTATC TGGTGTGCGT ATCGCACAGT CCAGCGGTGA ACCTGGTGCA GGTGTGGATG TATTCATCCG TGGTGTGGGT TCTATCCGTA GTGGTAGTAC TCCGCTTTTC GTAGTGGATG GTATTCCATT GAGTAATGAT AACGTGAGTG CAGGAGGTAC AGACTTCGGT CTGGGTTCTT CTGAACCTAA AAACCCGCTG AACTTCCTGA ACACCAGCGA TATTGAAACC ATCACCGTTC TGAAAGACGC TTCTGCAGCG GCTATCTACG GTGCAAGGGG TTCTAACGGT GTGGTACTGA TTACTACCAA ACGTGGTAGC AGAGGAACTT CTACACTGAC TTACGATGCG TACCTCGGTA CTTCTAAAGT GATCAAAAAA CTGGACGTAC TGAATGCTGA CGAATATCGT AAAGCGATCA AAGATCCGGC TTATGACCAC AAAGGCAATA CTGACTGGCA GGATGTGATC TATCGCAATG CATTTGTACA GAACCATAAC CTGTCTTTCG CTAAAACTAC CAACACCGGT AGCTATATGG CGTCTTTCTC CCGTATGGAC CAGGATGGTA TCGTTGAAAC CAGCTCATTC AAAAGAACAA CTGCAAGACT GAATGCGGAA GAATCTTTCT TCGACGACAG GCGCCTGGTT GTTAAGCTGA ACCTGACTGC AAGTGATATC GACGAAACCG GTATCCCTAA CGGTAATACC GCAGGTTCTG ATGGTCAGCT GATCATTCAT GCACTGATGG CTAACCCTAC CCGTTCAGTA TATGATTCTC TGGGTGGTTA TACCAACTTT AACATGAACG CTCACTATAA CCCTGCTTAT CTGTTGAGCA TCTATAAAGA TAAAACTAAC ACCTTCCGTG TACTGGGTAA CGTAGAAGCG TCCCTGAGAC TGTTCAAAGG TCTGAACTAT CGTATTAACT ACGGCATCGA TAAGTCAACT TCTGAGCGTA ACTCTACCAT CTATCCGAAC ATTACGGACA GAACGCCACT GGGAGCTTAT GCACAGAACA ACCTGCAGTC ACATACAACC CTGCTGGATC AGTACCTGAC TTATAACCGT TCTATCGATA AACATTCATT CGAAGTATTG GGTGGTTTCT CTTATCAGCA GTTTAAATTC GCTGGTACTG CTTTCGGTAT GATCAATATC GCAAAACAGG GACAGGGTGT AGATCCTGAA TACAACCCGG GTTATTCCGG TACGCCAACC ATTCCAAGTG GTTACGCACA GGAGAATGAA TTACAGTCTT ACTTCGGACG TGTAAACTAT AATTACGATA ACCGTTACCT GGTAACAGCC TCCCTCCGTG CAGATGGTTC TACCCGTTTC GGTGAAAGTA AAAAATATGG TTACTTCCCA TCCTTCGCAC TGGGCTGGAC CCTGTCTCAG GAAGGTTTCA TGAAAGACAT CAGCGCTATT CAGAACTTAA AGCTGAGAGC TAGCTGGGGT CAGACTGGTA ACCAGGAAGT ACCAAACAAA ATCACACAGG CAAGCTACTC TCTGGCTACT TCAGCAGGTT ACTACCTGTA TGACGACCTG AAAGTGGTAA ATGGTGTACT GGTAAACCGT ACTGCTAACC CGAACCTGAA ATGGGAAATG GTACAACAGT ATAACATCGG TGCTGACTTC GACCTGTTCA AGGGTAAATT ATACGGTTCC GTGGAGTACT ATAACAAGAC TACCAAAGAT CCTATCCTGA ACATCCCTTC TGGTCCGCTG AGCCCAACCA CTACCGTATG GAAAAACGTA GACGCCAGCA TCGTGAACAA AGGATTCGAG TTCACCCTGG GAACTACCCT GATCCGTACA AAAGATTTCA GCTGGTCACT GGATGTGAAC GGTGCTACTA TCTCTAACGT GATCAAAGAT CTGCCGGTTT CTGAACTTTA CTCGGGTAGC ATTTCCGGTC CTGGTCTGTC TGGTGTAAAC GCTAACATCT ACAAGAATGG CTACGAAGCG GGTTCATTCT TCATGCTGAA ACACCTGGGT TATGACAAGG ATGGTAAAGA TATCTTTGAA GACAAAAACG ACGACGGCGT AATCAACGCT GCTGACAGAC AGATCTTCGA AGGTGCTATT CCTAATTTCA ACTTTGGTCT GAATAGCCAG ATGCGTTACA AAAAGTTCGA CCTGTCATTT GCGGTGATCG GACAGACAGG CGGTTACCTG GTTAACAACA CTGCACTGGA TCTGAACATC AACAGCCTTG CTTCTGACCG TAATGTACTG AGAAAGTTCT ATGAAGCAAA CGCAAACCCT GCAAATGCGG TACAGCTGTC TACTTTATAT CTGGAGAAAT CTGACTTCGT TCGTTTAAGC AACCTGCGTC TCGGATACAC TTTACCGCTT GAGCGCGTAC AGTGGCTGAA ATCCGTTAAC GTATATGTAA GTGCTTCTAA CCTGCTGACC ATCACTGGTT ACTCTGGTTA CGATCCGCTG GTAAACACTA CCAAAACAGT TGGTGGTAAC CAGTCTCTGG GTATAGATTA CACTACTTAT CCGGCTGCAA AAACATTCCT CTTCGGCGCA ACAGTTAAAT TTTAA
|
Protein sequence | MNKQFLLMVA VWLSAVSPSM ANAAGHFVYD QVARQEPIKG VVVGSDGAPI PGASIKSLAS NKGTTTNERG EFTLQGAAGD KLVITFIGYT QQEVVATSTP MRITLSGSNS QLGEVVVVGY GSQTKADVTG ALTQLKADNI KQGVNVSVDN MLQGKVSGVR IAQSSGEPGA GVDVFIRGVG SIRSGSTPLF VVDGIPLSND NVSAGGTDFG LGSSEPKNPL NFLNTSDIET ITVLKDASAA AIYGARGSNG VVLITTKRGS RGTSTLTYDA YLGTSKVIKK LDVLNADEYR KAIKDPAYDH KGNTDWQDVI YRNAFVQNHN LSFAKTTNTG SYMASFSRMD QDGIVETSSF KRTTARLNAE ESFFDDRRLV VKLNLTASDI DETGIPNGNT AGSDGQLIIH ALMANPTRSV YDSLGGYTNF NMNAHYNPAY LLSIYKDKTN TFRVLGNVEA SLRLFKGLNY RINYGIDKST SERNSTIYPN ITDRTPLGAY AQNNLQSHTT LLDQYLTYNR SIDKHSFEVL GGFSYQQFKF AGTAFGMINI AKQGQGVDPE YNPGYSGTPT IPSGYAQENE LQSYFGRVNY NYDNRYLVTA SLRADGSTRF GESKKYGYFP SFALGWTLSQ EGFMKDISAI QNLKLRASWG QTGNQEVPNK ITQASYSLAT SAGYYLYDDL KVVNGVLVNR TANPNLKWEM VQQYNIGADF DLFKGKLYGS VEYYNKTTKD PILNIPSGPL SPTTTVWKNV DASIVNKGFE FTLGTTLIRT KDFSWSLDVN GATISNVIKD LPVSELYSGS ISGPGLSGVN ANIYKNGYEA GSFFMLKHLG YDKDGKDIFE DKNDDGVINA ADRQIFEGAI PNFNFGLNSQ MRYKKFDLSF AVIGQTGGYL VNNTALDLNI NSLASDRNVL RKFYEANANP ANAVQLSTLY LEKSDFVRLS NLRLGYTLPL ERVQWLKSVN VYVSASNLLT ITGYSGYDPL VNTTKTVGGN QSLGIDYTTY PAAKTFLFGA TVKF
|
| |