Gene Information |
Locus tag | Cpin_3538 |
Symbol | |
ID | 8359705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 4405093 |
End bp | 4408113 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644965709 |
Product | TonB-dependent receptor plug |
Protein accession | YP_003123203 |
Protein GI | 256422550 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
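The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon; the record does not state either convention explicitly. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Cpin_3538.
length = gene_length_bp(4405093, 4408113)
print(length)                     # 3021, matches "Gene Length"
print(protein_length_aa(length))  # 1006, matches "Protein Length"
```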
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.740617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.292963 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC GTTTATTTCA ATTATTGGGA ATTTTCCCAA TACTGCTCAT CCCAATACTT TGTTTTTCGC AGCAGCTGAC GGTGAAGGGA AGGGTCACCA GCGCTGATAA CGGGGAGGCA TTACCCGGCG TGACTGTCAG GGTGAAAGGC GCCGCTACCG GCGCTGTATC GGGTCCTGAT GGCAGTTATA CCTTAGCGCT TAAACAACCT GCTACCACCC TCGTATTTAC ATTCACCGGT TATGCCACAC AGGAAGTAGC TGTGGAGGGG AAAAGTCAGC TTGACGTTGC ACTGCAATCG GGCAGGAAAG ACCTGGATGA AGTCGTGGTA GTAGGATATG GTACACAGCA GAAAGCCAAC GTCGCTGGTT CCATCACATC AGTAAAAACT GCTGATCTTA AACAGACGCC TATCTCCAAC GTCGTACAAG GCTTACAGGG CCGTGTATCG GGTGTACAGA TTACGCAGAA TTCCGCTGCT CCCGGTGGAA ACATCAGTAT GCGTATCCGG GGTGTGAACT CCATCAACGG TACCTCTGAA CCACTCTACG TAGTAGACGG TGTGCAACTG TCTAACTCCG GTGGTGTCAA TGACATCAGC TCCCTGTCTA TTATCAACCC CAATGACATT GAGTCCGTCG CTGTGCTGAA AGACGCTTCC GCAACGGCTA TCTATGGTGC GCGTGGGGCA AATGGGGTAG TGCTGATTAC GACCAAACGG GGTAAATCCG GCAAGACCGT AGTCAGTTAC GACGGCTATT ACGGTGTACA ACGTACGACT AAACGGATGG ACATGATGAA TGCAGCCGAA TTTGCCGCAC TGGAAAATGA GATCTATAAA ACAAGCGTAT ACGAGAATCC CGCTGCATTA GGTGAGGGTG TAGACTGGCA GGATAAAATA TTCCGTGATG CACCTATGCA GAATCACCAG GTAACAATCG CGGGTGGTTC GGACAGAACA CAGTTCGCCA TGTCAGGCAA CTATTTCAAA CAGGATGGTA TCATGCTCAG TTCTGACTTT ACACGTTATT CCTTCCGCCT GAATTTCGAT CACCGGGTTA GTAATCTGTT GAGGGTGGGG GCCAGCATGT TTACCAGTTA TGTAGTCAAT AATACCGTAC CCGCTGGTTC AACCAGTATA GACGCGGGGG CCGTAACCGG TAGTATGTTG GGCGCCGCAA TGGGCGCTCC GCCGACATTA GAACCTTACA GGGCAGATGG TACGATATTT CCCTTCGGGG ATCAGATGAA TGGCAGATAC CGGGAGGTAG TCAATCCTAT CGGACTGTCA ATGATCCTGA ACAGGGATAA GGTCAACCGG ACATTAGGAA ATATCTATGC AGAGATTACG CCATTCAAAG GACTGACTTA CCGCGCTAAT TTCAACCCGG TATTATCTGG TACACTGGCG GACTACTATT CTCCCCGTGC GATTATGAAT ACAGGTGATC TCGCTGGTGG TGGCGGTTCT GCTTCAAAGA CGAATTCCAA TAACGTCGTC TTACTCCATG AAAGTATCAT TACCTACGAA ACGCGTATCG CCAAAGAACA CGCCTTAAAG GTAACAGGCG TATTCGCTAC ACAGAGTAAC AATTCAAATT CCAATACTAT TAACGCCAGT AAGTTTCCCA ACGATGCCAC TGCAAACGAA GCCGTGCAAC TGGCCACAGA AAGGACCGTC AGCAGCAGCC GTTCCAAAGA CAGACTCGAC TCCTATATGG GGCGTATTAA CTACGGCTTC CGCGATAGAT ACCTGCTGGA CCTGATCGCG CGTGTAGATG GTTCTACGAA GTTCGGATCT 
AATAATAAAT ACGGTTTCTT TCCCGCTGCC GCTATCGCCT GGCGTGCGTC TGAAGAACCT TTTATGAAAG GTATGTCCGG TGTCAGTAAC CTGAAACTCA GATTCAGCTA CGGGCTTACC GGTAATGCCG GTGCGATCGA TGCCTATAAG TCACTATCAC TGCAAGGTAC TTCCGGTCTT TATTACTTCG ATCACAACCC TGTTATCGGC ATCCGTCCAA CGGGTATTGC CAACAGAGAC CTGAAATGGG AACGCTCCTT ACAGGCAGAT TTCGGGTTTG ACCTCGGTCT CTGGAATGAT CGTGTAAATG TGACAGCAGA TATCTATCAC AAGAAAACAG ATGATCTGCT GTTTGTGAAA ATACTCCCTG GTTCATCCGG CTATAGCGAA GTAACCGGCA ATTTCGCCAG TATGGAAAAT AAAGGGCTGG AGTTTTCTAT GGACGCTGTC ATCCTGGACA AAGCCGTAAA ATGGTCAGTG GCGGGTAATA TCTCCTTCAA CAGAAATAAA CTGCTGTCCC TGGCAGATGG TCTTTCAGAA TATTCAGTAA GTAACTACCA GGTCATGCAG GTAGGTCAAC CGCTGGCTTT ATTCAAAACT TACGTATTTG ACGGTATCTA TCAGACGGGG GAGACGGTGC TGGAAGGATC CGGTAGCCGT ACCGGTGGTG TGAAAGTAAG GGACCTTAAC AAAGATGGTA TCATCAGCGC CGGAGACCAG ACGATTGTCG GTAACGCAAA TCCCTCTTTT ATCTATGGCT TCTCCACCAA CCTGAGTTAT AAAAACTTCG ATTTAAGTGC CTTCTTCTCC GGTGTGCAAG GCAATAAAGT ATACAACCTG ATCCGCTATA CTTTTGAAAA CCCATTGGGT GGCAGGAATA TGTACAAGTC GCTGGTTAAC CGCTGGTCTC CTGATAATCC CAGCAATGAG TATGTCAGTG GTTTTCAGGG TGGCAGATTG CCTTTGAGCG ATCGTTTTAT GGAAGACGGT TCTTTCCTCC GCTGTAAAAA TATTACCCTC GGTTATCGTC TGCCAAAGAT CAGCAATATC AGTTCCGCAA GGGTATATGT CAGCGCCAAC AACCTGTTTA CCCTGACGGA CTATACCGGT TACGATCCCG AAGTGAATAC CTTCGGTAAT TCCAATAAAC AGATAGGGGT AGATAATCTG GTATATCCCA CTGCCCGTTC ATTCCTGCTT GGTCTGCAGG TAGCATTCTG A
Protein sequence | MKKRLFQLLG IFPILLIPIL CFSQQLTVKG RVTSADNGEA LPGVTVRVKG AATGAVSGPD GSYTLALKQP ATTLVFTFTG YATQEVAVEG KSQLDVALQS GRKDLDEVVV VGYGTQQKAN VAGSITSVKT ADLKQTPISN VVQGLQGRVS GVQITQNSAA PGGNISMRIR GVNSINGTSE PLYVVDGVQL SNSGGVNDIS SLSIINPNDI ESVAVLKDAS ATAIYGARGA NGVVLITTKR GKSGKTVVSY DGYYGVQRTT KRMDMMNAAE FAALENEIYK TSVYENPAAL GEGVDWQDKI FRDAPMQNHQ VTIAGGSDRT QFAMSGNYFK QDGIMLSSDF TRYSFRLNFD HRVSNLLRVG ASMFTSYVVN NTVPAGSTSI DAGAVTGSML GAAMGAPPTL EPYRADGTIF PFGDQMNGRY REVVNPIGLS MILNRDKVNR TLGNIYAEIT PFKGLTYRAN FNPVLSGTLA DYYSPRAIMN TGDLAGGGGS ASKTNSNNVV LLHESIITYE TRIAKEHALK VTGVFATQSN NSNSNTINAS KFPNDATANE AVQLATERTV SSSRSKDRLD SYMGRINYGF RDRYLLDLIA RVDGSTKFGS NNKYGFFPAA AIAWRASEEP FMKGMSGVSN LKLRFSYGLT GNAGAIDAYK SLSLQGTSGL YYFDHNPVIG IRPTGIANRD LKWERSLQAD FGFDLGLWND RVNVTADIYH KKTDDLLFVK ILPGSSGYSE VTGNFASMEN KGLEFSMDAV ILDKAVKWSV AGNISFNRNK LLSLADGLSE YSVSNYQVMQ VGQPLALFKT YVFDGIYQTG ETVLEGSGSR TGGVKVRDLN KDGIISAGDQ TIVGNANPSF IYGFSTNLSY KNFDLSAFFS GVQGNKVYNL IRYTFENPLG GRNMYKSLVN RWSPDNPSNE YVSGFQGGRL PLSDRFMEDG SFLRCKNITL GYRLPKISNI SSARVYVSAN NLFTLTDYTG YDPEVNTFGN SNKQIGVDNL VYPTARSFLL GLQVAF
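The sequences above are printed in space-separated blocks of 10 characters. As a rough sketch of how such a field might be normalized and sanity-checked, the helpers below strip the block spacing, compute a GC fraction, and test whether a string looks like a complete CDS. All function names are hypothetical. The start-codon check is a simplification: it accepts only ATG, although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: ATG start, stop-codon end, length divisible by 3."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Illustration on the first two blocks of the gene sequence only.
head = join_blocks("ATGAAAAAAC GTTTATTTCA")
print(head)       # ATGAAAAAACGTTTATTTCA
print(len(head))  # 20
```

Applied to the full "Gene sequence" field, `len(join_blocks(...))` would be expected to equal the stated gene length of 3021 bp if the record is internally consistent.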