Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_4819 |
Symbol | |
ID | 8360995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | - |
Start bp | 6015230 |
End bp | 6018331 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644966969 |
Product | TonB-dependent receptor |
Protein accession | YP_003124454 |
Protein GI | 256423801 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0660191 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000000000144143 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTATTCCA CAGTTTCCTC CTATGGCCCT TTTATGCGCT ATAAAAAGCG TGGGCCTGTT GCCGGCTTTT CTTTTGTTCA ACTATTCCAT CTATGTTTCT TTTTAGTCCT GCTGCTGGTG GGTGTGCAGG CATCTGCTAC TAAGCCGTAT GCAGATACGC TTATCACTGG TCGCGTAAGC GGCGAAGGGA ATGAACCTTT ACCAGGTGCT AACGTAACCT TACGCAGTGC ACCTTCAAAA GGTGTGACGA CCCGTCCTGA CGGTACATAC AGTATACAGG CTTCACCACA GGATGTACTG GTTGTTACTT TTGTAGGATA TGCCCGTCAG GAGATCCTTA TTAATGGCAG ACAACGCGTA GATATTGTGT TACAGGCCCA GGCGACCAGT CTTGAAAAAG TAGTGGTAGT AGGTTATGGT ACGCAGAAAA GAGGAGAGGT GACGGGCGCT ATTTCCTCTG TTGATGCCGG TGATATCAAA GACTTGCCGG CTGCGAGTCT GCAACAATCC TTACAAGGTA AATCCGCCGG TGTACAGATT ACCCAGAACT CCGGTTCACC GGGTAAAAAT GCGCAGGTAC GCGTCAGGGG ATTAACCTCC ATCAACAACT CTGATGTACT GTATGTCGTA GACGGTGTAC CGCTCACCGC CAACGGGATC AATGCGATCG ATCCTTCGAA TATTGCTTCT GTTCAGATAT TGAAAGACGC TTCTTCGCAG GCTATCTATG GTTCAAGAGG TGCGAATGGT GTGGTGCTGA TCGAAACCAA GAAGGGGAGT AAGAATTCAT CTCATATTTA CTTTAATGCT TACGGCGGTG TACAGCAACT CAGGAAGAAA TTGAAGATGC TGGATACCAG GGATTTTATT ACGTTGAATA CCGAAGCATA TAAGAATGCG GGACAGGCTT CTCCATGGGG CGATCCTTCT CAGTATCCAT TGAACACCGA CTGGCAGGAT GCGATGTTCC GTACGGGACC TATACAGAGT TATGACCTGG CATTTGCCGG TGGCAGCGAA AAACTTACTT ACCGCCTGAG TGGTAACTAT TTTAACCAGG ACGGTATTAT CATTGGGTCT TCTTTCAAAC GTGCTTCTTT ATCGCTTAAT TCAACGTTTA AGGCAAATGA GAAGGTGGAA GTAGGAGAAA ATATCTCTAT TGCGAAAAGT ACGCAGTACC TGGTGGGAGA AGGCGCTACC AGCAGGGTGG ATCTGTTGTC AGCACTCAGT ATGGATCCTA CGGTGCCTTT GCTGGATTCA GCCGGTAACT ATGTGCCTGC CCGGTATTCA GATATCCAGC ATCCGATTGC CAGCATCAAC AATATCTCCC AAAATCATCC ATACAATAAC TGGTCTGTTG TAGGCGCTAC TTATCTGCAG ATCAAACCGT TGAAAGGACT GGCGTTGAGA TCCAACCTGA GTATTGATCT GAATTTTTCA GATGATAAAG CTTTTGCGCC TTCTTATTAT GTGTCTGCAG CGCAGAATAA TCCTGTGCCT AATCTTTCGC AGACAAAAGC CGTCGGCTAT AGCTGGACAT GGGATAACAC CGCTACCTAT GAACAGAAGT TCGGAGATGA TCATGAAATA AAATTACTGG CGGGTGTTTC TCAGCAGCGG TTCAGCTATG ATTTTATCAG GGGGAGTAAC CAGGGGCAAC CCAGTAATGA TCCATATCTG CAATACCTGG ATGCCGGTAC TGCCAATCCT GCTGTAGCCG GTAGTATGGT GCGCTGGAGC CTGCTTTCTT ATATCGGTCG CGTAAACTAT AATTTCAGAG AGAAGTACTT CTTAACTGCC ACCCTGAGAA GAGATGGTTC TTCGAAGTTT GGCGCTAATA ACAAGTATGG TAATTTCCCC TCTGCTTCAG TCGGTTGGTC GCTGACCAAA GAGGGTTGGT TTGACAATGT AAAGGCACTT CATTCATTGA TGTTAAGGGC CAGCTGGGGT GTGGTGGGGA ATCAGTCCAG CGCCGGTTAT TATGACTTCT CTTCCACCAT TAACAACTAT TATTATGCCT ATGGCAATCC GGCTAATGCG GCTTTGACTG CTGAACCTAA TGGACTGGGT AATCCTGATC TGAAATGGGA GCAGGTACGC CAATGGGATA TCGGTTTTGA TGTCAGGTTG TGGGAAGGCT TGAGTGGTAC AGTGGATTAT TATAACAAAA AGACAAAGGA TATGCTGCTG AGGATCCCTA TTTTATACGA AAGCGGATTC TCTACCGGTC CACTGACCAA TGTGGCTTCT ATGTTCAATG CAGGGTTTGA GTTTCAACTG GACTATAATC ACACCTTTGG CAATGGCGTA AGTCTGAATG CCGGTGTGAA TCTCTCCACA TTGAAAAATA AAGTTTTGAG TCTGCAGAAT GAGGGTGCAC AGATCTTCAG TAGTCCGAAT ATGACACGTG CCGGACAACG GGTAGCTGAA TTCTACGGTT ATGTATTTGA CGGTATTTTC CAGAACCAGC AGGAGATTGC CAACCATGCG ACACAGCCGA ATGCAGCGCC TGGCGATATC CGTTTTAAAG ACCTGAATAA TGATAAAGTG ATCAACGATC TGGATCAGAC TTACCTGGGA TCTCCTATCC CGAAAATCCA GTATGGTTTT AGTGTGGGTA CAGGGTATAA AGGCTTTGAC CTGAACCTGT CCTTTTTCGG CGTAGCCGGG AATAAGATTT ATCAATCCTA TAAATACAAC ACAAACGGCT TCTTTATTTC GAACTATAAT ATGGAGCAGG AGATACTGGG CCGGTGGCAT GGTGAAGGGA CGAGTAATAC CATTCCCCGC CTGAATGCAA ATGATCCGAA CTACAATGCG CGTGCTTCCA GTTATTATCT CAGTAGTGGT TCTTACCTGC GTCTGCGGAA CGTGACCCTG GGGTATAATT TCCCGGATAA ACTGATGGAG AGATGGAAGA TGAGCGGACT GCGGGTATTC GTCTCGGGAC AAAACCTGCT GACGTTTACA AAGTATGACG GTTATGACCC GGAAATCGGG ATCACATTCG GGGGGAATGC CGGTACGCTG AACCTAGGTC AGGATCAGGT GAACTATCCG CAGCCAAGAA TAATTACTGC CGGTTTGAAC CTTACTTTAT AA
|
Protein sequence | MYSTVSSYGP FMRYKKRGPV AGFSFVQLFH LCFFLVLLLV GVQASATKPY ADTLITGRVS GEGNEPLPGA NVTLRSAPSK GVTTRPDGTY SIQASPQDVL VVTFVGYARQ EILINGRQRV DIVLQAQATS LEKVVVVGYG TQKRGEVTGA ISSVDAGDIK DLPAASLQQS LQGKSAGVQI TQNSGSPGKN AQVRVRGLTS INNSDVLYVV DGVPLTANGI NAIDPSNIAS VQILKDASSQ AIYGSRGANG VVLIETKKGS KNSSHIYFNA YGGVQQLRKK LKMLDTRDFI TLNTEAYKNA GQASPWGDPS QYPLNTDWQD AMFRTGPIQS YDLAFAGGSE KLTYRLSGNY FNQDGIIIGS SFKRASLSLN STFKANEKVE VGENISIAKS TQYLVGEGAT SRVDLLSALS MDPTVPLLDS AGNYVPARYS DIQHPIASIN NISQNHPYNN WSVVGATYLQ IKPLKGLALR SNLSIDLNFS DDKAFAPSYY VSAAQNNPVP NLSQTKAVGY SWTWDNTATY EQKFGDDHEI KLLAGVSQQR FSYDFIRGSN QGQPSNDPYL QYLDAGTANP AVAGSMVRWS LLSYIGRVNY NFREKYFLTA TLRRDGSSKF GANNKYGNFP SASVGWSLTK EGWFDNVKAL HSLMLRASWG VVGNQSSAGY YDFSSTINNY YYAYGNPANA ALTAEPNGLG NPDLKWEQVR QWDIGFDVRL WEGLSGTVDY YNKKTKDMLL RIPILYESGF STGPLTNVAS MFNAGFEFQL DYNHTFGNGV SLNAGVNLST LKNKVLSLQN EGAQIFSSPN MTRAGQRVAE FYGYVFDGIF QNQQEIANHA TQPNAAPGDI RFKDLNNDKV INDLDQTYLG SPIPKIQYGF SVGTGYKGFD LNLSFFGVAG NKIYQSYKYN TNGFFISNYN MEQEILGRWH GEGTSNTIPR LNANDPNYNA RASSYYLSSG SYLRLRNVTL GYNFPDKLME RWKMSGLRVF VSGQNLLTFT KYDGYDPEIG ITFGGNAGTL NLGQDQVNYP QPRIITAGLN LTL
|
| |