Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1535 |
Symbol | |
ID | 6375213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1657426 |
End bp | 1660344 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642684028 |
Product | TonB-dependent receptor |
Protein accession | YP_001959942 |
Protein GI | 189500472 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.39302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.5632 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTATTTC ACACTTCCTT AACCGATCCT TTGCACAATC GTCACAGGAG ACTGTTTGCC GGAAAATCCA AAAGAGATAT TCTGTTTGCG ACATGGATTT CCCTCTCTTT TCATCGTGGC TTTCCAACAT TAACGGAAAA TACAATGAGA CATTTGTTTT CGTATTTATT CATCTGTCTG ACTACGCTTG CATGCCATGG ATTTGCAGCA CCCGGCATTG CTCAAGCTGA AACAACAGGC TCCTCTTCCG GCACCACGAT AACCGGTCGT GTAGTCGATG AGGTAGACGG TCTTCCGCTC CCCGGTGCTA ACATCTCCGT CAAAGGCACG TCAAAAGGTT CTATTACCGG GCAGGACGGA CGTTACCGGC TTGAAAACGT TTCCGGTGCT ACTGCAGTCA TCGAAGCATC CTATATCGGA TATGTAAAAA GTGATATTCC AATAACCCTT TCACCCGGCA AAGCCGTTAT AAAAAATATA CGGCTTAAAC CCGGCGTCTT GATCAGCGAG GAAATCACCG TTGTTGGCGA ACTCCTGAAG GGCCAGGCAA AAGCACTGAA TCAGCAGAAA AACGACGTCA ATGTCACCAA TGTTGTTGCT TCAGACCAGA TCGGTAAATT TCCCGACTCC AACATCGGCG ACGCCCTTAA AAGAATTCCG GGGATCAGCG TTTTTAATGA CCAGGGTGAA GCGAGATTCG GCCATGTACG AGGAACAGAG CCCCGTTTCA ACTCTGTCAC CGTCAACGGA GAACGCATTC CTTCTGCTGA GGCAGAGAAC AGGACAATCC AGCTCGACCT CGTTCCATCG GACATGATCC AGACCATCGA GGTCACCAAA GCCTTGACAC CCGACATGGA TGCCGACGCA ATCGGTGGTT CGATCAATCT TGTAACGAAA ATCCCTGCGG AAGAAAGATT TTCGCTCTCA GCCGGTGGAG GCTTGAACTT CCTCGATGGG ACCGGTGGCG AAAGATACCA GTTCGGCGGC ACCTACGGCA ACCGGTTCGC TGACGAAAAA CTCGGCGTAC TTTTCAGCCT CTCTTATGAC AACAACGATT TTGGCTCTGA CAATATCGAA GGGGAATGGG ACGCTGGTAA TGACGGCATA GAAGGAATCA AGGAGTTCCA GGTAAGAAAG TATGATGTTC AGCGTATCCG CAGAAGTTTT TCCGGAGCGC TCGACTACCG CTTCAACGAA AACCATATCC TCAAGTTCAA TGGTATTTAC AACTGGAGAG ACGACTTTGA AAACCGCTAC AGGATTAAGT ACAAGGATCT CGATGAAGAT TTTGCCACGG TGGAACGTGA GACAAAAGGT GGCACGGAGA ATGACGCCCG CCTTGAGGAT CAGCGCATGA TGTCATTCAC GCTTGGCGGC GAGCATGATT TCGGCAAGCT GGATCTCGAC TGGCAAGCTT CATACTCAAA AGCTTCGGAA GATCGTCCGA ATGAACGATA CATTAATTTC AGAGCGAAAA ACCAGCCGTT TACCGTCGAT ATCAGCAACC CGGAAAAACC GTTTGTAACG GTCTTGAACC CGGATGTGTC AGGCGGTATC AGCGACAGCG AAGACTGGAA GCTCAAGGAG CTGACAGAAG AACATCAGTA CACCGAAGAT ATCGACAAGA ATTTCGGCCT GAACTTTAAC TATGCTGCAA CCGACGCACT TGCATTCAAA TTCGGCGGTA AAATCAGGGA CAAGAAAAAG AAGCGTGATA ACGATTTTTA CGAATATGAA CCGGTTGATG AAGATGCATT CCGCGATGAA GTCTTTGCCA ACCTCAAAAA TGAAACCAAG GATCATTTTC TTCCGGGTAA CAAATACAAG TCAGGTGTCT TTGTTTCCAA CAAGTTTCTG GGCGGCCTCG ACCTAGACGG CGATGATTTC GACAAGGAAC TTGTAAAGGA AGGGCTTGCC GGAAACTTCG ACGCAAAGGA ACAGATAAAA GCGGTCTACG CAATGGCAAC CTGGGACATC AGCGACAACA CAACACTGCT TGGAGGTGCA AGGCTTGAGC ATACCAGAAA TGAATACGAT GCATTCAAGT ACTTTGCCGA CGAAGACTCA CTTGCTGCAG TAACAGGCAA GCCGTCTGAC TACACCAACG TTTTGCCTTA TGTACATCTG CGTTATAACG TCAACGATCA GACAAATATC AAGCTTGCCT ATACGCACAG TCTTGCTAGA CCAAACTATT TCGATCTGGC CCCGTACCAG GAAATCGTTG CCGAAGACGA GGAAATAAAA TTAGGCAATC CGGCACTTGA ACCGACCCTC TCGAAAAACG TCGACCTGAT GATCGAGCAT TACCTGAGCG ATATCGGCAT TCTTTCTGCC GGCGTCTTCT ACAAGTCGAT CAGCGACTTC ATTGTCACCA AAAAAGAAGA CGTCGATTAT TCGGGAGACA CGTTTGAGCA ATTCCAGCCG GTCAATGCCG GTGACGGAAC ACTTATGGGC ATAGAAACTG CCGCACAGTT CCAGCTTCCT TTCATCCCGG GCCTTGGCCT TTACCTGAAC TATACCTACA CGCACTCTGA AATCGACAAC TTCGATATCA AGGGACGTGA AGGCGACGAC CTGCCGCTGC CAGGCAACCC GGAGCACACA GCAAATGCGT CGATCGCTTA CGAGAACGGT CCGTTCAATA TCCGTCTCTC AGGAAACTAT CACAGCGACT TCATCGATGC TGAAGAGGGA TCGATCGGCG AGAACAAGTG GGAGGACCGC TACTACGACA GCTCATTCAC TCTCGACCTC AATGGCGGCT ACCGCATGAG CGACATAGTC CAGCTATACT TCGAGGTCAG CAACCTGACA AACCAGCCGC TGCGCTTCTA TCAGGGTGAA AAGCAGTATC TCGCCCAGGA AGAATGGTAT GACCGAAGGT TCCTGGTTGG CGTCAAGGCT GATTTCTGA
|
Protein sequence | MLFHTSLTDP LHNRHRRLFA GKSKRDILFA TWISLSFHRG FPTLTENTMR HLFSYLFICL TTLACHGFAA PGIAQAETTG SSSGTTITGR VVDEVDGLPL PGANISVKGT SKGSITGQDG RYRLENVSGA TAVIEASYIG YVKSDIPITL SPGKAVIKNI RLKPGVLISE EITVVGELLK GQAKALNQQK NDVNVTNVVA SDQIGKFPDS NIGDALKRIP GISVFNDQGE ARFGHVRGTE PRFNSVTVNG ERIPSAEAEN RTIQLDLVPS DMIQTIEVTK ALTPDMDADA IGGSINLVTK IPAEERFSLS AGGGLNFLDG TGGERYQFGG TYGNRFADEK LGVLFSLSYD NNDFGSDNIE GEWDAGNDGI EGIKEFQVRK YDVQRIRRSF SGALDYRFNE NHILKFNGIY NWRDDFENRY RIKYKDLDED FATVERETKG GTENDARLED QRMMSFTLGG EHDFGKLDLD WQASYSKASE DRPNERYINF RAKNQPFTVD ISNPEKPFVT VLNPDVSGGI SDSEDWKLKE LTEEHQYTED IDKNFGLNFN YAATDALAFK FGGKIRDKKK KRDNDFYEYE PVDEDAFRDE VFANLKNETK DHFLPGNKYK SGVFVSNKFL GGLDLDGDDF DKELVKEGLA GNFDAKEQIK AVYAMATWDI SDNTTLLGGA RLEHTRNEYD AFKYFADEDS LAAVTGKPSD YTNVLPYVHL RYNVNDQTNI KLAYTHSLAR PNYFDLAPYQ EIVAEDEEIK LGNPALEPTL SKNVDLMIEH YLSDIGILSA GVFYKSISDF IVTKKEDVDY SGDTFEQFQP VNAGDGTLMG IETAAQFQLP FIPGLGLYLN YTYTHSEIDN FDIKGREGDD LPLPGNPEHT ANASIAYENG PFNIRLSGNY HSDFIDAEEG SIGENKWEDR YYDSSFTLDL NGGYRMSDIV QLYFEVSNLT NQPLRFYQGE KQYLAQEEWY DRRFLVGVKA DF
|
| |