Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0389 |
Symbol | |
ID | 4709750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 454072 |
End bp | 456075 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639854852 |
Product | TonB-dependent receptor |
Protein accession | YP_001001985 |
Protein GI | 121997198 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCCGA GGCGATCGTG TACAGCCCCC TTTTTCAGGT CCCTTACGCT TTTCAGCGGT GGTTTGGTGA CGCTCCTTCC CGTATCGTTG CCCGCCAAAG ATGCGGATGA CTTGCCTGTT GTGCGTGTCG AGTCCAGCGT CGATTCGCTG GGGCGGGGCT CGAGCCGCGA AGAATTCCAG AGGCGCCAGT CCTCGCGCAG CGGCGAGCTC TTTCGGGGTG ATGCGTCAGC CACCGTAGGG GGCGGCAGTC GCAACGCCCA GCGCCTCTAC CTGCGCGGGG TTGAGTCAAA CAACCTCAAC GTGACCGTCG ATGGTGCTCG GCAAGGTCGC GACCTCCACC AGCACCGTGG TGGCCTAACC GGTCTCGATC CGCAGCTGCT CGATGACGTG GAAGTCGATA CGCGGCCTGC GGCGGATCAA GGTCCCGGCG CGCTGGGTGG CAGCGTACGC TTCCGGACGG TCGACGCGCA GCAGCTCCTC GATCCTGATG AGCAGACGGG TGCACGCTTG AGGGCCGGTT ACGCAACAGC CGATTCTTCC GAGCAGGGAT CAGCCACTGT CTTTCAGCGG TTGGGTTTCG ATTGGGGCGC CCTCGCCCAT GTGAGTGGGG CGAACCGGGA CGATTACGAG ACCGGGGGCG GTGACAACAT GCCGTACTCG GGCGGAGGCG ATCGTAGCTA CCTGCTGCAG GCTAGCCGAA TGCCTGTGCA CGGGCATGAA CTGCGCCTCG GTGTCCAGCG GCACAGCTTT GAGGGCGATA CGCTCTCCGG TGGGGCGGGC AGTGATTTTG GCGATCCCCG GGTAGAGCAC CGAGGGGAGC CGGAAAAACA GGAGCTGCGA CGGGACACGT GGACGGTGGA GCACCGTTAC GACCCCACCG ACCCCAACGT GGACTGGCAG GCACGGGTTT ATCGCAACGA TAATCGGCTC AAACGTCTGG ATCAGGGCAC CGAGACGCGC GCACTTGAGC ACGGCGGCGA TCTTCGCAAC ACCTTCTCCC TCAATGCCGG ACCGACGCGT CACCAGCTCA CCGCGGGTTT CGACTACTAC ACCGAGGATG GTCGTCTCGA GCAGGACGAC GGCCCGCGGC TGAGCTACAC GGACCGCAAC TTCGGTGCCT TCCTGCAGAA CCGCATGGAG TGGGAACGAC TCCGCCTTTC GAGTGGCCTG CGTTTCGATG ACTACACCAG TGCCCTGGGA GAGCGGAACC CCGAGGGTGA CGCTTTTTCC CCCAACCTGA GTGCAGAACT CGATCTTGCG GCGGGCTGGG CGGTCTTCGG CGGTTACGGG GAGGCAGCCA GAGGTCCTGG CGGCACAATG CCCATCGGCT GGGTGCAGTA CATCGAAGAG GGCAACGACG ATCAGTCTTT AAAGACCGAG GAGTCCCGCC GCAGCGAGGG CGGCCTGCGG TATCAGGGCC GGGGTCTGGT CGCCTCCCGT GATCGCCTGA ACCTCGAGGC GACGGTTTTC GAGACGCGCA TCGACAACAG CCTTGAACGC GTTGGCGGTG GTCCCCCGCA CCAATCGGGT GTTCGTCTCG GGCAGCGCGA TGTCCGGATC AGCGGCTATG AGCTGCGGGC CGCATGGGGG GTGAATGCCT ATGACACGCG GCTGTCGTTC CTGAGCGCCG AGACGGAGGA CGACGATGGC GACCCGGCTG GGGTTAGCCG TCGTCTGGCC GGAAGTGGTG GTGATCGTCT GGTCTGGGAT CACCGTTGGG CGGCTCATGA AACCCTGACC CTGGGGTATA CGCTCACCTG GGTGGGGGAT GACACCGATG TACCTGACGA TGAGCCGGAG CGCGACGGTT ATCACCTTCA CGACATCCAG GTCCAGTGGC AGCCGTGGGC GGATGAGCAG GTCACGCTGG GGGTGGTCGT GAACAACCTC TTTGACGAAC AGTATGCCGA GCACACATCG CTGGTGTCGG AGCAGGATGG TGAGCTGATT GTTCGCGATG AGCCGGGGCG GGATATCCGC CTGGAAGCGG CGCTGCGTTT TTGA
|
Protein sequence | MHPRRSCTAP FFRSLTLFSG GLVTLLPVSL PAKDADDLPV VRVESSVDSL GRGSSREEFQ RRQSSRSGEL FRGDASATVG GGSRNAQRLY LRGVESNNLN VTVDGARQGR DLHQHRGGLT GLDPQLLDDV EVDTRPAADQ GPGALGGSVR FRTVDAQQLL DPDEQTGARL RAGYATADSS EQGSATVFQR LGFDWGALAH VSGANRDDYE TGGGDNMPYS GGGDRSYLLQ ASRMPVHGHE LRLGVQRHSF EGDTLSGGAG SDFGDPRVEH RGEPEKQELR RDTWTVEHRY DPTDPNVDWQ ARVYRNDNRL KRLDQGTETR ALEHGGDLRN TFSLNAGPTR HQLTAGFDYY TEDGRLEQDD GPRLSYTDRN FGAFLQNRME WERLRLSSGL RFDDYTSALG ERNPEGDAFS PNLSAELDLA AGWAVFGGYG EAARGPGGTM PIGWVQYIEE GNDDQSLKTE ESRRSEGGLR YQGRGLVASR DRLNLEATVF ETRIDNSLER VGGGPPHQSG VRLGQRDVRI SGYELRAAWG VNAYDTRLSF LSAETEDDDG DPAGVSRRLA GSGGDRLVWD HRWAAHETLT LGYTLTWVGD DTDVPDDEPE RDGYHLHDIQ VQWQPWADEQ VTLGVVVNNL FDEQYAEHTS LVSEQDGELI VRDEPGRDIR LEAALRF
|
| |