Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2846 |
Symbol | |
ID | 5734727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3610294 |
End bp | 3611301 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279989 |
Product | periplasmic solute binding protein |
Protein accession | YP_001545612 |
Protein GI | 159899365 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCGCT GGTGGATTAT TAGCTTCCTT TCGCTGTTTA TCTTGGCAAG TTGCGGTGCT GAACAAGCCG CCCCAACCAC CCAAGGCCAA ACCGCCAAAC TCAACGTCGT CAGTACGGTT TCGCCAATTA CCAATATTAT CTACAACATT GCAGGCGATA AAATTAGCCT AACCGGAATT GTGCCTGAGG GTGTGAATTC CCACACCTTC GAGCCAGTGC CTTCAGATGC CAAAACCTTG GCCGAGGCCG ACTTAATCTT TATCAATGGC CTTAATTTAG AGGAACCAAC CCACAAATTG GCCGAAGCCA ACAAACAACC AAGTGCCGAA ATTATCTTGC TGGGCGAGCA AACGATCACG CCTGAGCAAT ATGTCTACGA TTTCTCGTTT CCTAAAGAGG CTGGTAGCCC CAACCCGCAC CTCTGGACAC ACCCGCTGCA TGGCTTGCGC TACGCCGAAA TTGTGCGTGA TGCCTTGGTG CGCCGCGACC CGAGCAACGC TGAGTATTAC AACGCCAATT ATGCCAGCTT CAAAACCCGC ATCGAAGCCT TTGATCTGGC AGTTAAAAAA ACGATCGAGA GTATCCCGGC TGAAAATCGC AAATTGCTGA CCTACCACGA TTCATGGGCT TATTTCGCCC CGCACTATGG CATGACCGTG ATTGGGGCAA TTCAGCCTGC TGATTTTGCC GAGCCATCGG CCAAAGATGT TGCCGATTTG ATCACGCAGA TTCGTGAGCA AAAAGTGCCA GCGATTTTTG GCTCGGAAGT TTTTCCATCG CCAGTATTAG AGCAAATTGG CCGCGAAACA GGCGTAAAAT ATATTGATAG CCTGCGCGAC GACGATTTAC CTGGCGAGGT CGGGGCTGCC AATCACTCAT ATTTAGGCTT GCTGACCGAA GATCTGCGGA TTATGGCCGA AAATTTGGGT GGCGACCCCA GCTTGATTGC CAATTTCGAT ACCAGCAATA TTCCTGGCAG CGATAGCAGC GTCGTTCAAC AACAATAG
|
Protein sequence | MRRWWIISFL SLFILASCGA EQAAPTTQGQ TAKLNVVSTV SPITNIIYNI AGDKISLTGI VPEGVNSHTF EPVPSDAKTL AEADLIFING LNLEEPTHKL AEANKQPSAE IILLGEQTIT PEQYVYDFSF PKEAGSPNPH LWTHPLHGLR YAEIVRDALV RRDPSNAEYY NANYASFKTR IEAFDLAVKK TIESIPAENR KLLTYHDSWA YFAPHYGMTV IGAIQPADFA EPSAKDVADL ITQIREQKVP AIFGSEVFPS PVLEQIGRET GVKYIDSLRD DDLPGEVGAA NHSYLGLLTE DLRIMAENLG GDPSLIANFD TSNIPGSDSS VVQQQ
|
| |