Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2471 |
Symbol | |
ID | 5734352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3161287 |
End bp | 3162459 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279611 |
Product | extracellular solute-binding protein |
Protein accession | YP_001545237 |
Protein GI | 159898990 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCAAT CGAAGCTTGG TGCTTTTCGG CTACGTGCTG CTAAATTAGC GGCCTTATTC CTCGTCGTTT CGTTGATGAT TAGTGCGTGT GGCTCAGCTG CCAGCGAAAG CACCACCGCC CCAACCGCTG GCTCAGGTGG CACATCATCA ACTATGGATG CTTTGATTGC TGCTGCCAAA GCCGAAGGCG AATTAACCGT GATCGCCTTG CCCCACAATT GGCTCAACTA CGGCGAAATG ATCGAAAACT TCTCGAAAAA ATATGGCATC AAAATCAACG AACTGAATCC TGATGCTGGT TCGGGCGACG AAATCGAAGC GATCAAGGCC AATAAAGATA ACAAAGGCCC ACAAGCCCCC GATGTGATCG ACGTTGGCTT TGCTTTTGGT CCATCCGCCA AGCAAGAAAA TCTCTTGCAG CCCTATAAAG TTTCAACCTG GGATACGATT CCTAACGAGT TGAAAGATCC TGAGGGCTAT TGGTTTGGCG ATTACTATGG CGTGTTGGCG TTTGCCGTCA ACAAAGATGT GGTCAAAAAT GTGCCGCAGG ATTGGGCCGA CCTCTTGAAG CCTGAATACA AAGGCCAAGT CGCCTTGGCA GGCGATCCAC GGGTCTCAAA CTTGGCGATT CAGTCAGTCT ACGCCGCAGC ACTTGCTAAT GGTGGTAGCT TGGATAACGT TCAACCTGGC TTGGATTTCT TCAGCAAATT GAACCAAGCT GGCAACTTTG TGCCAGTTAT CGCCAAACAA GGTACCTTGG CCCAGGGCGA AACCCCAATT ATGATCACCT GGGACTATTT GGCAATCAGC GCTCGCGATG AATTGGGCGG TAACCCTGAA ATCGAAGTGG TTGTGCCAAA ATCAGGCGTG CTTGGTGGCG TGTATGTTCA AGCGATCAGC GCTTATGCTC CGCACCCAAA TGCTGCCAAA TTGTGGATGG AATACCTCTA CTCCGACGAA GGCCAATTGA CTTGGCTCAA AGGCGGCGGC CACCCAGTGC GCTACAACGA TTTGGTTGCT CGCAATGTAA TTCCAGCCGA AATTGCGGAA AAATTGCCAC CAGCCGAACT CTACGCCAAC GCCGTATTCC CAACCTTGGC GCAGCTCGAA GCTGCCAAAA AAGTTATCGT CGATGGTTGG GATACCACGG TCAACGTCGA TGTCAAAGAA TAA
|
Protein sequence | MAQSKLGAFR LRAAKLAALF LVVSLMISAC GSAASESTTA PTAGSGGTSS TMDALIAAAK AEGELTVIAL PHNWLNYGEM IENFSKKYGI KINELNPDAG SGDEIEAIKA NKDNKGPQAP DVIDVGFAFG PSAKQENLLQ PYKVSTWDTI PNELKDPEGY WFGDYYGVLA FAVNKDVVKN VPQDWADLLK PEYKGQVALA GDPRVSNLAI QSVYAAALAN GGSLDNVQPG LDFFSKLNQA GNFVPVIAKQ GTLAQGETPI MITWDYLAIS ARDELGGNPE IEVVVPKSGV LGGVYVQAIS AYAPHPNAAK LWMEYLYSDE GQLTWLKGGG HPVRYNDLVA RNVIPAEIAE KLPPAELYAN AVFPTLAQLE AAKKVIVDGW DTTVNVDVKE
|
| |