Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2265 |
Symbol | |
ID | 5734152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2897430 |
End bp | 2898572 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279406 |
Product | extracellular solute-binding protein |
Protein accession | YP_001545033 |
Protein GI | 159898786 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGCC ATTTCACGCG CATTGTGGCG CACGTGTTCA CTGTCGGACT CATGGCCTCG TTGATCAGTT GTGGGAGCAC GCAAGTAACG CCTACCGTCA GCCAAAGTGC CCAATCAGTT TCAGCCTCAA ACGCCGATCC CGCTTTAGTT GAAGCCTCAA AAAAAGAAGC CGATGGTATT TTGATTTATT CGATTATGAG TGAGAAAAAT TGGCAGCCAG TGATTCAAGG CTTTAACGCC AAATATCCTT GGATCAAAGT AACAACTGCT GATTTAGGAG CTTACGAGGT ATTCGAACGC TACTACACCG AAGCCGCTGG CAATGCCCGT ACTGCCGATT TTATCTCCAC CACCGCGCCT GATGGCTGGA TCAACTTTAT CGATAAAGGC GAAGTCCAGT CGTATGTTTC CAGCGAAGAT GCCCAAATTC CTGAGCTTGG CAAAATTGCT CAAGGGGTCT ATGCCGCCTC AACCGATCCG ATGGTCTTTA TTTGGAACAA GCAATTGGTC GCTGATCCGC CGCAAACTAT GGCCGAACTG ATCAGCATGA TCGAAAAAGA CCCCAGCGCC ATGCAAGACA AATTGGTGAG CTACGATGCC GATGGCGAAG GCTTTGGCTA TGGTCTCAAC TGGTTTTATA CCAAGGCCAA AGGTGCTGAT GGTTGGAAAA CTTTAGAAGC AATTGGTTCA AGCCAGCCCA AAATTTTAAC CTCTGGCGGC AAGATGATCG ATTCAGTGCT TTCGGGCGAA TCCAGCATCG GCTACTTTGT CTCGAATATC ACAGTTCAGC CACGCTTGGA AGCTGCTCAA CAATTGCTTG GCTATAGCTT TGTGCCTGAT GCTCAAGTTG TCGCGGTGCG GGGCATGGCA GTCACCAAAC ATGCTGCTAG CCCTAATTCA GCCAAATTGC TACTCGATTA TATTCTCTCA GCCGAAGGAC AAATGGCCTT CTCGCAAGGT GGCTTAACCG CCTATCGCCC CGACATTGCC GCCTCTGCGC CACTGCATCT TTCGCAAGTA AGCCAAGCGG TTGGGGGCGA GCAAAATATC ATCTTCTCAC GGCCCGATAA AGCCCTGGCT GACCCTCAAC AACGCAGCCA GTTCCTCAAC CAATGGAAAC AAGCCCTCGG TCGCAACCAA TAA
|
Protein sequence | MKSHFTRIVA HVFTVGLMAS LISCGSTQVT PTVSQSAQSV SASNADPALV EASKKEADGI LIYSIMSEKN WQPVIQGFNA KYPWIKVTTA DLGAYEVFER YYTEAAGNAR TADFISTTAP DGWINFIDKG EVQSYVSSED AQIPELGKIA QGVYAASTDP MVFIWNKQLV ADPPQTMAEL ISMIEKDPSA MQDKLVSYDA DGEGFGYGLN WFYTKAKGAD GWKTLEAIGS SQPKILTSGG KMIDSVLSGE SSIGYFVSNI TVQPRLEAAQ QLLGYSFVPD AQVVAVRGMA VTKHAASPNS AKLLLDYILS AEGQMAFSQG GLTAYRPDIA ASAPLHLSQV SQAVGGEQNI IFSRPDKALA DPQQRSQFLN QWKQALGRNQ
|
| |