Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3046 |
Symbol | |
ID | 5734918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3847245 |
End bp | 3848255 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280190 |
Product | periplasmic solute binding protein |
Protein accession | YP_001545812 |
Protein GI | 159899565 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0205239 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGAT ATAGTTTAGG GTTGTTGCTG GTATTGGTGG TTGGTTGTGG CCAAAGTACA GCCACAGTTC AGCCAAGCCA AGTGAGCCAA AGCCAACAAC AAACCACCAC CGCTGATCAA ATCATGCCAA CGGCAACCGC AATTAGCACC GCGCCAGTCA GCCAAATTCG CGTCGTTACA ACCATGAGCA TTTTGGCCGA TGTGATTAAG CAGGTTGGTG GCGAACGGGT GCTTGTTGAT AATATTATTC CCTTGGGCGC TGGCCCCGAA GATTATCAAG CTACGCCTGG CGATAGCCAA AAAATTGCCG ATGCCAATAT TGTGTTTTTC AATGGCCATG CGCTTGAGGA ATGGCTCGAA CCCTTGTTCG AAAATGCTGG GGGCAGCGAG CAGCCAAGGA TTGAATTATC TGCTGGTTTT GCAGTGATTG AAGAAGAACA TGCTGAAGAA GAACACGCTG ATGAAGAACA TGCTGATGAG CATGCTCACG AAGAAGGCAA CCCGCACTTT TGGCTTGACC CAACCTATGT GATGTCGTAT ACCCTGACGA TTCGCGACCA ACTTAGTGCG ATCGATCCCA GTGGCAAGGA TGTCTATGCA GCCAATGCCG AAGCCTATCT TGGCCAATTA CAAGCGCTCG ATCAAGAATT GCAAGGCTTG GCGGCCCAAA TTCCGGCTGA ACGGCGCAAA CTCGTGACCA ACCACGATGC CTTTCCGTAT TTTGCCCACC ACTATGGCTT TGAAGTTGCT GGCGTGTTGT TGGATAACCC CGAAGCCGAG CTTTCGGCTG GCGATTTAGC GGCTTTGGTC GAGAGCGTTA AGGCCAGCGG CGTGCCGGCA ATTTTCTCTG AATCGCAGTT CAACCAAAAA ACTGCCCAAT TGCTGGCGGA TGAAGCTGGG ATTGAAACCA TTGCGGTGTT GTATACCGAC ACTTTAGGCA GCGATACTGC AACTTCCTAT ATCGACATGA TGCGGTACAA TATGAATACT ATTGTTGCTG CGCTCAAATA A
|
Protein sequence | MRRYSLGLLL VLVVGCGQST ATVQPSQVSQ SQQQTTTADQ IMPTATAIST APVSQIRVVT TMSILADVIK QVGGERVLVD NIIPLGAGPE DYQATPGDSQ KIADANIVFF NGHALEEWLE PLFENAGGSE QPRIELSAGF AVIEEEHAEE EHADEEHADE HAHEEGNPHF WLDPTYVMSY TLTIRDQLSA IDPSGKDVYA ANAEAYLGQL QALDQELQGL AAQIPAERRK LVTNHDAFPY FAHHYGFEVA GVLLDNPEAE LSAGDLAALV ESVKASGVPA IFSESQFNQK TAQLLADEAG IETIAVLYTD TLGSDTATSY IDMMRYNMNT IVAALK
|
| |