Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4030 |
Symbol | |
ID | 5735891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5144920 |
End bp | 5145837 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281180 |
Product | periplasmic solute binding protein |
Protein accession | YP_001546790 |
Protein GI | 159900543 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0298524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGAA AAAATCTTTT GGTATTGCTG GTATTGCTAC TGGTAAGTTG TGGCCCTAGC AGTTTTGCCC AACAAACCAG CGCTAATCCC CCGCTACGGG TAGCGGCAAC CGTGGGCATG ATTGCCGATG TCGTCAAGCA TGTTGGTGGC GAGCATGTCG AAGTGCTTGG CTTGATGGGG CCAGGTGTTG ACCCACACCT CTACAAACCC AGCGTTGGCG ATGTGCGGAT TCTCGATGAT GCTGATTTGA TTTTTTATGG CGGCTTGGAG CTTGAAGGCC GCATGACCGA TATGCTCGAA AAGCTCAATC GCCAAAAACC AACGATTGCC GTCACCAGCC AAATCGATCA AAGCCGCTTG CATGCAACTG GCGGCGAATC TTTCGACCCG CATGTTTGGT TCGATGTAAT TTTATGGCGC GATACGATTG CCATCATCGT TGAAACCTTG GCGAGCTATG ATCCAGCTCA TGCAGCCGAT TATCAGCGCA ACGCTGCTGC CTACGCCCAA GAACTGACCG CCTTGGATGA AGAAGTTAAG GCCTTGATTG CAACCGTGCC CAGCCAAAGC CGTGTGCTGA TTACGGCTCA CGATGCCTTT GGCTACTTTG GCTCAGCCTA TGGCTTTGAG GTGCGCGGTT TGCAAGGGTT GAGTACGGCT AGCGAAGCAG GGGCAGGCGA TGTCCAAAGC CTAGCCGAAT TTATTCAAAC TCGCCAAATC AAAGCGATTT TTGTCGAATC CAGCGTGCCC CAAACCACGA TTAATGCTGT CCAAAAAGCC GTTCAATCAC GCGGCTGGGA TGTGGCGATC GGTGGCGAGT TGTTCTCCGA TGCCATGGGC GATGCTGGCA CTGAAGAAGG AACCTATATC GGCATGGTGC GCCATAACGT CACCACGATT GTTAATGCGT TGAAGTAG
|
Protein sequence | MQRKNLLVLL VLLLVSCGPS SFAQQTSANP PLRVAATVGM IADVVKHVGG EHVEVLGLMG PGVDPHLYKP SVGDVRILDD ADLIFYGGLE LEGRMTDMLE KLNRQKPTIA VTSQIDQSRL HATGGESFDP HVWFDVILWR DTIAIIVETL ASYDPAHAAD YQRNAAAYAQ ELTALDEEVK ALIATVPSQS RVLITAHDAF GYFGSAYGFE VRGLQGLSTA SEAGAGDVQS LAEFIQTRQI KAIFVESSVP QTTINAVQKA VQSRGWDVAI GGELFSDAMG DAGTEEGTYI GMVRHNVTTI VNALK
|
| |