Gene Information |
Locus tag | Haur_2064 |
Symbol | |
ID | 5733952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2575704 |
End bp | 2576681 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279206 |
Product | periplasmic solute binding protein |
Protein accession | YP_001544833 |
Protein GI | 159898586 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATGGT TAAAACGTGG GTTCTTGGTG ATTATCAGTT TGCTGTGTAT GGCATGTGGT CAATCGACGG CCACGATCGT TCCCCCACCG ATGGATACCA ACACCAGCAA TAAACCAGTG ATTATTGCGA CAACAACGCA GATTGAAGAT ATTTTGAGTG TGATTGGCGG TGATCGGATC AGCGTGGTGG GCTTGGTTCC ACGCAATGGT GACCCACACG AGTTTGAACC AACTCCAGCC GATGTCCAGC GGGTTGCGAC CGCACAAGCC GTTTTTATGA ATGGGGCAGG CTTTGAAGGA TGGATCGATG AATTAATTCG GAACGCTGGC GGTCAGCGGC CCGTGGTTGA TCTCTCGGTT GGTATTCCGC TTGGCACGAT TGCGAGCGGG TTTGCAGAAA GTGGCGAAAC CGATCCGCAC ATTTGGATGA ACCCCCAACA CATGCTCATT ATGGTGGATA CGATGGTGAC CAACCTTATT CAGCTTGATC CGGCTGGCGC AACGACCTTC AATGCGAATG CAACAGCCTA TAAACAGGAG TTAATCGCCC TTGATGCCTA TGCCGAAAAA GAACTTGCGG CCATTCCGGC CAATCGCCGC AAGTTGGTAA CGGCCCATGA TGCGATGGGG TATTTTGCCG CTCGCTATAA CTTTGACATT GTTGGCGCGG TGATTCCGAG TGCGACGACC GAGGCGGCAG AAACCTCCGC GCAGGATCTT GCAACCCTCA TTGATGCGAT CAAGGCCGCA GGGGTGCCCG TTATCTTTGC CGAAGTGTCG AATAACCCGA AGTTTATTGA ACAAGTTGCC CGCGAAGCGC ATGTGCGTGT CGATACCTTA TACGTTGATT CGCTTGGTGA AAAAGGCTCA GATGCAGGTA CGTACCTTGA TTTTTTTCGT ACTGACGTGC AAAAAATTGT TCAAGCCCTT AAATTCGTAC TGACGTGCAA AAAATTGTTC AAGCCCTTAA ATAGGTGA
|
Protein sequence | MTWLKRGFLV IISLLCMACG QSTATIVPPP MDTNTSNKPV IIATTTQIED ILSVIGGDRI SVVGLVPRNG DPHEFEPTPA DVQRVATAQA VFMNGAGFEG WIDELIRNAG GQRPVVDLSV GIPLGTIASG FAESGETDPH IWMNPQHMLI MVDTMVTNLI QLDPAGATTF NANATAYKQE LIALDAYAEK ELAAIPANRR KLVTAHDAMG YFAARYNFDI VGAVIPSATT EAAETSAQDL ATLIDAIKAA GVPVIFAEVS NNPKFIEQVA REAHVRVDTL YVDSLGEKGS DAGTYLDFFR TDVQKIVQAL KFVLTCKKLF KPLNR
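The coordinate and length fields in this record are related by simple arithmetic, which the sketch below checks. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2575704, 2576681)
print(length_bp, protein_length(length_bp))
```

With the coordinates above, the computed values can be compared against the Gene Length and Protein Length fields as a quick consistency check.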