Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2121 |
Symbol | |
ID | 5734009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2663844 |
End bp | 2664845 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641279262 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_001544889 |
Protein GI | 159898642 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.173687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCCC TTGCACTCTG CCGCTGTGCT ATCGCACTGG CCATAGCCTG GACTCTGGCA GCGTGCGGCG GCTCCTCGTC TGGAGTCGGC GCAGTAGCGC CCTCCTCCCT CGATCTCAAG ACCAAAACTT ATCCTGAGCT GGTTGTCGGG TTTGCTCAAA TCGGAGCCGA AAGCGAGTGG CGCACCGCTA ACACCCGCTC GATTCAAGAT ACGGCGAATC AACTTGGTGT CGAATTGGCG CTTTCCGATG CGCAACAGCA GCAGGAAAAT CAGATCAAGG CCATCCGTTC GTTTATTGCT CAAGGTGTCG ATGTCATCGG AGTCTCGCCC GTGGTCGAGA CTGGCTGGGA CGAGGTTTTC GCCGAGGTCA AGCAGGCTGG AATTCCGTTG ATCTTGCTCG ATCGCAACGC CAATGTGCCA GATGATCTCT ATAGTGTCCG CATCGGATCA GACTTCGTGG AAGAGGGTCG GCGGGCCTGC GGTGAGATGG CTCGACTGCT GGATGGTCAA GGTGCGATTG TCGTCTTAGA AGGCACCCAA GGCTCAGCCC CAATGATCGG ACGAGGTACT GGCTTTCAAG AATGCCTGCA ATCTTATCCC GCGCTTCACA TAATCGACAG CCAGTCCGGT GATTTCATTC GCGCCCGTGG CAAAGAGGAG ATGGCAGCGT TGCTGCAAAA ACACGGCAAC AGCATCGACG GCGTGTTTGC CCAGAACGAT GACATGGCGC TTGGCGCGAT CGAGGCCATC GAGGAGTATG GGCTACGGCC TGGCGTTGAT ATCAAAATTG TTTCGATCGA TGCGGTACGG GCTGCCTTCG AAGCAATGAT TGACGGCAAG CTTAACGCCA CAATCGAGTG TAACCCGCTG CTTGGCCCGC TGTTTTTCGC CACAGCCCTG AACTTGGCTA ACGGCATACC GGTTGAAAAA TGGATCAAGC CCGACGAGGG CATCTACCGA CAGGATACCG CCGCGCAGGA ATTGTCTAAG CGCGAATACT AG
|
Protein sequence | MNALALCRCA IALAIAWTLA ACGGSSSGVG AVAPSSLDLK TKTYPELVVG FAQIGAESEW RTANTRSIQD TANQLGVELA LSDAQQQQEN QIKAIRSFIA QGVDVIGVSP VVETGWDEVF AEVKQAGIPL ILLDRNANVP DDLYSVRIGS DFVEEGRRAC GEMARLLDGQ GAIVVLEGTQ GSAPMIGRGT GFQECLQSYP ALHIIDSQSG DFIRARGKEE MAALLQKHGN SIDGVFAQND DMALGAIEAI EEYGLRPGVD IKIVSIDAVR AAFEAMIDGK LNATIECNPL LGPLFFATAL NLANGIPVEK WIKPDEGIYR QDTAAQELSK REY
|
| |