Gene Information |
Locus tag | Haur_2120 |
Symbol | |
ID | 5734008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2662649 |
End bp | 2663647 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279261 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_001544888 |
Protein GI | 159898641 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
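The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state this) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2662649, 2663647)
print(length, protein_length(length))  # prints: 999 332
```

Under these assumptions the computed values agree with the Gene Length (999 bp) and Protein Length (332 aa) fields.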
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0534772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATTT CACGAACATT CGCCGCGCGT CTGATGGCGC TCCTGGTTAT CGTCGGGCTA GCTGGTTGTG GTGCCTCGAG TGCGCCCACA ACGGCACCAG ATGCGCCTGC AACAGGGGCC AAAACCTACA GCGATATGGT GCTGTGCTAC CCGCAGCTTG GGGCTGAAAG CGACTGGCGC ACCGCCAATA CCGCTTCGAT CAAGGAGACC GCAGCAACCT TGGGAATTAA GCAATTGGTT TTCTCTGATG CCCAGCAAAA GCAAGAAAAT CAAATTTCTG CCGTGCGTGC CTGTATCCAG CAGGGCGTGG ATGTGATTGC ACTGCCACCA GTCGTTGAGG ATGGCTGGGA TGCTGTTCTG ACCGAGGCCA AGAATGCAGG TATTCCGGTG ATTATCGTTG ATCGCAGTGT GAGTGCCGAC AAATCCCTCT ATAGCACCCA TATCGGCTCC AATATGGTCT TGGAAGGTGA ACGGGCCGCT GCTGAATTCA ACAAAATGAT GCCAAACGGC GGCGCGATTC TCGAACTTTC GGGAACGACA GGTTCTGGTG CAGCGGTTGG TCGGGCAAAG GGTTTGCGCA ATAAACTCAA TTCCAACATC ACCATTATCG ATTCACAGAC TGGCAACTTC ACCCGTGCCG AGGCCGTCCC TGTGATGCAG GCACTTCTGA AGAAATATAC CCCGGGCACG GATTTCCAAG GCATCTTTAT TCACAACGAC GACATGGGCA TCGGCGTGAT CGAAGTTCTC AAGGCTGCTG GGATCAAACC AGGCGATCTC AAGATTGTGT CGGTTGATGG TACTCGCGGC GGTTTCCAAG CCATGGTTGA TGGCTGGTTC CAAGCCGATG TTGAGTGTAA CCCGCTGCTT GGCCCACAGG TATTCGAATT GGCGCTGAAG CTCATGAACG GTCAGCCAGT CGAGCCAGAA GTCATCACCA ATGAAACCGT CTACTATCCA GAGAATGCAG CTGAACTGTT ACCAACCCGC AAATACTAG
|
Protein sequence | MSISRTFAAR LMALLVIVGL AGCGASSAPT TAPDAPATGA KTYSDMVLCY PQLGAESDWR TANTASIKET AATLGIKQLV FSDAQQKQEN QISAVRACIQ QGVDVIALPP VVEDGWDAVL TEAKNAGIPV IIVDRSVSAD KSLYSTHIGS NMVLEGERAA AEFNKMMPNG GAILELSGTT GSGAAVGRAK GLRNKLNSNI TIIDSQTGNF TRAEAVPVMQ ALLKKYTPGT DFQGIFIHND DMGIGVIEVL KAAGIKPGDL KIVSVDGTRG GFQAMVDGWF QADVECNPLL GPQVFELALK LMNGQPVEPE VITNETVYYP ENAAELLPTR KY
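The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is already shown 5' to 3' on the coding strand (it begins with ATG and ends with TAG, so no reverse complement is applied even though Strand is "-"). The codon assignments are those of NCBI translation table 11, which are the same as in the standard code. The sketch does not handle the alternative start codons that table 11 allows, because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (first base varies slowest).
# Translation table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCATTT CACGAACATT CGCCGCGCGT"))  # prints: MSISRTFAAR
```

The printed residues match the first block of the protein sequence, and the final nine bases of the gene (AAATACTAG) translate to KY followed by a stop, matching its last two residues.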