Gene Information |
Locus tag | Haur_4154 |
Symbol | |
ID | 5736015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5301731 |
End bp | 5302720 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641281308 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_001546914 |
Protein GI | 159900667 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
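
The coordinate and length fields above are mutually consistent, and the relationship can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the convention under which the listed start and end give 990 bp) and assumes that the protein length excludes the terminal stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length(5301731, 5302720)
print(length, protein_length(length))
```

For this record the two results agree with the Gene Length (990 bp) and Protein Length (329 aa) fields.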
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00377851 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACAGC ATGCATTGAT CAAACGTTTA TTGATTTTGG TAACTCTGTT GAGTTTGGTA GCTTGTGGCT CAAGCCAAAC CAACCAAACG GTAACTGCCG ATAAACCCAC TAAAACCTAT GCCGATTTGG TGATTGGCTA TTCGCAAATC GGCGCTGAAA GCCGCTGGCG CACCGCTAAT ACCCTCTCAA TCCAAGAAAG TGCCCAAGAT TTGGGCGTTG ACTTACGTTT TGCTGATGCC CAACAGGAAC AAATTAACCA AATTAATGCG ATTCGTTCGT TCATCACTCA AAAAGTTGAT CTGATTGGGG TTTCGCCAAT TGTTGCCGAT GGCTGGGATG AGGTGTTTGC TGAAGCCAAA GCCGCTGGAA TTCCAATTAT TGTGGTTGAT CGCACGGCCA ATGTGCCCGA CGAATTAATT ACCGCATTTA TTGGCTCGGA TTTTGTATTA GAAGGTGAAC GCGCTTGCGA AGAAATGGCT CAGTTGCTCA ATCAAAAAGG CACGATTATT GAGCTAGAAG GCACGGTTGG CTCGGCTCCC GCCCGCGATC GTAAAACTGG CTTTCATAAT TGCTTGAAAA AATATCCCGA AATGCAGGTT TTGGTCTCAA AAAGCGGTGA TTTTACCCGT GCTCAAGGCA AAACTGTGTT GCAAGGCTTG ATCAAGCAAT ATGGCACTGA TTTTGATGCG ATTTATGCTC ACAACGATGA TATGGCCTTG GGTGCGATCG AATTGCTCAA AGAATTAGGC ATCAAACCAG GGGTCGAGGT CAAAATTGTC TCGATCGATG CCGTTGAAGA TGCCTTCAAA GCCATGATTG CTGGCGATTT GAATGTAACG GTTGAATGTA ACCCGTTGCT TGGGCCACAA TTTTTTGAAA CTGCCCTGAA AATCGTCAAC GGCGAGCCAT TTGAACGCTG GGTTAAATCG AACGAAGGAA TTTTTCGCCA AGCAACTGCC GCCCAAGATC TGCCTAAACG GCGCTATTAG
|
Protein sequence | MSQHALIKRL LILVTLLSLV ACGSSQTNQT VTADKPTKTY ADLVIGYSQI GAESRWRTAN TLSIQESAQD LGVDLRFADA QQEQINQINA IRSFITQKVD LIGVSPIVAD GWDEVFAEAK AAGIPIIVVD RTANVPDELI TAFIGSDFVL EGERACEEMA QLLNQKGTII ELEGTVGSAP ARDRKTGFHN CLKKYPEMQV LVSKSGDFTR AQGKTVLQGL IKQYGTDFDA IYAHNDDMAL GAIELLKELG IKPGVEVKIV SIDAVEDAFK AMIAGDLNVT VECNPLLGPQ FFETALKIVN GEPFERWVKS NEGIFRQATA AQDLPKRRY
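
The gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with the stop codon TAG), so it can be translated directly even though the gene lies on the minus strand of the replicon. The following is a minimal sketch of such a translation, not the pipeline used to produce this record. It assumes the sequence is pasted as shown, in space-separated blocks of ten bases, and contains only A, C, G and T. Translation table 11 assigns the same amino acids to codons as the standard code, so a standard codon table is used here; `CODON_TABLE`, `clean`, `translate` and `gc_fraction` are illustrative names.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First seven codons of the gene sequence above.
print(translate("ATGTCACAGC ATGCATTGAT C"))
```

The first seven codons translate to MSQHALI, matching the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the Protein sequence and GC content fields to be cross-checked in the same way.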