Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4033 |
Symbol | |
ID | 5735895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5148672 |
End bp | 5149961 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281184 |
Product | extracellular solute-binding protein |
Protein accession | YP_001546793 |
Protein GI | 159900546 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCGCA AATGGCGTTG GAAATTCGCT TTGGTTAGCC TATTAACGAC CGTTTTAGCC GCGTGTGGTG GCGGCAGCGC AACCCAAGCT CCTACCACCA ATCCCAATCG GGTCACGCTG ACGATTGAAA GCTGGCGCAG CGACGACCAA AAGGTCTGGA CTGAGCAGAT TATCCCGCTA TTTGAAAAGA GCCATCCTGA TATTCATGTG GAGTTTCGGC CCACCGCGCC GACCGAATAC AACGCAGCGC TCAATAGCAA ACTAGAGGCT GGCACTGCTG GCGATCTGAT CACCTGCCGG CCTTTTGATG TTTCATTAGG CTTGTATACT GCGGGGCATT TGACTGCACT CGACGATTTG CCAGGCATTG CCCATTTTAG CGATGTAGCC AAGAGCGCTT GGCAAACCGA CGATGGCAAG ACGCTCTTTT GCTTGCCAAT GGCCTCGGTG ATCCATGGCT TTATCTACAA CAAAGCGATT TTTAACGAGC TAGGTTTGAC TGAACCCAAA ACTGTCGATG AGTTTTTCGC GGTGCTCGAT AAAATTAAGC AAGATGGCCG CTATACACCA TTGGTGATGG GTACTGCCGA GCAGTGGGAA GCAGCAACCA TGGGCTTCCA AAACATCGGG CCAAATTATT GGTTTGGTGA AAATGGCCGT AAAGCGCTAA TCGATGGCAG CGCCAAATTA TCCGATGCGG CCTATGTTTC GACCTTAAAA GATTTGGCAC GCTGGGGCGA TTATTTGCCC GATGGCTTTG AAGCAGTCAA ATATGCCGAT GCTCAGATGT TGTTTGCCTC AGGCAAAGGG GCAATTTACC CCGCTGGCTC GTGGGATATT AGCTATTTCA ACCAAAATGC CAAATTCGAG CTTGGCGCAT TCAAAGCTCC GGTTGCCAAA GCCAATGATC AATGCTTCAT CAGCGACCAT ACCGATATTG GAATTGGGAT TAATGCTAAG ACTGCCCACA TGGCTCAAGC CCGTACCTTC CTTGAGTGGA TGACTGGGCC AGAATTTGCT GGCGAATTCA GCAATGCCTT GCCTGGCTTC TTCAGCCTCT CGGATTATCC AGTGAGCTTG AAAAATCCCT TGGCCCAAAC CTTTGTTGAT TGGCGCAAAC AGTGTCAATC GACAATCCGC AATTCCTACC AAATTCTCTC ACGCGGCGAG CCAAACCTCG AAAACGAACT CTGGCGGATC AGCGTTGGGG TGATTGATGG AACATTTGCG CCTGAAGATG CGGCCCAAGA TCTGCAAAAG GGCTTAGATA ATTGGTACAA AAAGCCCTAG
|
Protein sequence | MVRKWRWKFA LVSLLTTVLA ACGGGSATQA PTTNPNRVTL TIESWRSDDQ KVWTEQIIPL FEKSHPDIHV EFRPTAPTEY NAALNSKLEA GTAGDLITCR PFDVSLGLYT AGHLTALDDL PGIAHFSDVA KSAWQTDDGK TLFCLPMASV IHGFIYNKAI FNELGLTEPK TVDEFFAVLD KIKQDGRYTP LVMGTAEQWE AATMGFQNIG PNYWFGENGR KALIDGSAKL SDAAYVSTLK DLARWGDYLP DGFEAVKYAD AQMLFASGKG AIYPAGSWDI SYFNQNAKFE LGAFKAPVAK ANDQCFISDH TDIGIGINAK TAHMAQARTF LEWMTGPEFA GEFSNALPGF FSLSDYPVSL KNPLAQTFVD WRKQCQSTIR NSYQILSRGE PNLENELWRI SVGVIDGTFA PEDAAQDLQK GLDNWYKKP
|
| |