Gene Information |
Locus tag | Haur_1243 |
Symbol | |
ID | 5733151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1448923 |
End bp | 1450257 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278383 |
Product | extracellular solute-binding protein |
Protein accession | YP_001544019 |
Protein GI | 159897772 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
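The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Gene Information table for Haur_1243.
length = gene_length_bp(1448923, 1450257)
print(length)                              # 1335
print(expected_protein_length_aa(length))  # 444
```

Both results agree with the Gene Length (1335 bp) and Protein Length (444 aa) fields.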
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACAT CGAAGTCAAC GTTCCGACTC TCTTTTATGC TACTCTTGGT TTTGCTCACC AGCATCTTGG CGGCTTGCGG CTCTGAGACC GCTACCACTG CGCCAAGCGG GAGCACCACC ACTAGCAACG AGCCACGCAC CATCAAACTT TGGCACTACG AAGGTGCTAA CAGCGCCATG GGTATTGCTT GGGCCGAGTC AATCAAACAA TTTCAAGCAT CACACCCTGG CGTAACGATT CAGTTTGAAG AAAAAGGCTT CGAGCAAATT CGCCAAACCG CTGGTATGGT GCTCAACTCC GATGAAACTC CCGATATTTT GGAATACAAC AAAGGGAATG CAACCGCTGG TTTGCTTTCA ACCCAAGGCT TGCTGACTGA TCTTTCCGAG GTGGCGACCC AACGCGGTTG GGATAAATTG CTCAGCTCCA GCTTGCAAAC CACCGCCCGC TACGATGAAA AAGGCGTGAT GGGTGCTGGC AAATGGTTTG GTGTGCCCAA CTATGCCGAA TATGTGATGG TTTATTACAA CAAAGACATG TTCGCCAAAG CCAACTTGCA AGTGCCAACT ACCTTGGCCG AATTTGAAGC CGTCATGGAT GCCTTTGTGC AACAAGGGGT CACGCCGCTC TCGGTCGGCG CTGCTGAATA TCCCGCCCAA CAGATTTTCT ATGAATTGGT GCTGAGCCAA GCTGATCGCG AATTCGTCAA TGCCTTCCAA CTCTATCAAG GCGATGTCGA TTTCCGTGGC CCTGAGTTTA CCTATGGCGC TGAAAAAATG GCCGAATGGG TCAGCAAAGG CTATATCAGC AAAGATGCCA CCGGCATCAA AGCCGAAGAT ATGGGCGTGG CCTTCACCAA TGGCACATTC CCAATCATGA TTTCGGGCAG TTGGTGGTAC GGTCGCTTCA CCGACGAAAT CAAGGGCTTT GAATGGGGCA CCTTCTTGTT CCCAGGCAAT AAATTGCACC CTGGCTCAAG CGGCAACATC TGGGCCGTGC CAACCAATGC CAAAAACAAA GATCTGGTCT ACGATTTCAT CGATATCACG ATGAGCCAAG ATATTCAGAC CTTGTTGGGT AATTCTGGTG GCGTGCCAGT TAACGCCGAC GTGAGCAAAA TCACCAACGA AAAGAACAAA GAATTGATCC AAAACTTCGA TGCAATTTCC AAGGCCGATG GCTTAGCCTT CTACCCCGAC TGGCCAGCCC CAGGCTACTA CGATGTTTTG GTTGCCAACG TTCAAGAGTT GATTGATGGA ACCAAAACTC CCAGCGAAAT GCTCGATGCA ATCGCTATTC CATATCAAGA AAATCGGGCA ACGTTAGGCA AATAA
|
Protein sequence | MSTSKSTFRL SFMLLLVLLT SILAACGSET ATTAPSGSTT TSNEPRTIKL WHYEGANSAM GIAWAESIKQ FQASHPGVTI QFEEKGFEQI RQTAGMVLNS DETPDILEYN KGNATAGLLS TQGLLTDLSE VATQRGWDKL LSSSLQTTAR YDEKGVMGAG KWFGVPNYAE YVMVYYNKDM FAKANLQVPT TLAEFEAVMD AFVQQGVTPL SVGAAEYPAQ QIFYELVLSQ ADREFVNAFQ LYQGDVDFRG PEFTYGAEKM AEWVSKGYIS KDATGIKAED MGVAFTNGTF PIMISGSWWY GRFTDEIKGF EWGTFLFPGN KLHPGSSGNI WAVPTNAKNK DLVYDFIDIT MSQDIQTLLG NSGGVPVNAD VSKITNEKNK ELIQNFDAIS KADGLAFYPD WPAPGYYDVL VANVQELIDG TKTPSEMLDA IAIPYQENRA TLGK
|
| |
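The gene and protein sequences above can be checked against each other by translating the DNA codon by codon. The following is a minimal sketch, not the database's own pipeline: it builds the codon table in the standard NCBI base order (T, C, A, G), whose amino-acid assignments are the same for translation table 11 as for the standard code, and it ignores the alternative start codons that table 11 allows. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order: first base varies slowest, each base cycles T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Spaces (as used in the 10-base blocks of the record) are ignored.
    """
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in the record.
print(translate("ATGTCCACAT CGAAGTCAAC GTTCCGACTC"))  # MSTSKSTFRL
# Last 15 bases of the gene sequence, which are in frame and end in TAA.
print(translate("ACGTTAGGCA AATAA"))                  # TLGK
```

The two fragments match the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 444 residues.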