Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4340 |
Symbol | |
ID | 5736200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5547890 |
End bp | 5549014 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281501 |
Product | solute-binding protein |
Protein accession | YP_001547100 |
Protein GI | 159900853 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | [TIGR02634] D-xylose ABC transporter, substrate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTGC GGCGTAGCGC ATTCACTTCC CTCTTGTTGA TCCTCATCTC TAGCGTGATT GCAGCCTGTG GTAGCGAAAG CGCTACAACT GCTCCAACCA CAGGCACTGG TGGCACAACC AGCAGTGGCG GCAAAGTAGT CGCTTTGTTC TTGCCCGATG CCAAAACTGC CCGCTATGAA ACTGCCGACC GCCCCTACTT CGAAGCCAAA ATGAAAGAGC TTTGCCCAGA TTGTCAGGTG ATTTACAACA ACGCCAACCA AGATGCCAGC TTGCAATTGC AACAAGCTGA AGCAGCGTTG ACCAACGGCG CAAAGGTTTT GGTGCTTGAC CCAGTTGATT CTGCTGCTGC TGCTTCGATT GCCGACAAAG CCAAAGCCCA AAATGTGCCA GTCATCGCCT ACGACCGCTT GATCCTTAAC TCGGATGGCG TGAGCTACTA CATTTCATTC GACAACGAAT CAGTTGGCAA GTTGCAAGCC GAAAGCTTGG TTGCGCAATT AGACAAGCAA GGGATTGCCA ACCCAACTAT CGTCATGATC AACGGCTCAC CAACCGACAA CAATGCTAAA TTGTTCAAAG CTGGCGCTCA CAGCGTGTTT GATCCATTGG TTAGCGCTGG CAAATTGACC ATCGCCAACG AATATGACAC TCCCGACTGG AGCCCCGACA AAGCCCAAGA CCAAATGCAA CAAGCCTTGA CCAGCATGGG TAACAAAGTT GATGGCGTAT ATGCTGCCAA CGACGGTACT GGTGGCGGGG CGATTGCCGC TATGAAGGCT GGTGGTCTCT CACCATTGCC TCCAGTCACA GGCCAAGATG CTGAATTGGC GGCGATTCAA CGGATTTTGG CTGGCGATCA ATACATGACC GTGTACAAAG CGATCAAGCC ACAAGCCGAA GCTGCTGCTG AATTGGCCTT TGCCTTGCTC GAAGGCAAAA CCAGCGACAA AGCTACCAGC AAAGTCAACA ATGGCAAAAT TGATGTTCCT TCAATCTTGC TGACCCCAAT TGCTGTGACC AAAGAAAACG TCAAAGACAC GATTGTCAAA GACCAATTCC ACAAAGTTGA TCAGATCTGT GCTGGCGACT TCGCCAAAGC CTGTGCTGAT GCCGGTATTC AATAA
|
Protein sequence | MNVRRSAFTS LLLILISSVI AACGSESATT APTTGTGGTT SSGGKVVALF LPDAKTARYE TADRPYFEAK MKELCPDCQV IYNNANQDAS LQLQQAEAAL TNGAKVLVLD PVDSAAAASI ADKAKAQNVP VIAYDRLILN SDGVSYYISF DNESVGKLQA ESLVAQLDKQ GIANPTIVMI NGSPTDNNAK LFKAGAHSVF DPLVSAGKLT IANEYDTPDW SPDKAQDQMQ QALTSMGNKV DGVYAANDGT GGGAIAAMKA GGLSPLPPVT GQDAELAAIQ RILAGDQYMT VYKAIKPQAE AAAELAFALL EGKTSDKATS KVNNGKIDVP SILLTPIAVT KENVKDTIVK DQFHKVDQIC AGDFAKACAD AGIQ
|
| |