Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1180 |
Symbol | |
ID | 5733073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1354081 |
End bp | 1355538 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278320 |
Product | extracellular solute-binding protein |
Protein accession | YP_001543956 |
Protein GI | 159897709 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACATC GCAGTTTCCT GTTGTTACTG CTGATCAGCA TGCTGATCGC CGCCTGTGGT GGTAGCACAA CGCCTACGAC CGCACCAACC GCTGACACTG CTGCTCAAGC CACCACGGCT CCAGCCGCAA CCGAAGCACC AGCCGCGACC GAAGCACCAG CCGCAACCGA AGCACCAGCC GCAACCAAAA CTACAGATAG CACTGGCAGC GTCCCAGCCG TCAACCCTAA CTATGCCGAA TTAGTCCGCG CCGAAGCTGG CGAATTCAAA GGCACTAAAG TCAGCATTTT CGGTTTGTGG GTTGAAGCTG AAGACACCGC CTTCAATCAA ACCTTGGCTG CTTTCGAAGC CCGCACTGGC ATCGATGTTC AGTACGAAGG CTCGAAAGAT TTCGAAACTT TGATTCCTGT TCGCGCCGAA GGCGGTAACG CCCCTGACAT CGCTGGTTTC TCACAACCAG GCTTGATGGC AACCTTGGCT CGCAAGAACA AGTTGGTCGA TATCTCAAGC TTCATCAAAC CAGAAGACTT GCAAAAAGAT TACATCCAAT CATGGATCGA TCTTGGCTCA GTTGATAACA CCCTCTACGG TGTCTTCTTC CGCGCCAGCA CCAAGAGCAT CGTCTGGTAT CCAGTTCCAG CCTTCGAAGA AGCTGGCTAC ACCATTCCAG AAACCTGGGA TGAATTGACC GCTTTGAGCG ACAAGATGGT TGCTGATGGC AACACTCCAT GGTGTATCGG CTTGGAACAC GGCAACGTGA CCGGTTGGGT GGCTACCGAC TGGATGGAAG ACATCATGTT GCGCACCGCT GGCGCTGAAA CCTACGACAA GTGGGTCAGC CACGAAATCC CATTCACCGA CCCAAGCGTC AAGAACGCCG CTGAAGTCAT GGGTAAAATC TGGTTCAACG AAACCTACGT TGCCGGTGGC CGCGAAGGTA TCTTGACCAC CACCGTGGCC GATACCCAAA CCCCAATGTT CAACGCCGAC AAACCAGGCT GCTGGTTGCA CCGCCAAGCT GGCTGGATTC CATCATTCTT CCCAGAAGGC AAGAAAGCTG GTACCGACAG CAACTTCTTC TACTTGCCAC CAATTGATCC AGCTCAAGGC AAGCCTGTGT TGGGTGGTGG CGACGTGTTC GCAATGTTCA ACGACCGCCC TGAAGTTCGC GCCGTGTTGC AATACTTGGC TAGCCCAGAA TCAGCCAAAG GCTGGGTCGA AGCTGGTGGC TTTATCTCAC CAAACAAGAG TGTAGATTTG GCTTGGTATG GTAACGACAT CGACCGCCGC CAAGCTGAAA TCATCAAGGA AGCTACCGTA TTCCGCTTTG ATGCTTCTGA CTTGATGCCA GCCGAAGTTG GCGTAGGTAC CTTCTGGAAA GGTATGGTCG ATTACGTGAA CAATACCGAA ATCGACACCG TGTTGCAAAC CATCGAAGAA AGCTGGCCAC AAAACTAA
|
Protein sequence | MKHRSFLLLL LISMLIAACG GSTTPTTAPT ADTAAQATTA PAATEAPAAT EAPAATEAPA ATKTTDSTGS VPAVNPNYAE LVRAEAGEFK GTKVSIFGLW VEAEDTAFNQ TLAAFEARTG IDVQYEGSKD FETLIPVRAE GGNAPDIAGF SQPGLMATLA RKNKLVDISS FIKPEDLQKD YIQSWIDLGS VDNTLYGVFF RASTKSIVWY PVPAFEEAGY TIPETWDELT ALSDKMVADG NTPWCIGLEH GNVTGWVATD WMEDIMLRTA GAETYDKWVS HEIPFTDPSV KNAAEVMGKI WFNETYVAGG REGILTTTVA DTQTPMFNAD KPGCWLHRQA GWIPSFFPEG KKAGTDSNFF YLPPIDPAQG KPVLGGGDVF AMFNDRPEVR AVLQYLASPE SAKGWVEAGG FISPNKSVDL AWYGNDIDRR QAEIIKEATV FRFDASDLMP AEVGVGTFWK GMVDYVNNTE IDTVLQTIEE SWPQN
|
| |