Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0259 |
Symbol | |
ID | 5732154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 302771 |
End bp | 303808 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641277383 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_001543039 |
Protein GI | 159896792 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.904874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCT TGCATTTCAG TTTATTAAGC TTGTTATTGG TCGGATTATT GGCTGCTTGT GGCGAGGCCA GCCAAACTAC CACCAGCAAT GCCACGGTTA CCACCATCAC CTTGGGCGCG TACACCACGC CACGCGAAGC CTATGCCAAG TTGATTCCGC TGTTTCAAGC CAAATGGAAG GCCGATACTG GCGGTGAAGT CAAATTTGAA GAATCATATC AAGGTTCAGG TGCGCAATCA CGGGCAATCG TTGAAGGCTT CGAGGCTGAT ATCGCGGCGC TTTCGCTCGA AGCTGATATT AATCGGATCA CCGATGCAGG CCTGATCACT CACGATTGGA AAGCTGGCAC GCATTCGGGC ATGGTTAGTA CCTCAATTGT GGTGTTTGCG GTACGCGAAG GCAATCCCAA AGGTATTCAA GATTGGGCTG ATTTAGCCAA GCCTGGAGTA CAAATTTTGA CTCCCGATCC ACGCACCAGC GGCGGCGCAC AATGGAATAT TTTAGCGTTG TATGGGGCCG CCAAACGTGG TCAGATTACA GGCGTACCCG CCAACGACGA AGCAGCAGCC CAAGCCTTTT TAGCGAGTGT GCTTAAAAAT GTCGTGGTGT TTGATAAGGG TGCGCGTGAA AGTATCACCA ACTTTGAAAA AGGCGTTGGC GATGTGGCAA TCACCTATGA AAATGAAATT TTGGTTGGCC AAAAAGGCGG CCAAACCTAT CAAATGGTCA TCCCAACTTC CACGATTTTG ATCGAAAACC CAATTGCCCT AATCGATAAA TCAGTTGAAA AACATGGCAA TCGTCAAGCA GTCGAAGCTT TTATTAACTT CTTGCATAGC CGTGAAGCCC AAGAAGTCTT TGCTGAATTT GGCTTACGCT CGGTCGATGC CGATGTTGCC AAAGCCACTG CTGAGCGCTA TCCCGCCATC AACGATTTGT TTACAATCAA CGAATTTGGT GGTTGGAGCA AAGCCACGCC TGAATACTTT GGCGATGACG GTGTCTATGC CAAAGTGCTA GCGCAGGTAC AACAATGA
|
Protein sequence | MKRLHFSLLS LLLVGLLAAC GEASQTTTSN ATVTTITLGA YTTPREAYAK LIPLFQAKWK ADTGGEVKFE ESYQGSGAQS RAIVEGFEAD IAALSLEADI NRITDAGLIT HDWKAGTHSG MVSTSIVVFA VREGNPKGIQ DWADLAKPGV QILTPDPRTS GGAQWNILAL YGAAKRGQIT GVPANDEAAA QAFLASVLKN VVVFDKGARE SITNFEKGVG DVAITYENEI LVGQKGGQTY QMVIPTSTIL IENPIALIDK SVEKHGNRQA VEAFINFLHS REAQEVFAEF GLRSVDADVA KATAERYPAI NDLFTINEFG GWSKATPEYF GDDGVYAKVL AQVQQ
|
| |