Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2789 |
Symbol | |
ID | 5734670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3546761 |
End bp | 3547753 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279932 |
Product | aliphatic sulfonate ABC transporter periplasmic ligand-binding protein |
Protein accession | YP_001545555 |
Protein GI | 159899308 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACGCC GTTTTTCACT TGGTTGGCAA CGCTTGGCCT TGGCTTTAAT CGTCGGGCTT TTGGCAAGCT GTGGCTCCAC CAGCAATGAT GCTGCTTCAG ATGGACAAAC GAGCACGCGC ACCCTGCGGA TCGGCTATCA AAAGGGTGGC TCGTTGCCGA TTCTCAAGGG CAACGGCGCT CTTGAGAGTC GGCTCAAGGA TGTCAAGGTT GAATGGATTG AGTTTGCCGC AGGGCCGCCG CTGCTTGAAT CACTGAATGC AGGCAGCATC GATCTTGGCT CGACTGGCCA AACTCCGCCA ATTTTTGCCC AAGCTGCTGG CACGCCGCTG GTTTATGTGG CCTCGGTCGC GGCTTCGCCT ACCGGCCAAG CCTTGCTCGT ACCGAAAGAT TCACCAATTC AAAGCGTGAG CGAGCTCAAG GGCAAAAAAG TTGCCTTTGC CAAAGGTTCG AGCGCCCATT ATTTTGCAAT TGATGTTCTG CGTGAAGCAG GCTTGCAATA TAGCGATATT GAGCCAGCAT TTCTGACTCC ACCCGATGCG CGGCCAGCGT TCGAGGGTGG CAGCGTCGAT GCTTGGATTA TTTGGGAACC CTATTTGACG ATTGCGCTCA AAGCAACCGA TGCGCGAGTT GTGCACGATG GATCGAGCCT TGCGCCCAGC CATAGCTACT ATTTAGCAGC CAAAAGCTTT GCCGAGCAAC ATCCCGATCT GGTCAGCGCA ACCTTGGAAG AAATTCAAAA AGTTGAGCAA TGGTCAGCCC AAGAACCGCA AGCAGTTGCC AAAATTCTAG CTCCCGTGAT TGGGGTTGAT GCGGCAATCT TAGAAGAAGT GGCCAAAAAA CAGGCCTTTG GGCTTAGCCC AATCAATGAC GCAATTGTGA ATGAGCAACA GCAAATTGCC GATACCTTTT TTGAATTAGG CTTAATTCCC AAAAAGGTCA GCATTCGCGA AGCAGTTTGG ACGTGGCAAC CAACCCAAGC CAGCGCTCAG TAG
|
Protein sequence | MARRFSLGWQ RLALALIVGL LASCGSTSND AASDGQTSTR TLRIGYQKGG SLPILKGNGA LESRLKDVKV EWIEFAAGPP LLESLNAGSI DLGSTGQTPP IFAQAAGTPL VYVASVAASP TGQALLVPKD SPIQSVSELK GKKVAFAKGS SAHYFAIDVL REAGLQYSDI EPAFLTPPDA RPAFEGGSVD AWIIWEPYLT IALKATDARV VHDGSSLAPS HSYYLAAKSF AEQHPDLVSA TLEEIQKVEQ WSAQEPQAVA KILAPVIGVD AAILEEVAKK QAFGLSPIND AIVNEQQQIA DTFFELGLIP KKVSIREAVW TWQPTQASAQ
|
| |