Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0443 |
Symbol | |
ID | 5732342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 519140 |
End bp | 520015 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277569 |
Product | ectoine/hydroxyectoine ABC transporter solute-binding protein |
Protein accession | YP_001543222 |
Protein GI | 159896975 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR02995] ectoine/hydroxyectoine ABC transporter solute-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000249554 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGTG TTTGGTGGGG TGTGGCTGGT TTAGCTCTCA TCGGAATGAT CACAGTTTTG ATTTGGTGGC TAGCAAAACC GCTTGAAACA ACCTTGGAAC GTGCCCAACG CACAGGCATG ATCCGGATTG GCTATGCCCC CGAAGCGCCT TTCTCGTATC GCGATGCTAC TGGCACTGTG GTTGGCGAAG AGGCAGTGGT GATTACTTGG GTGATGCAGC GCCTAGGAGT TTCTCAGCTG GAATGGGTGC AAACCGAATG GGCCGATTTA ATTCCTGGCT TACAAGCTGG GCGTTTTGAC TTGATTGCCA GTGGAATGTT TATTACCTGT GAGCGAGCCG AACTCCTTGC GTTTAGCCAA CCAACCTTTG CCCTCAGTCC AGCTATGCTC GTCGCTAAAA CTAACCCATT GGGCATTCAG AGTTTTGCCG ATTTTCAGCG GCCAGATCGG CGTTTGGCGG TGATGCGTGG TGCACGTGAG GCCGAAATTG CCCAAGCCCT GGGGATTGCG CCAGAACAAT TATTATTTGT GCCCGATGTG CAAACAGGCT TGGCGGCAGT GCTGGCAGGC CGTGCCGATG CCTTAGCCTT GACCGATATC AGCATTGATT TGTTGGTATT ACAAGCGCCA GATCAAGTTG AACGAGCCAT GCCGTTTGTG CCACCAATTA TTGATGGAAA TTTAAGTATT GGCTATGGAG CCTTTGCGAT GCGTCATAAG GATGCACATT TGCGCACAGC AATTGATCAA CAGTTGATTG GATTTATTGG CAGTGACGAA CATTATGGCT TAATTGCGCC GTTTGGGTTT TCGCGCGAGC AATTGCCCAA TCGTTCAACC GCCAGTCTGC TCCAAGGTTG TGAGAATGGC TCATGA
|
Protein sequence | MRRVWWGVAG LALIGMITVL IWWLAKPLET TLERAQRTGM IRIGYAPEAP FSYRDATGTV VGEEAVVITW VMQRLGVSQL EWVQTEWADL IPGLQAGRFD LIASGMFITC ERAELLAFSQ PTFALSPAML VAKTNPLGIQ SFADFQRPDR RLAVMRGARE AEIAQALGIA PEQLLFVPDV QTGLAAVLAG RADALALTDI SIDLLVLQAP DQVERAMPFV PPIIDGNLSI GYGAFAMRHK DAHLRTAIDQ QLIGFIGSDE HYGLIAPFGF SREQLPNRST ASLLQGCENG S
|
| |