Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_05470 |
Symbol | |
ID | 7313511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 595469 |
End bp | 596728 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643610969 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002508299 |
Protein GI | 220931391 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000000120862 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAGG AAGGAGGTGA ACTTACATTG GAAAACCCCT TTAAAGTATA TCGTGGTTTA AATAGAAATA TTTATATTAT TTTTATTGGA CAGGTTATTA ATTCAATGGG AGCGTTTGTT TTTCCCTTTT TAACCATGTT TCTAACTCAA AAAATAGGCA TGTCCCCTGC TGAAGCAGGT TCATATGTTA CTGTTGCAGC CCTGGCAAAT GTTCCCGGTA TGTTTCTTGG CGCGAAACTG GCTGATAGTT TTGGTCGGAA AAGGTTATAT TTAATTTCCT CGACTCTCAT GGCCTTAATG TTAATTCCAC CTGCTTTTCT GGGTACCAGT AAAGTTGTTA TTTATTTTTT AATAATGATG TCACTTTTTG CTGGTGCTGT AAATCCTGCA TTTAATGCAA TGGTAACAGA TTTAACCAGG GGTGAAGAGA GGAAAAAGGC CTTTTCATTA CTTTACCTTG GATGGAATAT GGGTTTTGCT ATTGGTCCAA TGATTGCTGG GTTTTTATTT AACCACTATT TACCTTTATT ATTTTTAGGG GATGCAGCAA CTGCCTTTAT TGCTATAGTA CTTATTGGAA TTTATGTACC GGAGACAAAA GGAATGATTG AAGAAACTCC CGATGAGGAA TTACCGGAAA ACGAGAGGGC AGAAGAAGGG TCAATTTTTC GTGTTTTGTT AAAACGACCG GGGATAATTC TTGTAAGCTT TATTTTGTTA TTCTTCCGTC TGGTATATGC CCAGAGTTCT TTTGCCTTAC CAATTCAAAT GAATGAGATT TTTGGTCAAC AGGGTCCTGC TTATTATGGC ATAAATTATA GCTTTAATGC TATTGTAGTT GTGGCTTTTA CAGTGCTGGT AACCAGTGTC ACTGTTAAAC TGAAGCCACT GGCCAATATA ATAATTGCGG GATTATTATT GGCAGTAGGG TTTGGTATGA TTTATTATAT TGATATACTT CCCTTGTTTT TTCTGTCCAC TTTTGTCTGG ACCATCGGTG AAATCCTGGA GGCAACAAAT GTCAATGTTT ATATAGCGTC TCATGCTCCT GTAAGCCACC GGGCAAGGTT TAATTCAATT TTTATGTTTA TTTCAGGAGC AGGGTATGCA TTTGCCCCAA AATTAGGTGG GTTGTTTTTA GAGTATTATT CAATTAGGGA AATATGGTTA GCTAGTTTTT TTGTAATGGT AATTGCCAGT AGTGCTCTTT TACTTTTTTA CTTGGGGCAG GAACGGGTTA AAAGATTAAC ATGTAAATAA
|
Protein sequence | MTQEGGELTL ENPFKVYRGL NRNIYIIFIG QVINSMGAFV FPFLTMFLTQ KIGMSPAEAG SYVTVAALAN VPGMFLGAKL ADSFGRKRLY LISSTLMALM LIPPAFLGTS KVVIYFLIMM SLFAGAVNPA FNAMVTDLTR GEERKKAFSL LYLGWNMGFA IGPMIAGFLF NHYLPLLFLG DAATAFIAIV LIGIYVPETK GMIEETPDEE LPENERAEEG SIFRVLLKRP GIILVSFILL FFRLVYAQSS FALPIQMNEI FGQQGPAYYG INYSFNAIVV VAFTVLVTSV TVKLKPLANI IIAGLLLAVG FGMIYYIDIL PLFFLSTFVW TIGEILEATN VNVYIASHAP VSHRARFNSI FMFISGAGYA FAPKLGGLFL EYYSIREIWL ASFFVMVIAS SALLLFYLGQ ERVKRLTCK
|
| |