Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_04810 |
Symbol | |
ID | 7314460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 516663 |
End bp | 517934 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643610904 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002508234 |
Protein GI | 220931326 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.332463 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTCT TCTTGTTTGT GGATCAGAAT GCTATGAACC CGATAATTAA CGATTTGGTG GCTGAATATG GGGTTGGTGA AGATAAAATT GGGATTATCG GGTCAGCCTT CATTATACTG GGGGCAGTTG TAAGCCTTAT TTTTGGTTAT CTGGCTGATA AAAAGAGCCG GAAGATGTTA CTGGCCTTTG TTGTCCTTGT CGGGGAAATA CCATGTTTTT TAACCGGTTT TGAGTATTTC ACCCGGACTT ATGAACAGTT ATTAATATTA AGAATATTGA CTGGATTGGG AATAGGAGGT ATCTTTCCCC TGACCTTTTC CCTGATCGGG GATTATTTCC CCGCTGGACA GCGTGGCATA ATAAATGCTA TGGTAACAAC AGCCTGGGGA GTAGGTCAGA TCCTGGGACA GACCCTGGCT GGTTTTTTAT CAGGTCCCTA TGGCTGGAGA TTACCTTTTA TTATAGCAGC TGTCCCCAAT TTTGTCCTCG TTCCTTTGTT TGTTTTGGTG GCCCGGGAAC CGAAACGTGG GGCCCAGGAA GAAGAATTAT CAGAAGTAAT AGAACAGGGT CTGGAGTACA GGGAAAAGAT AAAGTTTTCG GATTTGAAAG AAATATTTAA AAATAAGACA AATATATTGG GTTTCCTGCA GGGGATTCCT GGTTCCCTTC CCTGGGGGTT AATCCCCTTC TTCATAGTTC CCTTTTATGA ACTACATAAG GGTTTTTCCC GTGGTATGGC CACCATTTTG AGTCTCTTCC TGGGGATAGG AGCTACCGTA GGAGGTATCT TTGGTGGCTG GATGGGTGAT AAAATTTATA AAAAATCTCC CAGGATGCTA CCGGTTTTTA ATGCTGTTTT CATATTACTC GGGGTAATCC CAGGATTTTT TCTCATGGGA CTCAATTATG GCTCTGACCC CGGGTGGAAT GAGTTAATAC TTCCACTTAT TTTCAGTTTC TGTACCGGGG TTGTAGTTTC TTTGCCGGCC CCCAATATTA AATCTATTTT GATTAATGTA AACCCTCCAG AACACCGGGG TACTGTTTTT TCGATACATA ATCTGACCGA TAGTCTGGGA CGTGGTGTTG GTCCCCTTAT CGGTGGGTTT CTGGTTGTAT CCCAGGGTTA TCAATGGACT ATGTATTTTG CTGTTTTTGC CTGGATTCCC TGTGGCCTTA TATATTTATG GATGTACCGT ACTATAGATA ATGATCTGGA TACACTCAGG GGTTATCTGA AAAGAAAACG GGAGAAAATT ATCGGGATGT AA
|
Protein sequence | MSVFLFVDQN AMNPIINDLV AEYGVGEDKI GIIGSAFIIL GAVVSLIFGY LADKKSRKML LAFVVLVGEI PCFLTGFEYF TRTYEQLLIL RILTGLGIGG IFPLTFSLIG DYFPAGQRGI INAMVTTAWG VGQILGQTLA GFLSGPYGWR LPFIIAAVPN FVLVPLFVLV AREPKRGAQE EELSEVIEQG LEYREKIKFS DLKEIFKNKT NILGFLQGIP GSLPWGLIPF FIVPFYELHK GFSRGMATIL SLFLGIGATV GGIFGGWMGD KIYKKSPRML PVFNAVFILL GVIPGFFLMG LNYGSDPGWN ELILPLIFSF CTGVVVSLPA PNIKSILINV NPPEHRGTVF SIHNLTDSLG RGVGPLIGGF LVVSQGYQWT MYFAVFAWIP CGLIYLWMYR TIDNDLDTLR GYLKRKREKI IGM
|
| |