Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_21760 |
Symbol | |
ID | 7313724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 2366941 |
End bp | 2368143 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643612629 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002509917 |
Protein GI | 220933009 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGTA AATATAAACT AATGATGGTT GCCTTCAGTG GAGTACCATT TATCATGGTC CTCGGAAATT CCATGTTAAT ACCTGAATTT CCCCAGATCA AGGCTGCTTT AGACATCGAC CAATTTCACG TTGGTCTTCT TATTACGGTT TTTTCTATAT CAGCTGGTAT TACAATCCCT TTTCTCGGTT ATCTGTGTGA CCAGATCGGT AGAATAAAAG TAATTGTACC CTCACTCCTG TTATATGGGC TTGGGGGAAT TATTTCCGGG GTAGCTGCCC TGATTATGGA TAATCCCTAT AATATTATTC TTATAGGAAG GGTCGTCCAG GGTGTCGGGG CAGCCGGAAC AGCCCCCATT GTTATGGCTC TGGTAGGTGA TATTTTCCAG TCCGAGCAGA GAAGTGAGGC CCTGGGGATA ATCGAGGCCG CTAATGGAAT AGGCAAGGTA GTCAGTCCCA TACTGGGATC AGCCATCGGT TTAATCAGCT GGATTGCCCT CTTTTTCTTT TATGTCTTTT TAGCTATCCC TATTGCTGCA GGGGTCTGGT TCTGGGGTAA GGAAGTTAAA GAAAAAGGGC AGCAAAGTTT AAAGAAATAT CTTAGAAATG TGGGAGAAAT ATTCAAAGAA AAAGGCCTGT CACTTATAAT GACAATTCTC TCTGGAATGC TTGTCCTCTT TATCCTGTTT GGACTTCTCT CCTATTTTTC GGATTTTCTG GAAGCAAAAC ATAATATTAA GGGTTTTGTT AAGGGTTTAG TAATTGCCAT ACCCATTCTG TTTATGTCAA CAACCTCTTA TATAAACGGG TATATCCTTA AAAAAGTAAA AAAATACTTT AAAGCCTCCA TTATTACTGG TTTAATTATA ATTCCCCTTG CCCTTATAAT CCTCAGTTTT ATAAAATCTT TAACCACCTA TCTGGTTTTG TTCTCTCTGC TCGGCATCGG GACCGGGCTT GTTTTACCGG CTATTAATAC GCTGGTTACC AGTTCCACAA AAGCTGACCA GAGGGGGGTT ATCACCTCTA TTTACGGCAG TGCCCGTTTT GTCGGGGTTG CTATTGGGCC ACCGGCTTTT TCCTTTCTCG AAGAATTAAG TCTGAAAACC ATGTACTATG GGGGAAGCCT TATCGCTGGA ATAATTATGG TTTTAGCTCT GATTTTTATT TCAGAGAAAG GTATGACTCC TAACCAGGGT TAA
|
Protein sequence | MKSKYKLMMV AFSGVPFIMV LGNSMLIPEF PQIKAALDID QFHVGLLITV FSISAGITIP FLGYLCDQIG RIKVIVPSLL LYGLGGIISG VAALIMDNPY NIILIGRVVQ GVGAAGTAPI VMALVGDIFQ SEQRSEALGI IEAANGIGKV VSPILGSAIG LISWIALFFF YVFLAIPIAA GVWFWGKEVK EKGQQSLKKY LRNVGEIFKE KGLSLIMTIL SGMLVLFILF GLLSYFSDFL EAKHNIKGFV KGLVIAIPIL FMSTTSYING YILKKVKKYF KASIITGLII IPLALIILSF IKSLTTYLVL FSLLGIGTGL VLPAINTLVT SSTKADQRGV ITSIYGSARF VGVAIGPPAF SFLEELSLKT MYYGGSLIAG IIMVLALIFI SEKGMTPNQG
|
| |