Gene Hore_21760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_21760 
Symbol 
ID7313724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2366941 
End bp2368143 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content41% 
IMG OID643612629 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002509917 
Protein GI220933009 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGTA AATATAAACT AATGATGGTT GCCTTCAGTG GAGTACCATT TATCATGGTC 
CTCGGAAATT CCATGTTAAT ACCTGAATTT CCCCAGATCA AGGCTGCTTT AGACATCGAC
CAATTTCACG TTGGTCTTCT TATTACGGTT TTTTCTATAT CAGCTGGTAT TACAATCCCT
TTTCTCGGTT ATCTGTGTGA CCAGATCGGT AGAATAAAAG TAATTGTACC CTCACTCCTG
TTATATGGGC TTGGGGGAAT TATTTCCGGG GTAGCTGCCC TGATTATGGA TAATCCCTAT
AATATTATTC TTATAGGAAG GGTCGTCCAG GGTGTCGGGG CAGCCGGAAC AGCCCCCATT
GTTATGGCTC TGGTAGGTGA TATTTTCCAG TCCGAGCAGA GAAGTGAGGC CCTGGGGATA
ATCGAGGCCG CTAATGGAAT AGGCAAGGTA GTCAGTCCCA TACTGGGATC AGCCATCGGT
TTAATCAGCT GGATTGCCCT CTTTTTCTTT TATGTCTTTT TAGCTATCCC TATTGCTGCA
GGGGTCTGGT TCTGGGGTAA GGAAGTTAAA GAAAAAGGGC AGCAAAGTTT AAAGAAATAT
CTTAGAAATG TGGGAGAAAT ATTCAAAGAA AAAGGCCTGT CACTTATAAT GACAATTCTC
TCTGGAATGC TTGTCCTCTT TATCCTGTTT GGACTTCTCT CCTATTTTTC GGATTTTCTG
GAAGCAAAAC ATAATATTAA GGGTTTTGTT AAGGGTTTAG TAATTGCCAT ACCCATTCTG
TTTATGTCAA CAACCTCTTA TATAAACGGG TATATCCTTA AAAAAGTAAA AAAATACTTT
AAAGCCTCCA TTATTACTGG TTTAATTATA ATTCCCCTTG CCCTTATAAT CCTCAGTTTT
ATAAAATCTT TAACCACCTA TCTGGTTTTG TTCTCTCTGC TCGGCATCGG GACCGGGCTT
GTTTTACCGG CTATTAATAC GCTGGTTACC AGTTCCACAA AAGCTGACCA GAGGGGGGTT
ATCACCTCTA TTTACGGCAG TGCCCGTTTT GTCGGGGTTG CTATTGGGCC ACCGGCTTTT
TCCTTTCTCG AAGAATTAAG TCTGAAAACC ATGTACTATG GGGGAAGCCT TATCGCTGGA
ATAATTATGG TTTTAGCTCT GATTTTTATT TCAGAGAAAG GTATGACTCC TAACCAGGGT
TAA
 
Protein sequence
MKSKYKLMMV AFSGVPFIMV LGNSMLIPEF PQIKAALDID QFHVGLLITV FSISAGITIP 
FLGYLCDQIG RIKVIVPSLL LYGLGGIISG VAALIMDNPY NIILIGRVVQ GVGAAGTAPI
VMALVGDIFQ SEQRSEALGI IEAANGIGKV VSPILGSAIG LISWIALFFF YVFLAIPIAA
GVWFWGKEVK EKGQQSLKKY LRNVGEIFKE KGLSLIMTIL SGMLVLFILF GLLSYFSDFL
EAKHNIKGFV KGLVIAIPIL FMSTTSYING YILKKVKKYF KASIITGLII IPLALIILSF
IKSLTTYLVL FSLLGIGTGL VLPAINTLVT SSTKADQRGV ITSIYGSARF VGVAIGPPAF
SFLEELSLKT MYYGGSLIAG IIMVLALIFI SEKGMTPNQG