Gene Hore_21930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_21930 
Symbol 
ID7313741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2384081 
End bp2385379 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content33% 
IMG OID643612646 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002509934 
Protein GI220933026 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones79 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATACAC AAAAAGGTTT GGAAAAAACA CTACCCGAAA AAGAAATGAA AAAATCAGTA 
AAAAAGAATA TTATGTTTTT ATTAATGGGG AAATTCGTTT CTGTACTGGG TAGTCAAATT
TATTCTTTTG CAATCAGCCT CTATGTTTTA TCAATAACTG GTTCGGGTCT TAGTTTTTCT
CTTACCCTGG CACTTTCTAC CTTACCCAGG GTTATTTTTG GTCCCATTTC GGGGGTAATA
GCTGACAGGG TTGATAGAAA GAAAATGGTA GTAGCAATGG ATATTATTAG TGGATTAGTG
GTAATTGGAT TATTTTCCCT AAGTATAATT GATGAACTCA GGTTAGTCTA TATTTATTCA
ACTACATTTT TGCTTTCTAC GTGTAGTATC TTTTTTAACA CCCCCTTAAC TGCATCCCTG
CCAAACATTG TGGATGATGA AAATCTTACA AGGATCAATT CATTGAGTCA AACTATAGAA
TCTATATCGT CAATTGCCGG ACCTTTTATT GGCGGTATTG TTTATGCAAT TATGGATATT
AAAACATTTT TAGTTATTAA TGGAATATCT TTTATAATCT CAGGGATATC AGAATTATTT
ATAGATTTCA AATTGAATAG TCGTGGAAGA GTTCTTGAAG AAAGTAATAT GGAGAAAGAA
AAGGTATCCT TTTTTGTTGA TTTAAAGGAA GGCATAAGAT ATATAGCTAG TCAGAAATGG
CTTATTGTCC TCAGTTCATT TTTTGTAATA TTAAATATGT TGGTCATGAT GGGTTTACTG
GTACCAGTTC CCTATATTGT AAGGGAAATC TGGGGATTTA CCTCCCAACA ATATGGTTAT
TTAAATTCAA TGTTTCCGAT GGGAATATTA GTTGGCTCTC TTTTGCTGGC TATTTTGCCG
CAAAAGGGAA AGAAATTTAA AAGGTTTATG TTTTTTACTA TGGTTTTTTC AATTGCTGTT
ATTTCAGTTG GTATAATTAC TTCAGAGATG ATTTTTGAAC TGAACAACCT GCAGTATTTG
TTTATTTTAA TGGGTTTATA CTTTATTATA TCAGTATCTG CCATATTTAT TAATGTCCCC
CTTGAAGTGA CATTACAACG GCTTGTACCA GATGATAAAC GTGGTAGGGT TGAGGGGAGT
TTAGGGTCCC TATCTGAGGC TTTATCGCCA ATAGGTGTTA TAGTTGCTGG TGTACTTGTT
GACTTAATAT CTCCCTGGAT TTTACCTATC ACTTGTGGAA TTATAATGTT GGTTTTGTCT
ATAGCAATGG GGAGGGTAAA AGTTGTTAAG GAAATCTAA
 
Protein sequence
MNTQKGLEKT LPEKEMKKSV KKNIMFLLMG KFVSVLGSQI YSFAISLYVL SITGSGLSFS 
LTLALSTLPR VIFGPISGVI ADRVDRKKMV VAMDIISGLV VIGLFSLSII DELRLVYIYS
TTFLLSTCSI FFNTPLTASL PNIVDDENLT RINSLSQTIE SISSIAGPFI GGIVYAIMDI
KTFLVINGIS FIISGISELF IDFKLNSRGR VLEESNMEKE KVSFFVDLKE GIRYIASQKW
LIVLSSFFVI LNMLVMMGLL VPVPYIVREI WGFTSQQYGY LNSMFPMGIL VGSLLLAILP
QKGKKFKRFM FFTMVFSIAV ISVGIITSEM IFELNNLQYL FILMGLYFII SVSAIFINVP
LEVTLQRLVP DDKRGRVEGS LGSLSEALSP IGVIVAGVLV DLISPWILPI TCGIIMLVLS
IAMGRVKVVK EI