Gene Hore_19250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_19250 
Symbol 
ID7312740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2058283 
End bp2059617 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content44% 
IMG OID643612371 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002509667 
Protein GI220932759 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00108172 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAAA AATCCACCCG GAGGAAGCTA TTACATAACA AAGACTTTAT TCTCCTCTTT 
CTGGGAGGAT TTGTCTCCCG GATAGGTAGT AAAGTACACT ATGTTGCTAT GACCTGGTTT
GTTTTAAAAC TGACCGGAAG TGGTACCGCA GCCGGAACCG TGTTATTACT GGCAACTCTC
CCCGGAGCTA TTTTAGGGCC TGTTGGTGGA GTAATAGCAG ACAGAATTAA CCGTAAACTT
ATAATTGTCA GTATGGATAC CGTCCGGGGG CTGATTGTAA TCTGGCTTAG CTGGACCGTC
TATAACGGAA CCGCTGGCTT TTATCACATC TGTATTGCCA CCTTTCTGGT CGCCCTGAGT
GGCACCTTTT TCAATCCCGC AGTAACGGCA TCTATACCCA ATATAGTTGA AAAACATAAT
TTACAGAAGG CAAATTCCCT CGAACACTTA AGCTTTCAGG GGACTTCCAT TATCGGAGCT
GCTACCGGAG GGATTCTTAT TGCTATCTTT GGGGTAGCCG GGGTCTTCTT AATTGACGGG
ATAAGTTATT TAATCTCAGC CTTCTCCGAG TTATTTATTA ATATTCCCCC TGTGAAGAGA
GAAGAACAAT CCGGGGATAA CGGGGAATTG AGTAAATTTA CTATTCTGTA CAATGATCTC
AGGGAAGGAG CCCGATACCT TTACTCCAAC AAACCCCTGT TTACCCTGTT CAGTATATCT
ATTATTATTA ACTTCCTTTT TGCCGGGGCT ATGGCAGTCG GGATTCCCTA TGTGTTTAAA
GAAGTCCTAC AGGTAAACAG TAAGTTATTC GGCCTGGCCC AGTCCTTCTT TCCAGCTGGG
GCTATCCTGG GAGCAGTTAT TATGAACTTT TTACCACCAG TTAAAAATTT TTTCAGGACC
CTGTTTACAG GGATTACCTT TCAAACAATA CTTCTGGCGG CAATCGGTTT ACCTATTTCC
CCTTTCATGG TAGATAAATA CCCGGTTATA AGTTTATTCA TACTCATGGC CGTTATTCTC
ATTCTTTTCG GGGCCTTCAA TGCCTATACC AATATCCCCA TTAATACTAT GCTCCAGAGG
TTAATAGATG ACAGGGTAAG GGGTAGGGTT TTCGGCCTTC TGGCCACACT AAATATGGGG
TTAGTACCGG TTTCCATGTG GGCAGCAGGG TGTCTTCTTG ATGCCTTCCC GGCCTATCTC
CTGTTTGTGG GAGCAGGGGG AATTATGGTC GGTGTACTTG CCTATAGTGT ATCCCTCCCT
ACCCTCAAAC CATTAAAAAA TGAAGTCTAT ATTGATAAAA GAGAGAACCC GGCAGAATAC
TCTGCCGGGG TATAA
 
Protein sequence
MDEKSTRRKL LHNKDFILLF LGGFVSRIGS KVHYVAMTWF VLKLTGSGTA AGTVLLLATL 
PGAILGPVGG VIADRINRKL IIVSMDTVRG LIVIWLSWTV YNGTAGFYHI CIATFLVALS
GTFFNPAVTA SIPNIVEKHN LQKANSLEHL SFQGTSIIGA ATGGILIAIF GVAGVFLIDG
ISYLISAFSE LFINIPPVKR EEQSGDNGEL SKFTILYNDL REGARYLYSN KPLFTLFSIS
IIINFLFAGA MAVGIPYVFK EVLQVNSKLF GLAQSFFPAG AILGAVIMNF LPPVKNFFRT
LFTGITFQTI LLAAIGLPIS PFMVDKYPVI SLFILMAVIL ILFGAFNAYT NIPINTMLQR
LIDDRVRGRV FGLLATLNMG LVPVSMWAAG CLLDAFPAYL LFVGAGGIMV GVLAYSVSLP
TLKPLKNEVY IDKRENPAEY SAGV