Gene Hore_05410 details


Gene Information

Locus tag: Hore_05410
Symbol:
ID: 7313505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 587398
End bp: 589026
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 38%
IMG OID: 643610964
Product: Sodium:solute symporter
Protein accession: YP_002508294
Protein GI: 220931386
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
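
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(587398, 589026)
    print(length)                     # 1629
    print(protein_length_aa(length))  # 542
```

Under these assumptions the record is self-consistent: 589026 - 587398 + 1 = 1629 bp, and 1629 / 3 = 543 codons, of which 542 encode residues.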


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000184365
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATGGT ACTATCTTAT TCCGTTAGTC TATTTATTTT TACTACTGGT TGTTGGTTTT 
GTTATAGCTA AAAGACAGGA AACCCGGTCA GATTTTTATG TTGCCTCAAA TAAAATGGAT
GGATCAGTCC TCTTTGCTAC AGTTATGTCC ACAGTGGTTG GCGCAAATAC ATATATGGGT
TTTAGTGGCT TGATTTATAA TGGTGGACTT CATTTTATGT GGATGCTCAG TGGTGCCGGT
CTTGCTTATT TTATCCTCTT CTTTATATCA GGTAAAATAA GGAAAATAGC CACAAAATAC
GAAGTATTCA CTCTTCCTGA CCTGGTAGAG CTTAGATATT CTAATCCTGT TGCCCTACTA
ACTACTTTCT TTTCCCTTAT TGGGCTAGTT GGAGGTATCG GAGGTGGTCT TCTCGGTTTA
GGAGTTATAC TTAATTCTTT ACTGGGAATA CCCACCACTA CTGCTATAAT TGTTACTTCC
ATTATTACTA TCATTTATAC CTGTCTTGGA GGTTTATGGG GAGTATCCCT GACAGACTGG
ATTCAATCTA TTATTATGAT TGCTGGTGTA GCAGTCTGTA TAGTATTTGG GATAACCTCT
GTAACACCGG GACAATCATT TGTCAATGGT GCCTTCGAAA TAGTAAATGT ATTAAAACAA
CAATTAGGAA CAGAACTGGT TAGCCCCTTT GCCGGTTTAA CCTTTTTTAT GGCTCTGGCC
TGGACCATTA CCTTTATGCC CCTTAATACT ATCTCTCAGA CCCAGATCCA GAGGGTTTAT
GCAGCAAAAA ATGTAAAGAC TATTCGTGGT GTCAGTTTAC TAATGATTAT TTTTGTAGCT
ATGGTCCTCA CTTTCGGTTT AGCCTTTATC GGAATTCTTG GAAGAGTTGC TTTACCCGGT
TTAAAAAATG CTGAGGCAGT CTTCCCCCAG ATGAGCATGA AAGTTATCAA CCCTGCATTA
GGTATTTTAA TTGTAACGGG AATTATGGGA GCTGCTATGT CTACAGTAGA TTCAAACCTT
CTCGGTTCCG CCATGCATGT CACCCGTGAC CTATATGAAC GGTACATGAG ATATAAGAAT
AAGTCTGTTG ATGAAAAGCG TATTTTATTT ATCAGTCGGG TAACCATTGT TATTATTGGT
GTAATTAGTA CTATAGCTGC TCTATTCACT CCTTCTATAA TGAGCCTACT ACTGATAACA
ATGAAGATAT TTGCCGGAGC TACTTTTGCC CCTGTACTTA TCGGTCTTTA CTGGAAAAGA
GCCAATGCTT TCGGGGCTTT ACTGGGTGAA ATTCTTGGAG GTATGGCTGT TGTTATTAAT
ATTATTCACC CCGTTGTCAA CCTGGATCCT GTCATATTTG GAATTATTAT GGCAGTTTTA
GGAACGATAA CTGGCAGCTT ATTTACCAAA GAAAATACAG AAAAAGGCAG TATTTTTTCT
TTTGCTAATG ATATTTCATC AAAAGGATGG CTCGCAGTTA TAGCTATTGC TCTTCTCTAT
TTTGGATGGG TTATAAGTAT GAACAACTAC GCCATGTGGC CGTATTTCAT TATAACTACT
GTAGTATTAC TGGTTTTATC AGTTGTCTTC CTTATTTATA GTTTTATCAC TGAAAGAACC
GGCAATTAA
 
Protein sequence
MEWYYLIPLV YLFLLLVVGF VIAKRQETRS DFYVASNKMD GSVLFATVMS TVVGANTYMG 
FSGLIYNGGL HFMWMLSGAG LAYFILFFIS GKIRKIATKY EVFTLPDLVE LRYSNPVALL
TTFFSLIGLV GGIGGGLLGL GVILNSLLGI PTTTAIIVTS IITIIYTCLG GLWGVSLTDW
IQSIIMIAGV AVCIVFGITS VTPGQSFVNG AFEIVNVLKQ QLGTELVSPF AGLTFFMALA
WTITFMPLNT ISQTQIQRVY AAKNVKTIRG VSLLMIIFVA MVLTFGLAFI GILGRVALPG
LKNAEAVFPQ MSMKVINPAL GILIVTGIMG AAMSTVDSNL LGSAMHVTRD LYERYMRYKN
KSVDEKRILF ISRVTIVIIG VISTIAALFT PSIMSLLLIT MKIFAGATFA PVLIGLYWKR
ANAFGALLGE ILGGMAVVIN IIHPVVNLDP VIFGIIMAVL GTITGSLFTK ENTEKGSIFS
FANDISSKGW LAVIAIALLY FGWVISMNNY AMWPYFIITT VVLLVLSVVF LIYSFITERT
GN
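
The protein sequence can be derived from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
# Elongation codon assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    head = clean("ATGGAATGGT ACTATCTTAT TCCGTTAGTC")
    print(translate(head))  # MEWYYLIPLV
```

Passing the full gene sequence through `clean` and then `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field.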