Gene Hore_23010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_23010 
Symbol 
ID7313053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2513151 
End bp2514461 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content42% 
IMG OID643612753 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002510041 
Protein GI220933133 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00000978171 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAAAA AAAGTAAAAT TATTGCCGGA AGCTTAGTAG CGCTTTTCTT AATGGTTGTA 
ATCTTCGGTG GTGTTGGTCT GGCTAAAGAA AAGGTTAAAC TGACAGCAAC TTTTGCAGCA
CCCAAGGAAC GCTGGGACTG GCTTGTTGAT AAAGCCTTAC CTGTTTTAAA GGCAAATCAC
CCTGAACTTA ATATAGAGTT TGAATATGAA GTGTTACCAT ATGATAAAAC CCATGATAAG
TTAATTACCA TGATGATAGC AAATACCCCT AGGGACTTAG TTTCAGTAGA TGGCATCTGG
CTTGGAGAGT TTGCCCAGGG TGGTTTACTT AAAGATATTA CCGAAGAAGT AAAAGAATGG
GGCAGGATGG ATGAATATTA TGCGGTTAAT CGTGAGGGTA GCAAATATAA TGGCAAATAT
TACGGTATCT GGTCCTGGAC AGATGCCAGG GTTCTGTGGT ACTGGCCTGA TTTATTAGAA
AAGGCCGGGG TTGAACCGGA AGATTTAACT ACCTGGGATG GTTATATAGC TGCAGCCGAG
AAATTAAATA ATACATTACA GAATGAAGGT ATTGAAGGTG TTCACCTGGT CGGGGCCCCT
CATTCACCTG ACATGTTTTT CCCTTATCTC TGGATGAATG GAGGTAAAAT TCTTGAAAAA
CGTGATGGCA AGTGGTATCC TGCCTTCCAT AAAGAAGCCG GTATTAAAGC CCTGACCTTT
ATTAAGAGGC AGGTTGAGGC CGGTATTAAG CCACAGAAAC AGCACTTCTG GGGCCAGGAG
TTTGCCGACA AGAGGTATGC GGTTATGTTA GAAGGAAGCT GGTTAGCCGG TAAGTTTTCT
AAAAATATAA CAAAAGAGGA ACTGGAAAAT AAAATTGGTA TGTTACCATT ATTCCCTACT
CCCTCAGAAG AGGTTGATAC TGCAACCATG GCTGGAGGAT GGGTTCTGGC AATACCAAAA
ACAAGCCGGC ATCAGGACCT TGCCTGGGAA CTGATGGAGA TTATCCAGTC TCCTGAAATT
ATGAGTCAAT TCCTGGCTAA ATTTGGTTAC TTACCAACCC AGCGGGTTAT TGCTGAAAAT
CCTGAATATA ATAAAGTACT TATAGAAAGT ATTCCTTTCT TTGATAAATA TACTAAAATA
CTGCCACTGG CCCATGGTAG GCCTAATATT CCTGAGTATC CCCAAATATC TGAAGCTTTA
AGAATTGCTA TTGAAGAAGT TTATTACCGT GGTGCTGACC CTGAAGTAGC TTTAACTAAA
GCCGCCCAGA AAGTAGCCCG TATTCTGGGT TGGCCTGGTC TGGTAGATTA A
 
Protein sequence
MLKKSKIIAG SLVALFLMVV IFGGVGLAKE KVKLTATFAA PKERWDWLVD KALPVLKANH 
PELNIEFEYE VLPYDKTHDK LITMMIANTP RDLVSVDGIW LGEFAQGGLL KDITEEVKEW
GRMDEYYAVN REGSKYNGKY YGIWSWTDAR VLWYWPDLLE KAGVEPEDLT TWDGYIAAAE
KLNNTLQNEG IEGVHLVGAP HSPDMFFPYL WMNGGKILEK RDGKWYPAFH KEAGIKALTF
IKRQVEAGIK PQKQHFWGQE FADKRYAVML EGSWLAGKFS KNITKEELEN KIGMLPLFPT
PSEEVDTATM AGGWVLAIPK TSRHQDLAWE LMEIIQSPEI MSQFLAKFGY LPTQRVIAEN
PEYNKVLIES IPFFDKYTKI LPLAHGRPNI PEYPQISEAL RIAIEEVYYR GADPEVALTK
AAQKVARILG WPGLVD