Gene Hore_20520 details

Gene Information

Locus tag: Hore_20520
Symbol:
ID: 7314376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2215698
End bp: 2216951
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 41%
IMG OID: 643612496
Product: extracellular solute-binding protein family 1
Protein accession: YP_002509792
Protein GI: 220932884
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
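
The start, end, gene length, and protein length fields in this record are consistent with each other. The short Python sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record.
start_bp = 2215698
end_bp = 2216951

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which adds no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1254 417
```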


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.561553
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAA GGGTATTAAT GGTTTTGAGC CTGGTCCTGG TATTGTTGGT TTCTCTGTCC 
ACTGTTTCTA TGGCTAAAAA AGTCTTAACT ATTAACAGTT ATATCAGTGA CCCTGTCCCC
AAAGAAGCTT TTGAAGATGT TATTAAGGCC TTTGAAGAAG CACACCCTGA TATTGATGTC
CGGGTAAGTA CTACTGCTCA TGAAGATTTT AAAAAGGCCC TGAGGATCTG GTTAAGTTCT
GATAATCCAC CAGATGTCAT CACCTGGTTT GCCGGTAACA GGGCAAAATA TTTTATTAAC
AAGGGTTTAA TTATGGCTAT AACTGATGTC TGGGAAGAAG CGGATCTTTA CAATAAATTC
CCCCGGGCTT TCAGGAGTAT AAGTTTTGTT AATGGAAAAG CTTATTTCCT TCCTTATAAC
TGGTACTGGT GGGGAATGTT TTACCGTAAG TCCATCTTTG ATAAGTATGG GCTTGAAGAA
CCCCGGACCT GGGATGAATT TTTAGATGTC TGTGAAACCC TCAAACAAAA CGGTATTACT
CCGATAACAA TCGGGACCAA ATACCGCTGG ACTGCTACTG GGTGGTTTGA TTATCTCAAC
ATGAGGGTTA ATGGGCCCGA ATTTCATATC AGGTTGATGG AAGGTAAAGA GAAATATAAT
GATCCCCGGG TTAAGAAGGT ATTTGAGTAC TGGCGTCAGT TACTGGATAG GGGATATTTT
GTTGACAATG CGGCTGCCTA TTCCTGGCAG GAAGGTGTAA GGTTTATGGT TAAGGGAGAA
GCTGCAATGT ACTTGATGGG TCAGTTTATT CTGGATGCTG TTCCTGAGGA GGTAGCTAAA
GACCTTGACT TTTTCCGCTT CCCTATAATT AATGAAGATG TACCTATTGG GGAAGATACT
CCTACTGATG GGTTTATGAT TCCTAAGAAA GCTAAAAACC CGGAACTTGC TAAAGAATTC
CTCAAGTTCC TGGCTTCCAG AGAAGGGCAG ATGATCTTTA TAGAAAAAAC AGGCCGTATC
GGGGTTAATA ATGAAATTCC AATGGATTCC TACCCGCCTC TAACCCAGAA GGGTGTTAAG
ATGATTCAGG GAACCGATGC CCTGGCCCAG TTCTATGACA GGGATACACC TCCAACTATG
GCTGATAAAG GTATGAACGG ATTAATGAAT TTCTGGGCAT ACCCTGATCA GATAGATAAA
ATTCTTGATA ACCTTGAAAG GCAGAGACAG ATGATTTTTT CAGAACAGGA ATAA
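
A GC content figure such as the one in the gene information can be recomputed from a sequence laid out in space-separated blocks of ten bases. The function below is a minimal sketch. The name `gc_percent` is hypothetical, and the database's own rounding rule is not known, so the result may differ slightly from the listed value.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (block separators and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input: the first block of the gene sequence (2 of 10 bases are G or C).
print(gc_percent("ATGAGTAAAA"))  # prints: 20.0
```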
 
Protein sequence
MSKRVLMVLS LVLVLLVSLS TVSMAKKVLT INSYISDPVP KEAFEDVIKA FEEAHPDIDV 
RVSTTAHEDF KKALRIWLSS DNPPDVITWF AGNRAKYFIN KGLIMAITDV WEEADLYNKF
PRAFRSISFV NGKAYFLPYN WYWWGMFYRK SIFDKYGLEE PRTWDEFLDV CETLKQNGIT
PITIGTKYRW TATGWFDYLN MRVNGPEFHI RLMEGKEKYN DPRVKKVFEY WRQLLDRGYF
VDNAAAYSWQ EGVRFMVKGE AAMYLMGQFI LDAVPEEVAK DLDFFRFPII NEDVPIGEDT
PTDGFMIPKK AKNPELAKEF LKFLASREGQ MIFIEKTGRI GVNNEIPMDS YPPLTQKGVK
MIQGTDALAQ FYDRDTPPTM ADKGMNGLMN FWAYPDQIDK ILDNLERQRQ MIFSEQE
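
The protein sequence can be derived from the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in which codons may initiate translation, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate_cds` are illustrative and do not come from the database.

```python
# Codon table in the conventional T, C, A, G ordering of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (block separators and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    residues = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate_cds("ATGAGTAAAA GGGTATTAAT GGTTTTGAGC"))  # prints: MSKRVLMVLS
```

Passing the full gene sequence should give the 417-residue protein, since the final TAA codon is a stop and adds no residue.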