Gene Hore_14550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_14550 
Symbol 
ID7313995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1546129 
End bp1547427 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content44% 
IMG OID643611895 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002509199 
Protein GI220932291 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000000299418 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAAG GTTTAACCTT AGTTTTAACC CTGGTATTGG TTTCTGTAAT GGCTTTTGCC 
TCCACAGCTT TAGCTGCTGA CAAGACTTTA ACAATTATGG GAGTCTGGGG AGGCCATGAA
AGGGATGCCT TTGAAAAGGT TATTGAAACT TTTGAAACGG CAACCGGAAT TGATGTTCAG
TTTGAAGGAA CAAGGGACCT GCCCACTCTG TTAACAACAC GTCTGGAAGC AGGAAACCCG
CCTGATATTG TTGCCCTTCC CAATCCCGGT AATATGAAAG AACTTGCTGC TGAAGGTCAT
CTGGTTGACC TGAGGAAAGT TCTTGATATG GATACCTTAA GGGAAGATTA CGGACAGACA
TGGATTGACT TAGGTTCCTA CAATGATGGT CTATATGGAA TTTTTATTTC TGCAGATGTT
AAGAGTTTAG TCTGGTATAA TCCGAAACAG TTTGAAGCTA AAGGTTATGA TATTCCCAAG
ACCTGGGATG AGATGGAAAG ATTAATGAAC AACATGGTTG CTAAAGGTGA TATCCCATGG
TCTATCGGTC TGGAATCCGG TGCTGCCAGT GGCTGGCCTG GAACTGACTG GATCGAAGAC
ATTATGTTAA GAACAGCCGG TCCTGAAGTT TATGACCAGT GGGTAAATCA CGATATTCCC
TGGACTGACG AAAGGGTTAA AAAAGCCTTT GAAATTTTTG GTAAAATTGC CCGTAATCCT
AAATTCACCT GGGGAGGACC TACTGCTGTA TTAGCTACTA ACTTTGGTGA TGCTGCTAAC
CCACTGTTCA CCAATCCTCC ACAGGCATAT ATGCACCGTC AGGCTAGCTT TATCACTGGA
TTTATTACAG ATAATAATCC AGACCTTGTT GCCGGTAAAG ACTACAACTG CTTCATTCTT
CCCCCAATCA ACGAAGAAGT AGGGACTCCG GTTCTTGGTG CTGCTGATAT GATGGGTATG
ATTAATGATA CTCCTGAAGC CAGAGCTTTC ATGAGATATC TCGCCTCTCC TGGAGCCCAG
ATGGTCTGGA TTGGGGCTGT TGGTAGTAAA ATCGGTATCA ACAAACGGAT TGACCTCAAT
GTATACTCCA GTGAGTTAAT GAAGAATATC GCTAAAGGAT TAAGGGAAGC AGATGTATTC
AGGTTTGATG GTTCTGACCT GATGCCCAAG GCTGTTGGTT CTGGTGCCTT CTGGCAGGGT
GTAATGGATT ATGTTGGAGG TCAGGATCTT GACAGTGTTC TGGAACATAT CGAATCTGTT
GCTGATGATG CCTACGATTC CGGAAAAACT ACAGACTAA
 
Protein sequence
MKKGLTLVLT LVLVSVMAFA STALAADKTL TIMGVWGGHE RDAFEKVIET FETATGIDVQ 
FEGTRDLPTL LTTRLEAGNP PDIVALPNPG NMKELAAEGH LVDLRKVLDM DTLREDYGQT
WIDLGSYNDG LYGIFISADV KSLVWYNPKQ FEAKGYDIPK TWDEMERLMN NMVAKGDIPW
SIGLESGAAS GWPGTDWIED IMLRTAGPEV YDQWVNHDIP WTDERVKKAF EIFGKIARNP
KFTWGGPTAV LATNFGDAAN PLFTNPPQAY MHRQASFITG FITDNNPDLV AGKDYNCFIL
PPINEEVGTP VLGAADMMGM INDTPEARAF MRYLASPGAQ MVWIGAVGSK IGINKRIDLN
VYSSELMKNI AKGLREADVF RFDGSDLMPK AVGSGAFWQG VMDYVGGQDL DSVLEHIESV
ADDAYDSGKT TD