Gene Hore_00780 details

Gene Information

Locus tag: Hore_00780
Symbol:
ID: 7314296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 90301
End bp: 91536
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 42%
IMG OID: 643610496
Product: extracellular solute-binding protein family 1
Protein accession: YP_002507834
Protein GI: 220930926
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
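
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length is consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 90301
end_bp = 91536

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be a stop.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1236
print(protein_length)  # 411
```

Both results agree with the Gene Length (1236 bp) and Protein Length (411 aa) fields.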


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.0480859
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAAAC GTATCATTTC TATTTTAACA GTTGGCTTGT TATTAGTGGC AATGTTAACT 
GGAACGGTAA TGGCCGGGAA AGTTAAATTA CGCTTTTTGC AGCCCGGGGG AAATTTATAT
AAGCAGAGTG TTGAGTTTGC CAAGGAGTAT ATGAAACTAC ACCCCAATGT AGAGATAGAA
GTAATTGAGG TTGGCTGGAG TGATGCCTAT TCCAAGATAA TGACCATGGT GGCCGCCGGA
AATGCCCCGG ATATTATGTA TATTGGAACC AGGTGGATAC CGGCTTTGGC CCAGATGAAT
GCTATTCAGC CCCTTGATAA ATTTATCAGT GAAGAGAAGA AGGACCTGTA TTTTGATTCT
CTGTTAAAAG GTACCTATTA CCAGGGAAAA CTCTATGCTT TACCACGCTC TTTTTCTACA
AAAGCCTTAA TTTACCGGAC TGATTTAATC CCGGAACCAC CTGAAACCTG GGATGAGCTG
GTTGAGGTTG CCAAAAGGGT TCAAAAGGAA CATGAAGGTA TATATGGTTT TGGAATAGCA
GGAGCAAAAC ATGTTTCTAC CACTACCCAG TTTTTTAATT ATGTCTATCA GAATGGTGGC
TCAATCTTCG ATAGTGAGGG AAATATTTTG CTCGATAGTC CCCAGTCAGT TAAGGCCCTT
CAGTTTTATG TAGATCTTTA TCGTAAACAT AAAGTGGTTC CCAATCCTAT TGAATATAAC
CGTGAAGAAC TACCGAACCT CTTTAAGACC GGGAAAATAG CCATGTTTGT CTGTGGTCCC
TGGGCCAAAC CAATGATTGG ACTTGATCCT GATAATGAAA AAGTACCTTA TGCCAGTGCT
CCCCTGCCCC GGGGAAGGTA TATGGCAACT ACCCTTGTTT CTGATTCCCT GGTATTATCT
TCCCAGAGTG AACATATTGA TGAAGCCTGG AAGTACTTAA ACTGGATAAC CAGCCTGGAG
AACCAGAAAA AACATGACCT TATTAATGGA ATGGCTCCGG CTATGGAAAA AGAACTTGAA
GACCCGGCAT TTACAGAGGA TCCTTTTTTC AAAACATATG TTGATATGAT TCCTAAAGGT
CAGCCCCAGC CTCTACCTCT GGCCTGGGAA CCTTTCCAGG ATGTAATCAC CGGGGCTATT
CAAAAGGCTT TACTCGGAAT GGCAACACCT GAAGAAGCCC TAAAGGAAGC AGTTACCAGG
ATTGAAGCTG AAAATCTGGC ACCGGTTAAA CACTAA
 
Protein sequence
MSKRIISILT VGLLLVAMLT GTVMAGKVKL RFLQPGGNLY KQSVEFAKEY MKLHPNVEIE 
VIEVGWSDAY SKIMTMVAAG NAPDIMYIGT RWIPALAQMN AIQPLDKFIS EEKKDLYFDS
LLKGTYYQGK LYALPRSFST KALIYRTDLI PEPPETWDEL VEVAKRVQKE HEGIYGFGIA
GAKHVSTTTQ FFNYVYQNGG SIFDSEGNIL LDSPQSVKAL QFYVDLYRKH KVVPNPIEYN
REELPNLFKT GKIAMFVCGP WAKPMIGLDP DNEKVPYASA PLPRGRYMAT TLVSDSLVLS
SQSEHIDEAW KYLNWITSLE NQKKHDLING MAPAMEKELE DPAFTEDPFF KTYVDMIPKG
QPQPLPLAWE PFQDVITGAI QKALLGMATP EEALKEAVTR IEAENLAPVK H
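
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds a codon table from the amino acid assignments of NCBI translation table 11 (the table named in the Gene Information fields; its codon-to-amino-acid assignments match the standard code), removes the 10-base grouping used in this record, and translates the first and last lines of the gene sequence. The helper names `clean` and `translate` are hypothetical and introduced only for this example; alternative start codons, which table 11 permits, are not handled because this gene starts with ATG.

```python
# Codons are enumerated with each base cycling through T, C, A, G
# (the ordering used in NCBI genetic code listings).
BASES = "TCAG"
# Amino acid assignments for translation table 11, in that codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGTCAAAAC GTATCATTTC TATTTTAACA GTTGGCTTGT TATTAGTGGC AATGTTAACT"
last_line = "ATTGAAGCTG AAAATCTGGC ACCGGTTAAA CACTAA"

print(translate(clean(first_line)))  # MSKRIISILTVGLLLVAMLT
print(translate(clean(last_line)))   # IEAENLAPVKH
```

The two outputs correspond to the first 20 and last 11 residues of the protein sequence listed above. The last line can be translated on its own because the 20 preceding lines hold 60 bases each, so it begins on a codon boundary.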