Gene Hore_00380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_00380 
Symbol 
ID7314255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp42408 
End bp43676 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content42% 
IMG OID643610455 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002507794 
Protein GI220930886 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAATA AATCTTTAAT TGTTTTACTG GTAGTAAGCC TGTTACTGGT TTGTTATGGG 
ACAGGATTGG CTAAAACCAC TGTCAGGATT GCTGGCTTTG GGGGGAATGA CCAGGTAATT
GTTGAGGAGC TCCTTAATAA ATTTGTCAGG CCAGAGCTGG CGGATGAAGG TATTGAAATT
ATTTACGAGC CTATTGCTGA TGATTATCAA AGGTACCTTT TGAATTCGCT TTCTGCCGGT
ACTGCTGCAG ACCTGTTTTA TATGGATATT TTCTGGGCTA AAAATGTTAT AAAAGAAGGT
CTGGTTGAAC CGCTTGATAG CTATCTGGCT AAATCTGAAG TTATCAGCAA AGAAGACATT
GTGCCCAGCT TACTAGAAGG TTTTACCTAT GAAGGCAAAT TGTATGGAAT CCCCAAGGAT
TTCAACTCTC TGGCCCTGTT TTACAATAAG GACCTTTTTG ATATAGCAGG AATACCTTAT
CCCAATGAGG CCGATACCTG GAAAACCTTA GAATATAAAT TAAGGAAAGT GGTTGAGTTT
TTTGAAAAAG AAGGAGAAGA AATTCATGGA TTGGCATTAC AACCTGAGTA TGCCAGGATG
GGTGCCTTTG CTTATGCTGC TGGATGGGAA CCTTTTGTAA ATGGAAAAAC AAATCTCCAG
GACCCCAAAT TTGTCAAAGC ATTTAAATGG TATACCGGAT TAAAAGAAAA AGGGTTAGGT
ATTATGCCGG CTGATATTGG CCAGGGCTGG GGCGGTGGCG CCTTTGCTAA TGGTAATTTT
GCTGCCTGCC TCGAAGGAGC CTGGATTATT GGATTCCTGC GTGATCAGGC ACCAAACCTG
AATTATGGTG CTACCTTGCT ACCGAAATGC TCAGATACCG ATGAAAGAGG TAACTTTATC
TTCACTGTTG CCTGGGGTAT AAATGCTAAC TCAAAGAATA AAGAAGCCGC TTTCAGGGTT
TTAGAAACAC TGACAAGTCC TGAAGCCCAG CAATGGGTTC TGGAAAGGGG TCTTGCCATT
CCCAGCCGGA AATCACTGGC TGACAATCCG TACTTTGAAA AGCAGACCAA GGAAGCCCAG
GCCAATAAAG TTGTCTTCAT GGGTGCGTCA AGAGGAAATA TTAAACCCTA TAGTTTCAGG
GATTATGGTG GAGAATGGAT GGAACCAATC AATACTGCTT TAAATGAAGT AATGAGTGGA
CAGTCAACAG TAGAAGAAGC ATTAAAAACT GCCCAGGAAA GACTTGAACA GGACATAATG
AATAAATAA
 
Protein sequence
MRNKSLIVLL VVSLLLVCYG TGLAKTTVRI AGFGGNDQVI VEELLNKFVR PELADEGIEI 
IYEPIADDYQ RYLLNSLSAG TAADLFYMDI FWAKNVIKEG LVEPLDSYLA KSEVISKEDI
VPSLLEGFTY EGKLYGIPKD FNSLALFYNK DLFDIAGIPY PNEADTWKTL EYKLRKVVEF
FEKEGEEIHG LALQPEYARM GAFAYAAGWE PFVNGKTNLQ DPKFVKAFKW YTGLKEKGLG
IMPADIGQGW GGGAFANGNF AACLEGAWII GFLRDQAPNL NYGATLLPKC SDTDERGNFI
FTVAWGINAN SKNKEAAFRV LETLTSPEAQ QWVLERGLAI PSRKSLADNP YFEKQTKEAQ
ANKVVFMGAS RGNIKPYSFR DYGGEWMEPI NTALNEVMSG QSTVEEALKT AQERLEQDIM
NK