Gene Hore_13200 details


Gene Information

Locus tag: Hore_13200
Symbol:
ID: 7314106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1418635
End bp: 1419909
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 42%
IMG OID: 643611760
Product: extracellular solute-binding protein family 1
Protein accession: YP_002509065
Protein GI: 220932157
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
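
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1418635, 1419909)
print(length)                  # 1275
print(protein_length(length))  # 424
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.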


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAAAA AGGGCTTATG CTTGGTACTT AATTTGCTTA TTATTGTATT TGCTTTTACA 
GGTGTTTCAG TTGCTGATAA TGGTAGTTTA ACCGTAATGG CCCAGTGGGG TGGTCAGGAA
CTTGATGCCT TTATGAAAGT TATTGAAAGG TTTGAGGAGA AAACTGGTAT TGACGTTAAA
TATGAAAGCA CCCGTGATAC TTCCACTGTA CTGGTTACCA GAATCCAGGC AGGAAACCCG
CCTGAAGTAT GTGTTATCCC TGACCTTGGT TTGATTAAAG ACCTTGGTCA AGAAGGGAGT
TTAGTTGATC TGAATAAGGT TCTGGATATG GACAGGATTA AAGAAGAATA TAACGATGTC
TGGCTTGACT TAACTACTGT AGATGGCCAC ATGTATGGTC TGGTAATGAC CGCCGATATT
AAAAGTCTTA TCTGGTATAA TCCTAAAGCC TTTAAAGCCA GGGGGTATGA GGTCCCTGGA
ACCCTGGATG AACTGATGAG TTTAACTGAG AGAATGGCCA GAAAGGGAGA TATTCCCTGG
GCTGTAGGAT TAGAATCTGG TCCGGCCAGT GGCTGGCCAG GTACTGACTG GATTGAGGAT
CTTGTCTTAA GGTTGGCAGG CCCTGAAGTA TTTGATAAAT GGATTAACCA TGAGATCCCC
TGGACAGATC CAAGAATAAA AGAGGCCTTT GAGTACTTCG GAAAAATCGT TAAAAACTCA
AAATATGTCT GGGGTGGACC AACCAGTGTC CTAATGACTA ATTTTGGCGA TGCTGTAGCC
CCACTTTATA CTGAACCTCC ACAGGCTTTT ATGCATAAAC AGGCTAGCTT TATTACCAGT
TTCATATTGG AACATAATCC TGACCTTGTG GCCGGTGAAG ATTATGATTT CTTCCCCTTC
CCACCGGCTG AAAAAGGAGA GGGGGTACCT GTCCTCGGGG CTGCTGATAT GGTAAGTATG
CTTAAAGATA CCCCTGAAGC CAGGAAGTTT GTAGACTTTT TATCAACACC TGAAGCCCAG
ACAATCTTTA TCAAAGAACT GGGTAAAATC GGTGTAAACA AAACAATAGA CCTGGCAGTA
TACCCTGATA AGATTACCAG GAAGATGGCC AGAACTCTGT TAAATGCCTC TGTTTTCAGG
TTTGATGGTT CTAATTCAAT GCCGGCAGCT GTAGGTTCTG GTGCTTTTAA CCCGGGTATC
CTGGATTATG TTAGGGGAAA AGACTTAGAT GATGTCTTAA AATCTATTGA AGCTGTAGCT
GAAGAAAACT ATTAA
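
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined into one string before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation that could be compared against the GC content field of the record. The helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence (six blocks of 10 bases).
first_line = "ATGACAAAAA AGGGCTTATG CTTGGTACTT AATTTGCTTA TTATTGTATT TGCTTTTACA"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_content(seq):.1%}")   # GC fraction of this first line only
```

Passing the full sequence text to `clean_sequence` instead of `first_line` gives the value for the whole gene.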
 
Protein sequence
MTKKGLCLVL NLLIIVFAFT GVSVADNGSL TVMAQWGGQE LDAFMKVIER FEEKTGIDVK 
YESTRDTSTV LVTRIQAGNP PEVCVIPDLG LIKDLGQEGS LVDLNKVLDM DRIKEEYNDV
WLDLTTVDGH MYGLVMTADI KSLIWYNPKA FKARGYEVPG TLDELMSLTE RMARKGDIPW
AVGLESGPAS GWPGTDWIED LVLRLAGPEV FDKWINHEIP WTDPRIKEAF EYFGKIVKNS
KYVWGGPTSV LMTNFGDAVA PLYTEPPQAF MHKQASFITS FILEHNPDLV AGEDYDFFPF
PPAEKGEGVP VLGAADMVSM LKDTPEARKF VDFLSTPEAQ TIFIKELGKI GVNKTIDLAV
YPDKITRKMA RTLLNASVFR FDGSNSMPAA VGSGAFNPGI LDYVRGKDLD DVLKSIEAVA
EENY
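
As a consistency check, the gene sequence can be translated and compared with the protein sequence. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code for internal codons. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The `translate` helper is illustrative and is not taken from any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGACAAAAA AGGGCTTATG CTTGGTACTT AATTTGCTTA TTATTGTATT TGCTTTTACA"))
# MTKKGLCLVLNLLIIVFAFT
print(translate("GAAGAAAACT ATTAA"))
# EENY
```

Both results match the beginning and end of the protein sequence shown in the record.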