Gene Hore_18420 details

Gene Information

Locus tag: Hore_18420
Symbol:
ID: 7313840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1967509
End bp: 1968459
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 41%
IMG OID: 643612289
Product: Substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_002509586
Protein GI: 220932678
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein)
TIGRFAM ID:
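
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the coordinates are 1-based and inclusive and that the CDS length includes the stop codon, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1967509, 1968459)
print(length)                     # 951
print(protein_length_aa(length))  # 316
```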


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00000366323
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAAAAA GAAGGTTTAT TTTAGTTTTA GCTGTGTTGT TACTGGGAAT AGTTCTTCTG 
GCTGGATGTG GTGATAAGCA GGAAGAAGGA CAGGCTGTTA TTAAAATGGG GACCAATGTT
GAATTTTTAA ATAGAGATGA TGGTATTCCT GGACTGGAAA AAGCCTATGG ATTTAAATTT
GACCGGGATG CCTTAACAAC CATGAAAACA GGGTTAACAT ATGATGCTTT AAGGAATAAC
AAACTAGATG TAGCCATGGG TTTTGCCACA GATGGGCGGA TTGCTGCTTT TGACCTTGTC
TCACTGGAAG ATGATAAAAA CTATTTTCCG GTCTATAATC CAGCTCCGAC AATCAGAAAG
GAAATTCTGG ATGAATACCC TGAACTGGCT GATGTAATTA ATAAACTTCC ACCTGTTCTG
GACCAAAAAA CCCTGACTAA CTTAAACAAA GAAGTAGATG TTGACGGTAA AGACCCCGAA
GAGGTTGCAC AAAAATTCCT TAAAGAGAAG GGACTTCTTC CCGATGAACC AGAGCTCAAA
CAGGGGCCTG CCATAACAGT AGCTTCCAAG ATATTTACCG AGCAGCTTAT TTTAGGTCAC
ATGTTAATTG ATCTCCTAAA AGCCCATGGT TATCCTGTTG AGGATAGGAC AAGCCTGGGA
GGCACTCCGG CCCTCCGTAA GGCTCTGGAA TCAGGTCAGA TTGATGCCTG CTGGGAATAT
ACCGGGACTG TTTTAATGAC CGTAATGAAA GAAGACGAGA TTACCCAGTC CGACGAAGCA
TACCAGAAGG TTAAGAAATG GGATGCTGAG GCTAATAACA TTATCTGGTT AGATTACGCC
CCTGCCAATA ATACCTATAC TTTGCTGATG ACCAGGAAAA AGGCTGAAAA GCTTGGTATA
AAAACAATTT CTGATCTGGC AAGTTATATA AATGGGGAAG AGAATAAGTA A
 
Protein sequence
MVKRRFILVL AVLLLGIVLL AGCGDKQEEG QAVIKMGTNV EFLNRDDGIP GLEKAYGFKF 
DRDALTTMKT GLTYDALRNN KLDVAMGFAT DGRIAAFDLV SLEDDKNYFP VYNPAPTIRK
EILDEYPELA DVINKLPPVL DQKTLTNLNK EVDVDGKDPE EVAQKFLKEK GLLPDEPELK
QGPAITVASK IFTEQLILGH MLIDLLKAHG YPVEDRTSLG GTPALRKALE SGQIDACWEY
TGTVLMTVMK EDEITQSDEA YQKVKKWDAE ANNIIWLDYA PANNTYTLLM TRKKAEKLGI
KTISDLASYI NGEENK
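
The relationship between the gene sequence and the protein sequence above can be checked with a short script. The sketch below is an assumption-laden illustration rather than the database's own method: it builds the amino acid assignments of NCBI translation table 11 (which match the standard code) in plain Python, and the helper names `clean`, `translate` and `gc_content` are hypothetical. Only the first three and last two 10-base blocks of the gene sequence are used here; the full sequence can be pasted in the same way, since whitespace is stripped.

```python
BASES = "TCAG"
# Amino acids for translation table 11, in TCAG codon order ('*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGTAAAAA GAAGGTTTAT TTTAGTTTTA"))  # MVKRRFILVL
# Last 12 bases, ending in the TAA stop codon.
print(translate("GAGAATAAGTAA"))                      # ENK
```

Applying `gc_content` to the full gene sequence should give a value that can be compared with the GC content reported in the Gene Information section.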