Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_18420 |
Symbol | |
ID | 7313840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 1967509 |
End bp | 1968459 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643612289 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_002509586 |
Protein GI | 220932678 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00000366323 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAAAA GAAGGTTTAT TTTAGTTTTA GCTGTGTTGT TACTGGGAAT AGTTCTTCTG GCTGGATGTG GTGATAAGCA GGAAGAAGGA CAGGCTGTTA TTAAAATGGG GACCAATGTT GAATTTTTAA ATAGAGATGA TGGTATTCCT GGACTGGAAA AAGCCTATGG ATTTAAATTT GACCGGGATG CCTTAACAAC CATGAAAACA GGGTTAACAT ATGATGCTTT AAGGAATAAC AAACTAGATG TAGCCATGGG TTTTGCCACA GATGGGCGGA TTGCTGCTTT TGACCTTGTC TCACTGGAAG ATGATAAAAA CTATTTTCCG GTCTATAATC CAGCTCCGAC AATCAGAAAG GAAATTCTGG ATGAATACCC TGAACTGGCT GATGTAATTA ATAAACTTCC ACCTGTTCTG GACCAAAAAA CCCTGACTAA CTTAAACAAA GAAGTAGATG TTGACGGTAA AGACCCCGAA GAGGTTGCAC AAAAATTCCT TAAAGAGAAG GGACTTCTTC CCGATGAACC AGAGCTCAAA CAGGGGCCTG CCATAACAGT AGCTTCCAAG ATATTTACCG AGCAGCTTAT TTTAGGTCAC ATGTTAATTG ATCTCCTAAA AGCCCATGGT TATCCTGTTG AGGATAGGAC AAGCCTGGGA GGCACTCCGG CCCTCCGTAA GGCTCTGGAA TCAGGTCAGA TTGATGCCTG CTGGGAATAT ACCGGGACTG TTTTAATGAC CGTAATGAAA GAAGACGAGA TTACCCAGTC CGACGAAGCA TACCAGAAGG TTAAGAAATG GGATGCTGAG GCTAATAACA TTATCTGGTT AGATTACGCC CCTGCCAATA ATACCTATAC TTTGCTGATG ACCAGGAAAA AGGCTGAAAA GCTTGGTATA AAAACAATTT CTGATCTGGC AAGTTATATA AATGGGGAAG AGAATAAGTA A
|
Protein sequence | MVKRRFILVL AVLLLGIVLL AGCGDKQEEG QAVIKMGTNV EFLNRDDGIP GLEKAYGFKF DRDALTTMKT GLTYDALRNN KLDVAMGFAT DGRIAAFDLV SLEDDKNYFP VYNPAPTIRK EILDEYPELA DVINKLPPVL DQKTLTNLNK EVDVDGKDPE EVAQKFLKEK GLLPDEPELK QGPAITVASK IFTEQLILGH MLIDLLKAHG YPVEDRTSLG GTPALRKALE SGQIDACWEY TGTVLMTVMK EDEITQSDEA YQKVKKWDAE ANNIIWLDYA PANNTYTLLM TRKKAEKLGI KTISDLASYI NGEENK
|
| |