Gene Information |
Locus tag | Hore_23010 |
Symbol | |
ID | 7313053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2513151 |
End bp | 2514461 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643612753 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002510041 |
Protein GI | 220933133 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
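
The length fields above follow from the coordinates if Start bp and End bp are read as 1-based, inclusive positions on the replicon. That reading is an assumption, though the listed values are consistent with it. A minimal sketch of the arithmetic, with hypothetical variable names:

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
start_bp = 2513151
end_bp = 2514461

# Inclusive span on the replicon, in base pairs.
gene_length = end_bp - start_bp + 1

# One amino acid per codon, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1311 436
```

Both results match the Gene Length and Protein Length fields of the record.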
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00000978171 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTAAAA AAAGTAAAAT TATTGCCGGA AGCTTAGTAG CGCTTTTCTT AATGGTTGTA ATCTTCGGTG GTGTTGGTCT GGCTAAAGAA AAGGTTAAAC TGACAGCAAC TTTTGCAGCA CCCAAGGAAC GCTGGGACTG GCTTGTTGAT AAAGCCTTAC CTGTTTTAAA GGCAAATCAC CCTGAACTTA ATATAGAGTT TGAATATGAA GTGTTACCAT ATGATAAAAC CCATGATAAG TTAATTACCA TGATGATAGC AAATACCCCT AGGGACTTAG TTTCAGTAGA TGGCATCTGG CTTGGAGAGT TTGCCCAGGG TGGTTTACTT AAAGATATTA CCGAAGAAGT AAAAGAATGG GGCAGGATGG ATGAATATTA TGCGGTTAAT CGTGAGGGTA GCAAATATAA TGGCAAATAT TACGGTATCT GGTCCTGGAC AGATGCCAGG GTTCTGTGGT ACTGGCCTGA TTTATTAGAA AAGGCCGGGG TTGAACCGGA AGATTTAACT ACCTGGGATG GTTATATAGC TGCAGCCGAG AAATTAAATA ATACATTACA GAATGAAGGT ATTGAAGGTG TTCACCTGGT CGGGGCCCCT CATTCACCTG ACATGTTTTT CCCTTATCTC TGGATGAATG GAGGTAAAAT TCTTGAAAAA CGTGATGGCA AGTGGTATCC TGCCTTCCAT AAAGAAGCCG GTATTAAAGC CCTGACCTTT ATTAAGAGGC AGGTTGAGGC CGGTATTAAG CCACAGAAAC AGCACTTCTG GGGCCAGGAG TTTGCCGACA AGAGGTATGC GGTTATGTTA GAAGGAAGCT GGTTAGCCGG TAAGTTTTCT AAAAATATAA CAAAAGAGGA ACTGGAAAAT AAAATTGGTA TGTTACCATT ATTCCCTACT CCCTCAGAAG AGGTTGATAC TGCAACCATG GCTGGAGGAT GGGTTCTGGC AATACCAAAA ACAAGCCGGC ATCAGGACCT TGCCTGGGAA CTGATGGAGA TTATCCAGTC TCCTGAAATT ATGAGTCAAT TCCTGGCTAA ATTTGGTTAC TTACCAACCC AGCGGGTTAT TGCTGAAAAT CCTGAATATA ATAAAGTACT TATAGAAAGT ATTCCTTTCT TTGATAAATA TACTAAAATA CTGCCACTGG CCCATGGTAG GCCTAATATT CCTGAGTATC CCCAAATATC TGAAGCTTTA AGAATTGCTA TTGAAGAAGT TTATTACCGT GGTGCTGACC CTGAAGTAGC TTTAACTAAA GCCGCCCAGA AAGTAGCCCG TATTCTGGGT TGGCCTGGTC TGGTAGATTA A
|
Protein sequence | MLKKSKIIAG SLVALFLMVV IFGGVGLAKE KVKLTATFAA PKERWDWLVD KALPVLKANH PELNIEFEYE VLPYDKTHDK LITMMIANTP RDLVSVDGIW LGEFAQGGLL KDITEEVKEW GRMDEYYAVN REGSKYNGKY YGIWSWTDAR VLWYWPDLLE KAGVEPEDLT TWDGYIAAAE KLNNTLQNEG IEGVHLVGAP HSPDMFFPYL WMNGGKILEK RDGKWYPAFH KEAGIKALTF IKRQVEAGIK PQKQHFWGQE FADKRYAVML EGSWLAGKFS KNITKEELEN KIGMLPLFPT PSEEVDTATM AGGWVLAIPK TSRHQDLAWE LMEIIQSPEI MSQFLAKFGY LPTQRVIAEN PEYNKVLIES IPFFDKYTKI LPLAHGRPNI PEYPQISEAL RIAIEEVYYR GADPEVALTK AAQKVARILG WPGLVD
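
The gene sequence above is shown in coding orientation (it starts with ATG and ends with the stop codon TAA), so it can be translated directly without reverse-complementing. The following is a minimal sketch, not the database's own method: it builds a codon table from the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and translates only the first 30 bases of the record. The names `translate`, `CODON_TABLE`, and `prefix` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; the amino acid string follows the
# same order and holds the assignments of NCBI translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that split the sequence into blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGCTTAAAA AAAGTAAAAT TATTGCCGGA"
print(translate(prefix))  # MLKKSKIIAG
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should reproduce the complete protein, since the function stops at the terminal TAA.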