Gene Information |
Locus tag | Hore_19250 |
Symbol | |
ID | 7312740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 2058283 |
End bp | 2059617 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643612371 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002509667 |
Protein GI | 220932759 |
COG category | |
COG ID | |
TIGRFAM ID | |
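The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based, inclusive coordinates (this matches the listed 1335 bp) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2058283, 2059617)
print(length, protein_length(length))  # 1335 444
```

The same check is only valid for records where "Is gene spliced" is No, since a spliced gene's span would include non-coding sequence.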
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00108172 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGAAA AATCCACCCG GAGGAAGCTA TTACATAACA AAGACTTTAT TCTCCTCTTT CTGGGAGGAT TTGTCTCCCG GATAGGTAGT AAAGTACACT ATGTTGCTAT GACCTGGTTT GTTTTAAAAC TGACCGGAAG TGGTACCGCA GCCGGAACCG TGTTATTACT GGCAACTCTC CCCGGAGCTA TTTTAGGGCC TGTTGGTGGA GTAATAGCAG ACAGAATTAA CCGTAAACTT ATAATTGTCA GTATGGATAC CGTCCGGGGG CTGATTGTAA TCTGGCTTAG CTGGACCGTC TATAACGGAA CCGCTGGCTT TTATCACATC TGTATTGCCA CCTTTCTGGT CGCCCTGAGT GGCACCTTTT TCAATCCCGC AGTAACGGCA TCTATACCCA ATATAGTTGA AAAACATAAT TTACAGAAGG CAAATTCCCT CGAACACTTA AGCTTTCAGG GGACTTCCAT TATCGGAGCT GCTACCGGAG GGATTCTTAT TGCTATCTTT GGGGTAGCCG GGGTCTTCTT AATTGACGGG ATAAGTTATT TAATCTCAGC CTTCTCCGAG TTATTTATTA ATATTCCCCC TGTGAAGAGA GAAGAACAAT CCGGGGATAA CGGGGAATTG AGTAAATTTA CTATTCTGTA CAATGATCTC AGGGAAGGAG CCCGATACCT TTACTCCAAC AAACCCCTGT TTACCCTGTT CAGTATATCT ATTATTATTA ACTTCCTTTT TGCCGGGGCT ATGGCAGTCG GGATTCCCTA TGTGTTTAAA GAAGTCCTAC AGGTAAACAG TAAGTTATTC GGCCTGGCCC AGTCCTTCTT TCCAGCTGGG GCTATCCTGG GAGCAGTTAT TATGAACTTT TTACCACCAG TTAAAAATTT TTTCAGGACC CTGTTTACAG GGATTACCTT TCAAACAATA CTTCTGGCGG CAATCGGTTT ACCTATTTCC CCTTTCATGG TAGATAAATA CCCGGTTATA AGTTTATTCA TACTCATGGC CGTTATTCTC ATTCTTTTCG GGGCCTTCAA TGCCTATACC AATATCCCCA TTAATACTAT GCTCCAGAGG TTAATAGATG ACAGGGTAAG GGGTAGGGTT TTCGGCCTTC TGGCCACACT AAATATGGGG TTAGTACCGG TTTCCATGTG GGCAGCAGGG TGTCTTCTTG ATGCCTTCCC GGCCTATCTC CTGTTTGTGG GAGCAGGGGG AATTATGGTC GGTGTACTTG CCTATAGTGT ATCCCTCCCT ACCCTCAAAC CATTAAAAAA TGAAGTCTAT ATTGATAAAA GAGAGAACCC GGCAGAATAC TCTGCCGGGG TATAA
|
Protein sequence | MDEKSTRRKL LHNKDFILLF LGGFVSRIGS KVHYVAMTWF VLKLTGSGTA AGTVLLLATL PGAILGPVGG VIADRINRKL IIVSMDTVRG LIVIWLSWTV YNGTAGFYHI CIATFLVALS GTFFNPAVTA SIPNIVEKHN LQKANSLEHL SFQGTSIIGA ATGGILIAIF GVAGVFLIDG ISYLISAFSE LFINIPPVKR EEQSGDNGEL SKFTILYNDL REGARYLYSN KPLFTLFSIS IIINFLFAGA MAVGIPYVFK EVLQVNSKLF GLAQSFFPAG AILGAVIMNF LPPVKNFFRT LFTGITFQTI LLAAIGLPIS PFMVDKYPVI SLFILMAVIL ILFGAFNAYT NIPINTMLQR LIDDRVRGRV FGLLATLNMG LVPVSMWAAG CLLDAFPAYL LFVGAGGIMV GVLAYSVSLP TLKPLKNEVY IDKRENPAEY SAGV
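The protein sequence is the translation of the gene sequence under translation table 11. Table 11 (bacterial, archaeal and plant plastid code) uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts, so a standard codon table is sufficient for a minimal sketch. The example below strips the spaces between the 10-base blocks and translates the first three blocks of the gene sequence; `translate` is an illustrative helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGATGAAA AATCCACCCG GAGGAAGCTA"))  # MDEKSTRRKL
print(translate("TCTGCCGGGG TATAA"))                  # SAGV
```

The two fragments are the first 30 and last 15 bases of the gene sequence; their translations match the first block and the final four residues of the listed protein sequence.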