Gene Information |
Locus tag | Hore_22220 |
Symbol | |
ID | 7313770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 2422006 |
End bp | 2423430 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643612674 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_002509962 |
Protein GI | 220933054 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
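
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates (the convention under which End - Start + 1 reproduces the listed 1425 bp) and a CDS that ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2422006, 2423430)
print(length)                  # 1425
print(protein_length(length))  # 474
```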
|
|
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAATA TACCCCATAA TTATGAGTAT GAGATTGATC TCACTGAATA CCTCGGAATT TTAAATAAAA GAAAATGGCT GATACTGGCT ATAACTATCC TGGCAGGTGT TATGGGTTAT ATCCTTACCT CATACAAACC GGCGGTATTT CAGTCAGATG CCCTGTTAAT GATAGATGAA ACTCCTACTA CTTTAAATCA AATTGAACTA TCCCCTTTTA ACAACAATAA CAGGGATCTT ATCACTTACA GTAAATTGTT AAAAACTAGG AAACTCCTTA AAAAAGTCAG TCAACACTTT GACGAAAATA AGGTTTCAGT TAAGTATTTA ATTACTAACT TAAACATTGA GATTATACCT GATACCAGGT TAATTAAAAT TTCCATAAAA CATACAGACC CGGTAATAGC CCGGCAGATT ATATCATATC TAATAGAAGA ATTTATTATA AATAATAGGA ATTTAAAAAA ATCAGCAACG GTTAATGCCC GAAACTATGT AGCCCGGCAA CTGGAGAAGG TCTCCAGGGA TTTAAAAAAT ATTGAGGCAG AAGTCAGACA TTATAAAGAA GAAAACAAAA GCCTTGTTCT CTCTAATTTT ACCCAGAAAA TTATGCAATC AATGCTGGAC CTGGAAGAAA ATCTGGCAGA AGTGGAAATC AATATAAAAA CACATCGGGC TGCCCTGAAC CACTTATATA ACAAATTACA GTCGTCCCGG GAGCTTATAT TATCATCTAA AACCCTTTCC AAAAACCCTG CTTATACAGA ACTTAAAAAT AGACTGACAG AACTGGAAAT CGAGCTAGAC TCTTTGACAA CAGTCTATAC GGACAAACAC CCTGAGATTA TTAAATTAAA AGCTGAGAAA AAATCAATCC TTAACGAAAT TTCTAATACT CTTGGTGAAG TTATAACCTC AACCATCTAT ACGACAAACC CTGTTTATAA TAATCTAAAA CAGGAATTAG TCAAACTGGA AACAGAGTTA ACCTCCCTGA AAGCGCAAAA AGAGTCTCTG AATATCCAGT TTCAAAAGAT TAAAGCAAAA ACAGAGCAGT TACCCAAAAA AGAACTGGAG TACTCAAGGT TACTCAGGAA ACTGGAGGTT TCTGAAAAAC TATATACCAT GCTTTTGACC AGATACCAGG AATTAAAGAT AACTGAGGCC ATGAAGGTCT CTGATATTAT TACAGTCGAT CCCCCGGTCG TGCCCGAAAG CCCGGTCGGA CCCAATATGA AATTAAACCT TGCCATTGCC ATTATAATGG GTCTCTTTGT TGGTGTCTTT ATAGCTTTTA TTCTTGAATT TATAAATAAT ACTATTCAGC GGGTTGAAGA AATTGAGGAA ATAACAGATG TACCAATAAT AGGTTATATT CCTTATATAG ACAAAAAAAA TGATGGACGT GATCATAATA ATTAA
|
Protein sequence | MDNIPHNYEY EIDLTEYLGI LNKRKWLILA ITILAGVMGY ILTSYKPAVF QSDALLMIDE TPTTLNQIEL SPFNNNNRDL ITYSKLLKTR KLLKKVSQHF DENKVSVKYL ITNLNIEIIP DTRLIKISIK HTDPVIARQI ISYLIEEFII NNRNLKKSAT VNARNYVARQ LEKVSRDLKN IEAEVRHYKE ENKSLVLSNF TQKIMQSMLD LEENLAEVEI NIKTHRAALN HLYNKLQSSR ELILSSKTLS KNPAYTELKN RLTELEIELD SLTTVYTDKH PEIIKLKAEK KSILNEISNT LGEVITSTIY TTNPVYNNLK QELVKLETEL TSLKAQKESL NIQFQKIKAK TEQLPKKELE YSRLLRKLEV SEKLYTMLLT RYQELKITEA MKVSDIITVD PPVVPESPVG PNMKLNLAIA IIMGLFVGVF IAFILEFINN TIQRVEEIEE ITDVPIIGYI PYIDKKNDGR DHNN
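
As a spot check that the protein sequence follows from the gene sequence, the sketch below translates the first ten codons. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and it builds the codon map by hand rather than relying on a library. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codon-to-amino-acid map in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGATAATA TACCCCATAA TTATGAGTAT"))  # MDNIPHNYEY
```

Passing the full gene sequence to `translate` in the same way allows the result to be compared against the complete protein sequence field.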