Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_18820 |
Symbol | |
ID | 7312697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2010220 |
End bp | 2011617 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643612330 |
Product | glycosyl transferase family 39 |
Protein accession | YP_002509626 |
Protein GI | 220932718 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.0706746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAAGA AGGCACGGAT TATATATTCA GTAGTAGTTG GTTTTTCAGC ACTGTATAAT TTTACATTAC CCCTGCACCC GGATGAAGCC TATTACTGGG TCTGGAGCCG GAATTTACAG CTTTCGTATT TTGACCACCC TCCTATGGTG GCCTATTTAA TAAAAATGAT GACTCTTCTG GGGCAACATG AATTTGTTAT CAGGTTGGTT TCAGTATTCT GTGTTGCCGG GGCGGCATAT CTGGTTTACC GTCTTGCTGA AGATATGTTT AACAGCCGGG TGGCTGAAAT AAGTTTAGGT GTTTTCCTGT TTATGCCTCT TGTGCAGTCC GGATTTATTG TAGTTACCCC GGATTCTCCC CTGGTCTTTT TCTGGACATT GACATTTTAC TGTTTTTATA ATTATGTATT CAGGGAAAAG AGGAAATACA TCTATCTGGC AGGTATTGCG GCAGGAATGT TGTTACTTTC AAAATATACC GGGGTTTTAT TACTGGGGAG TTTATTTTTT TATTTAACCT TTTCTAAAAA GAGGAAACTC TTCAAACGAC CTGAATTATA CGCGGCAGGA GTACTGGCCC TTTTGGTGTT TTTACCGGTT ATAATCTGGA ATTATCAGCA TGGCTGGGCT TCTTTTAAAT TTCAGTTTTC CCACGGGGTA GCAGATAAAA AAGTGTTTAA TCCTTCTGCC CTGGGGGAGT TTATTGCAAG TCAGGTTATG GTCTTTAACC CTGTATTTGC TACAGGTTTT CTGATCTTAT TTATTAAAAA TATTAAAGAT GTAATTAAGA ATAAATATTT ATATTTACTT ACCTGGCCTT TTGCCTTTAC CTTACTGTTT TTTGTCTATA ACAGTGCCTT TAAAAAGGCT GAAGCCAACT GGGCAGGCCC GGCTTATATT ACTGCTGCTA TAATCCTGGC TTACTGGATT GACAAACACA AATTAAAAAA ATTCTTTATA GCCGGCCTTA TTATGGCTAT ATTTTTAATA ATTGTTATGA GGTTTCCGGA ATTTATTCCA GGTTATCCAG AACAGGCAGT CCTTAAAAAA GATTTTTATG GACATAATTT TATCTATAGT GAGGCCAGTC AGTACCTGGG TCCGGGGATA ATTTTAAGTG ATAGTTACCA GAATGCTTCA GAGGCCCAGT TTTATTTAAA AGGCAGGCCG GAAGTCTATA TTATTACCCA GACCAGGTAT TCCAATTACA ACCTCTGGAG TAAAGAGGTT AAAGAAAATA TTGAGAATGG AGAGATTAAA GAGGCCATTT ATATCGGGGC CTCTGATAAA AAGGATGAGT TGTTAACATA TTTTGATGAT GTCTATCTCC TCGACAGGAT TAAATATAAC GGAAGGTTTG TTCAGCGCCT TTTTTATGTT TATCGATGTT ACAACTAA
|
Protein sequence | MTKKARIIYS VVVGFSALYN FTLPLHPDEA YYWVWSRNLQ LSYFDHPPMV AYLIKMMTLL GQHEFVIRLV SVFCVAGAAY LVYRLAEDMF NSRVAEISLG VFLFMPLVQS GFIVVTPDSP LVFFWTLTFY CFYNYVFREK RKYIYLAGIA AGMLLLSKYT GVLLLGSLFF YLTFSKKRKL FKRPELYAAG VLALLVFLPV IIWNYQHGWA SFKFQFSHGV ADKKVFNPSA LGEFIASQVM VFNPVFATGF LILFIKNIKD VIKNKYLYLL TWPFAFTLLF FVYNSAFKKA EANWAGPAYI TAAIILAYWI DKHKLKKFFI AGLIMAIFLI IVMRFPEFIP GYPEQAVLKK DFYGHNFIYS EASQYLGPGI ILSDSYQNAS EAQFYLKGRP EVYIITQTRY SNYNLWSKEV KENIENGEIK EAIYIGASDK KDELLTYFDD VYLLDRIKYN GRFVQRLFYV YRCYN
|
| |