Gene Hore_18820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_18820 
Symbol 
ID7312697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2010220 
End bp2011617 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content38% 
IMG OID643612330 
Productglycosyl transferase family 39 
Protein accessionYP_002509626 
Protein GI220932718 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.0706746 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAAGA AGGCACGGAT TATATATTCA GTAGTAGTTG GTTTTTCAGC ACTGTATAAT 
TTTACATTAC CCCTGCACCC GGATGAAGCC TATTACTGGG TCTGGAGCCG GAATTTACAG
CTTTCGTATT TTGACCACCC TCCTATGGTG GCCTATTTAA TAAAAATGAT GACTCTTCTG
GGGCAACATG AATTTGTTAT CAGGTTGGTT TCAGTATTCT GTGTTGCCGG GGCGGCATAT
CTGGTTTACC GTCTTGCTGA AGATATGTTT AACAGCCGGG TGGCTGAAAT AAGTTTAGGT
GTTTTCCTGT TTATGCCTCT TGTGCAGTCC GGATTTATTG TAGTTACCCC GGATTCTCCC
CTGGTCTTTT TCTGGACATT GACATTTTAC TGTTTTTATA ATTATGTATT CAGGGAAAAG
AGGAAATACA TCTATCTGGC AGGTATTGCG GCAGGAATGT TGTTACTTTC AAAATATACC
GGGGTTTTAT TACTGGGGAG TTTATTTTTT TATTTAACCT TTTCTAAAAA GAGGAAACTC
TTCAAACGAC CTGAATTATA CGCGGCAGGA GTACTGGCCC TTTTGGTGTT TTTACCGGTT
ATAATCTGGA ATTATCAGCA TGGCTGGGCT TCTTTTAAAT TTCAGTTTTC CCACGGGGTA
GCAGATAAAA AAGTGTTTAA TCCTTCTGCC CTGGGGGAGT TTATTGCAAG TCAGGTTATG
GTCTTTAACC CTGTATTTGC TACAGGTTTT CTGATCTTAT TTATTAAAAA TATTAAAGAT
GTAATTAAGA ATAAATATTT ATATTTACTT ACCTGGCCTT TTGCCTTTAC CTTACTGTTT
TTTGTCTATA ACAGTGCCTT TAAAAAGGCT GAAGCCAACT GGGCAGGCCC GGCTTATATT
ACTGCTGCTA TAATCCTGGC TTACTGGATT GACAAACACA AATTAAAAAA ATTCTTTATA
GCCGGCCTTA TTATGGCTAT ATTTTTAATA ATTGTTATGA GGTTTCCGGA ATTTATTCCA
GGTTATCCAG AACAGGCAGT CCTTAAAAAA GATTTTTATG GACATAATTT TATCTATAGT
GAGGCCAGTC AGTACCTGGG TCCGGGGATA ATTTTAAGTG ATAGTTACCA GAATGCTTCA
GAGGCCCAGT TTTATTTAAA AGGCAGGCCG GAAGTCTATA TTATTACCCA GACCAGGTAT
TCCAATTACA ACCTCTGGAG TAAAGAGGTT AAAGAAAATA TTGAGAATGG AGAGATTAAA
GAGGCCATTT ATATCGGGGC CTCTGATAAA AAGGATGAGT TGTTAACATA TTTTGATGAT
GTCTATCTCC TCGACAGGAT TAAATATAAC GGAAGGTTTG TTCAGCGCCT TTTTTATGTT
TATCGATGTT ACAACTAA
 
Protein sequence
MTKKARIIYS VVVGFSALYN FTLPLHPDEA YYWVWSRNLQ LSYFDHPPMV AYLIKMMTLL 
GQHEFVIRLV SVFCVAGAAY LVYRLAEDMF NSRVAEISLG VFLFMPLVQS GFIVVTPDSP
LVFFWTLTFY CFYNYVFREK RKYIYLAGIA AGMLLLSKYT GVLLLGSLFF YLTFSKKRKL
FKRPELYAAG VLALLVFLPV IIWNYQHGWA SFKFQFSHGV ADKKVFNPSA LGEFIASQVM
VFNPVFATGF LILFIKNIKD VIKNKYLYLL TWPFAFTLLF FVYNSAFKKA EANWAGPAYI
TAAIILAYWI DKHKLKKFFI AGLIMAIFLI IVMRFPEFIP GYPEQAVLKK DFYGHNFIYS
EASQYLGPGI ILSDSYQNAS EAQFYLKGRP EVYIITQTRY SNYNLWSKEV KENIENGEIK
EAIYIGASDK KDELLTYFDD VYLLDRIKYN GRFVQRLFYV YRCYN