Gene Haur_0065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0065 
Symbol 
ID5731938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp84367 
End bp85875 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content54% 
IMG OID641277187 
ProductO-antigen polymerase 
Protein accessionYP_001542845 
Protein GI159896598 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.321315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCTTG CACGTTGGCG TTGGCATCCC CTCACAACGC TTGGGCTGAG TGGGCTGGCG 
GCCTTGGCAA TTGTGGCGCT CTATCGGCCC AAACGCTTTG TGCCATTGAT TGGCATTGAT
GAGGCGGCTG CGCATAATTT GCTGTTTGCG CTAGCGTTGT TGGTGTTATT TGCCCTACTG
GCTTGGCTGC GACCCGAAAT TGGCTTGGCT GGAGTGGCCG CGACGATTCC GTTTAACTAT
CGCTCACGCG GCTTTTGGGA TGGCAACTAC CCATTTATTG ATGGCAAATA TTTCCCGTTG
CACGAGTTGT TGCTGTTGGT CGTCTTGGGT GTTACGGCGC TCGATGTTGG TTGGCGCTTG
ATCCAACGCC AAATCGCATG GCAAATGTGG TGGCGTGAAC ATTGGCGCAT GCTGCTGTTG
CCCTTGGCTT GGCTGCTTGT GGCTACATTT GCGGCAAACT TGGCCGTGCC CGAAGGTCAA
GCTGAGGCAT GGCGCGAATG GCGTTGGATG ATTATCGAGC CATTGTTGCT GTATGGCTTG
ATTTTGTATT GGCAACCACG CTACCCAGTG CGGCGCTGGT TACTTTGGGG TTTGCTAGCT
GGCAGCGTAA TTGTGGCGAT CATTGGCATT TTGCAGTGGC GCGACCTCGA TTGGACTCCG
ATTGATGGTA CGGGCATGTG CTTTAGCGAT TTGATTGTTG ATTCTGGCGG CACAAAACGC
ACATCATCGG TCTACTGCCA CCCCAATAAT TTGGCCTTGT GGATGGATCG CGCCAGTATG
TTGGCGGTTG TCGCGGCAGC ATGGGCCTGC TGGCAATGGT GGCGCAAACG CACCTGGCAA
ACGGCTATAT GGTCGTTGCT CTATCTTGGG GCCAGCAGCT TGTTGCTATT AAGTTTGATG
CTAACCTATT CCAAGGGGGC ACGGTTTGCA GTGGCCTTGG TATTGATTGG CCTAAGCTGT
TTGCCGCGCC GCTGGTGGCT ACCGCTGATC ACGAGCGCTG TGCTTGGTGG TTTGTTGCTT
TATTCGTCGT TGAGTGGTCC TGAGCGCTTG AACGTGACGG GCGATTCCAG TTCGGCGCGT
TTGAGTATTT GGCGCTCAGC CACGGCGATG ATTATTGATC ATCCAATTGT AGGCATTGGG
CTTGATCAAT TTTATTTCTA TTTCAATCCA CAATTCAATC GTGGCTATAT CGAGCCACGC
CTTGCCGCCG ACCCTGCCGA GCGTAACACC GCCCATCCGC ACAATTTGCT ACTCGATTTG
TGGTTGCGGG TTGGGATTGC GGGAGTGCTT ATTTTTGCGG CGTTGGCGTG GCGCAGCCTG
CGGCGCACTT GGCAAATTTG GCATAGCGAG CAGCCTGAGC GCTGGTTGGC GTTAGCGGCT
TTGGCGGCCT TGATGGCTGG CTGGTTGCAT GGCGGCGTTG ATCAAGGCTA TTTCTCCAGC
GATTTAGCAA TGGTGACATG GTTAACCCTA GGGATGATCG ATAGCTTCAC CCTAAAAGGT
CAAAATTGA
 
Protein sequence
MMLARWRWHP LTTLGLSGLA ALAIVALYRP KRFVPLIGID EAAAHNLLFA LALLVLFALL 
AWLRPEIGLA GVAATIPFNY RSRGFWDGNY PFIDGKYFPL HELLLLVVLG VTALDVGWRL
IQRQIAWQMW WREHWRMLLL PLAWLLVATF AANLAVPEGQ AEAWREWRWM IIEPLLLYGL
ILYWQPRYPV RRWLLWGLLA GSVIVAIIGI LQWRDLDWTP IDGTGMCFSD LIVDSGGTKR
TSSVYCHPNN LALWMDRASM LAVVAAAWAC WQWWRKRTWQ TAIWSLLYLG ASSLLLLSLM
LTYSKGARFA VALVLIGLSC LPRRWWLPLI TSAVLGGLLL YSSLSGPERL NVTGDSSSAR
LSIWRSATAM IIDHPIVGIG LDQFYFYFNP QFNRGYIEPR LAADPAERNT AHPHNLLLDL
WLRVGIAGVL IFAALAWRSL RRTWQIWHSE QPERWLALAA LAALMAGWLH GGVDQGYFSS
DLAMVTWLTL GMIDSFTLKG QN