Gene Hhal_1558 details


Gene Information

Locus tag: Hhal_1558
Symbol:
ID: 4710777
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1693726
End bp: 1695144
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 63%
IMG OID: 639856022
Product: O-antigen polymerase
Protein accession: YP_001003124
Protein GI: 121998337
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID: [TIGR03097] probable O-glycosylation ligase, exosortase system type 1-associated
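
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1693726
END_BP = 1695144
GENE_LENGTH_BP = 1419
PROTEIN_LENGTH_AA = 472


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1695144 - 1693726 + 1 gives 1419 bp, and 1419 / 3 - 1 gives 472 aa, matching the listed lengths.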


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.238347
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCTTC GTGACATCGT CTTCGGCACC ATCCTGATCT CCAGCCTGCC GCTGATCTTA 
TACAGGCCCT GGATCGGGCC GCTGATGTGG TACTGGGTGG GCTTTTTTAA TCCGCACCAG
CAGGCGTGGG GGTTCTTTGC CGGTGCTCAG GTCGCACTCC CCGTCGCTAT CGCGACCCTC
GCAGCCACCG CCTATACCCA GGAGAAGCGC TGGCCACCGA TGACTCGGGA GATGTGGCTG
GTCTTCCTTC AGGTCATCCT GTTCACGGTG ATTACGTTCG GCTTCGCCTG GCTCCCGGAT
GACGCGTTCG GGCTCTGGGA TCAGCGGATG CGCATCATCT TGATGACCGT CATCACGGTA
ATACTGATCT ACGGGAAGCA GCGCGTCATG GCGCTGCTGG CCATGATCAC CCTGTCCATC
GCCTACTTCG GCTTCAAGGG CGGCCCCTAT ACGCTCAGCA CCGGATTCGG GGGGATGGTA
CTCGGGCCAC AGGGAACGTT CATCGGCGGC AATACCGATA TCGGCCTGGC TCTGGTGATG
ATCCTGCCTC TCACCTTAAT CCTCGCCCGC CAGGTCTATC ACGGGCGTTT CGAACTCCCG
ATCCGCATCC CCGGCTTCGA AACCTGGCAC CGGCTGATTG GGCTCGCCCT CTACGGCGGC
TTCTGGATGA CCCTGATCTC GATCATTGGG ACCCAGTCCC GCGGTGCCTG GGTAGCCCTG
GCATGCACCT GGCCGTTCAT CTTCTGGCGC CTGCGCTTCA AGTGGGCCCT GGTCGCCGCC
GTCGTCCTCG CGGTTGGGGT CATCGGAGTC ACGGTCCCGG ACCGCGTGGC CCACGAGTGG
CAGACCCTTG TCGAATACGA GGACGACGGG TCCGCACAGG GCCGATTCCA CGCGTGGGAT
GTGGCTTGGA ACATTGGGGT GGAGCACCCG CTGACCGGTG CCGGCTTTGG TGCCCAACGC
ATCGACGCCG AGCTATGGCG CTCCTACAGT AGCGATGGTG ATGGCAGCCC GCTCGCACAG
CACAGCATCT ACTTCCAGAC GCTGGCCGAG AACGGATTCC TGGGGCTCGG GCTGTTCCTG
GCACTACTTG GCTTTACGCT GCTCACCCTG AACCGTCTGC GTCGCGACGC CGCTCAGCAC
CCGGATACGC TCTGGATCAG CGAGTGGTCG TGGGCCCTCG CCATCGGCCT GATCGGTTAC
TGCGTCGCGG GGGCCTTCTT GAGCCTCGCG TACTTCGACC TGATGTACGC CTTTATCGCC
CTAGCCATCA TCCTGCGCCG AGAATTTGAG GATGTCCGGG TCGCGGTACG GTACCCCAGC
CCGACCACCG CCACAGCACC ACAACAAACG CCGGTGGGCG AAGTCGGCTA TCGTCCGGGT
ACTCCGCCGC GTGCCCTCTA TCGACGCCCC CCTGCATAA
 
Protein sequence
MDLRDIVFGT ILISSLPLIL YRPWIGPLMW YWVGFFNPHQ QAWGFFAGAQ VALPVAIATL 
AATAYTQEKR WPPMTREMWL VFLQVILFTV ITFGFAWLPD DAFGLWDQRM RIILMTVITV
ILIYGKQRVM ALLAMITLSI AYFGFKGGPY TLSTGFGGMV LGPQGTFIGG NTDIGLALVM
ILPLTLILAR QVYHGRFELP IRIPGFETWH RLIGLALYGG FWMTLISIIG TQSRGAWVAL
ACTWPFIFWR LRFKWALVAA VVLAVGVIGV TVPDRVAHEW QTLVEYEDDG SAQGRFHAWD
VAWNIGVEHP LTGAGFGAQR IDAELWRSYS SDGDGSPLAQ HSIYFQTLAE NGFLGLGLFL
ALLGFTLLTL NRLRRDAAQH PDTLWISEWS WALAIGLIGY CVAGAFLSLA YFDLMYAFIA
LAIILRREFE DVRVAVRYPS PTTATAPQQT PVGEVGYRPG TPPRALYRRP PA
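
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the gene sequence could be normalized and checked against the listed GC content, the helpers below (hypothetical names, standard library only) strip the block formatting, compute the G+C fraction, and test whether a sequence looks like a complete CDS. The stop codons used are those of translation table 11 (TAA, TAG, TGA); only ATG is checked as a start codon here, although table 11 also permits alternative starts.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str,
                            start: str = "ATG",
                            stops: tuple = ("TAA", "TAG", "TGA")) -> bool:
    """True if length is a multiple of 3, starts with `start`, ends in a stop."""
    return len(seq) % 3 == 0 and seq.startswith(start) and seq[-3:] in stops


if __name__ == "__main__":
    # Paste the full gene sequence between the quotes to check the record;
    # only the first block line is shown here as a placeholder.
    gene = clean_sequence("""
    ATGGACCTTC GTGACATCGT CTTCGGCACC ATCCTGATCT CCAGCCTGCC GCTGATCTTA
    """)
    print(len(gene), round(gc_fraction(gene) * 100))
```

With the full sequence pasted in, the rounded percentage can be compared against the GC content field, and `looks_like_complete_cds` can confirm the ATG start and TAA stop visible in the first and last lines.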