Gene Haur_4434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4434 
Symbol 
ID5736285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5674199 
End bp5675623 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content53% 
IMG OID641281597 
ProductO-antigen polymerase 
Protein accessionYP_001547194 
Protein GI159900947 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGTC ACGCATTAAC TTGGCGCTCC GTGCAGCCAT TGGATTGGTT GCTAGCTAGC 
TTGGGGCTGG CAGGAGTGGC AATTATTACT CTGCTGCCCT TCACGCAGGC AGCAACCTTG
ATTATTTGTG GCATGCTGCT AGTTTGCATG CTGATTCAGC CTGCCGTAGG TTTGAGCCTG
ACGGTTGCCA CGGTCATGCT TCAAGAGTTA TTGAGCTTTC CGCTGGGCCT GACCGCCACC
CATGTGATTG GGATTATGGC GCTGGGGGCG TGGTTGCTGT ATGGCATGGC GCAGCGCAAA
ATCATCATCG ACACAACTTT GCTGGTGCCA TGGAGCCTGT TTTTGATGGC CTTGCTGCTC
TCGGCGGGAC TGACCGAATA TAACGCGGTT GATGCCTTGA AGCAGGTGGT GCGTTGGTTT
ATGGCCTTGC TGGCCTTTGT GGTGACGGTT GCCACAATTA CCACGCCCAA ACGCGCGATT
GGCCTGATTG CGGTGATGTT CACGGTTGGG GTCATCGAGG CACTGATCGG TATTCAGCAA
TATCGCGTGG GTGCTGGGCC ATTTGCCATT GGCGAAACTG TGCGGGCTTA TGGCACAATC
GGTAAGCCCA ACACCTTTGC CGGCTTTTTG GAGTTGATGT GGCCCATGAC TTTGAGTGTA
GCCTTGGGCT TGCTCTGGTT TTGGTGGCAG CAGCGCCAAC GCTGGCACTA TTTAATTGGC
TCGGCCTTGA GTGCTGGCGC AAGCCTGATC ATTTTGGCGG CAGTTGGGGT TAGTTTTTCG
CGCGGCGCTT GGATTGGCAT TATGGGTGCG GTGGTGGTGA TGCTGCTGGC GGTTGATCGG
CGGCGAGCCT TGCCATTAAT CGCGCTTGGT GGAATCTTGC TGTTGGCGAT TATCAGCCAA
CCTGAGCTTT TCCCCCCAGT GATTACCGAG CGAATTAGCA GTCTAACCAA CAATTTACGG
ATTTTTGATG CTGGGCGGGT GACGGTTACC GATGAAAATT TTGCGGTTGT CGAACGCATG
GCCCATTGGC AAGCGGGGGC AAATATGTTT TTGGCTCATC CGCTGCTCGG AGTTGGCCCC
GACAACTTCA ATCGAGCCTA TCCCGAATTT TTTGTCGGGC GCTGGTCGGA ATCGCAAGGC
CACTCGCACA ACTACTACAT TCATATTGCG GCAGAAGCTG GCATTTTAGG CTTTGTTGCT
TATCTCGTGC TGATTGCAGC GGTCTATCGT CAAGCCTATT TGGCAATTCA GGCGACGCGC
GGCACGGTTT GGCAGATGGT AGCAATTGGC TGCTGTGGTA TCATAACCGC CATTCAATTG
CATAATGTTT TCGATAATCT CCATGTGTTG AATTTTGGAA TTCATTTGAG CGCAGTGTGG
GCCTTATGTG TGGTTCTGAC ACAGCGCCAA GGGTGGCGTG CATGA
 
Protein sequence
MQRHALTWRS VQPLDWLLAS LGLAGVAIIT LLPFTQAATL IICGMLLVCM LIQPAVGLSL 
TVATVMLQEL LSFPLGLTAT HVIGIMALGA WLLYGMAQRK IIIDTTLLVP WSLFLMALLL
SAGLTEYNAV DALKQVVRWF MALLAFVVTV ATITTPKRAI GLIAVMFTVG VIEALIGIQQ
YRVGAGPFAI GETVRAYGTI GKPNTFAGFL ELMWPMTLSV ALGLLWFWWQ QRQRWHYLIG
SALSAGASLI ILAAVGVSFS RGAWIGIMGA VVVMLLAVDR RRALPLIALG GILLLAIISQ
PELFPPVITE RISSLTNNLR IFDAGRVTVT DENFAVVERM AHWQAGANMF LAHPLLGVGP
DNFNRAYPEF FVGRWSESQG HSHNYYIHIA AEAGILGFVA YLVLIAAVYR QAYLAIQATR
GTVWQMVAIG CCGIITAIQL HNVFDNLHVL NFGIHLSAVW ALCVVLTQRQ GWRA