Gene Haur_3579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3579 
Symbol 
ID5735440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4499827 
End bp4501332 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content50% 
IMG OID641280728 
ProductO-antigen polymerase 
Protein accessionYP_001546343 
Protein GI159900096 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGCCA ACACACCGCA AGAAGAGCCA ACCAGCCAAC CAGATGTTTT TAACAGCAAA 
TGGATGAAAT TTGCCTTTGC TTTTTTAGCG ATCGCTGGGG GCGCTGGGGT TGCCGCCGCC
TTGGCATTTT TCGATAACCC CTTGAAATTA GTGCTGCTGT TTGGTGGCGC TGGTGCAGCC
ATGGTCACAA TGCGCAATAG CGAATGGGGC TTGCTGGCAC TGGTGTTCAT GAGCTACACC
CGTTTTTCCG ATGTGATGGT GCGCAACGGA GCGCCCTCAA CCGCCCAACC ATTTTTAGCG
TTGCTCTTTT TAATTATTTT CTTGCGCTGG GCACTCTACA ACCAAAAACC CGAGCCTTGG
CTCAAGCCCG CCGCCTGGAT CTTTGTCTAT GGCATGGTTG GGGTCGCCAC ATTTCTCTAT
GCTGATGATG TCATGCGAGT TAAGAATGGG GTCATCACCT ACTTCAAAGA TGCGATTATC
GTGATCGTGG TGGTGATGAT GATGCGTTCG CCCAAGATGC TGCATCGTTC GATGTGGGCG
CTCTTGTTCG CTGGCATTTT TATGGCTTCG ATTACCACCT GGCAGCAATT AACTGGTACG
TTCGAGAACG ATTATTTGGG CTTTGCCAAA GCTGGCAAAA TGCAAATCGT CTCAGGCGTT
GAGGATGATT ATCGGATTGC TGGGCCAATC GGCGACCCTA ATTTCTATTC ACAAGTGCTC
TTGACCCTAA TTCCACTGGG CATGGATCGC ATGTGGAATG AGAAGAACAA AAAATTACGC
TGGTTTGCAA TTTGGCAATT GAGCGTTTGT ATGGCCTCAA TTTTCTTCTC GTTCAGTCGT
GGGGCATTTC TCTCGCTCTC GATTGCCAGC TTAATTATGT TTGTGCGCCG ACCACCCAAG
CCGCTTTCGG TGATCATTAT CATCGCTTTG GGTTTTGTAA TCATCCCGAC CTTGCCAGCT
TCGTATATCG CGCGGCTCGA AACGATTCCC GAGGCAATTC CCGGCTTAGC TCAAGAAGAT
GTGCGCAACG AGGCTTCGTT CCGTGGCCGT TCGAGCGCCC AACAAGCGGG TTTACGCATG
TTCTGGGCTA ACCCAGTTTT TGGCTTAGGT GTGGGCAATT TTGGCAATCA CTATCAAGAA
TATGCCCGTG ATCTAGGACT TGATAACAGC CGTTGGGACC AAGCGCCGCA CAACATGTAC
CTTGAAATTC TGACCGAAAA AGGCTTATTT GGGCTTTCGG TCTTTAGCGC AATGATGTGG
GTGCTGTTCC GCGATATGAA CCGAGCACGT AAAAAGTTTC GCGAAATCAA TATGGGCGAT
TTCGATGGTC TGATCTTTGG TTTCCAGGCT GGGTTGGTTG GCTATATGTT TGCCGGGATC
TTCCTGCAAC TATCCTACCC ACGCTTTTTC TGGATTTTGA TCGCCATCGC CTATGCAATT
CCCAATGTTG CCAATAAAGC TTATGAAGAG TATCGCGAGG CGCTACCAAA TGGCGAAACA
GCCTGA
 
Protein sequence
MLANTPQEEP TSQPDVFNSK WMKFAFAFLA IAGGAGVAAA LAFFDNPLKL VLLFGGAGAA 
MVTMRNSEWG LLALVFMSYT RFSDVMVRNG APSTAQPFLA LLFLIIFLRW ALYNQKPEPW
LKPAAWIFVY GMVGVATFLY ADDVMRVKNG VITYFKDAII VIVVVMMMRS PKMLHRSMWA
LLFAGIFMAS ITTWQQLTGT FENDYLGFAK AGKMQIVSGV EDDYRIAGPI GDPNFYSQVL
LTLIPLGMDR MWNEKNKKLR WFAIWQLSVC MASIFFSFSR GAFLSLSIAS LIMFVRRPPK
PLSVIIIIAL GFVIIPTLPA SYIARLETIP EAIPGLAQED VRNEASFRGR SSAQQAGLRM
FWANPVFGLG VGNFGNHYQE YARDLGLDNS RWDQAPHNMY LEILTEKGLF GLSVFSAMMW
VLFRDMNRAR KKFREINMGD FDGLIFGFQA GLVGYMFAGI FLQLSYPRFF WILIAIAYAI
PNVANKAYEE YREALPNGET A