Gene Haur_0400 details


Gene Information

Locus tag: Haur_0400
Symbol:
ID: 5731968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 470013
End bp: 471179
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 50%
IMG OID: 641277523
Product: cell wall biosynthesis glycosyltransferase-like protein
Protein accession: YP_001543179
Protein GI: 159896932
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
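
The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(470013, 471179)
print(length_bp)                  # 1167
print(protein_length(length_bp))  # 388
```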


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.443716
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTTGGT TGGCTTGGTG GTATCTGCTT GATCGCTGTT GGCGTTGGCT GGTGATTCGA 
CGGTTTTTTG CCAAGCACCA ACCACCAGCA CCCAAACATT GGCCGAGCAT TGATTTGATT
CAACCGTTAA CCCATGGGGT GTTTGATTTA GCCAAAACCT TGCAACAGCG GCTCGATTTG
CGCTATGCGG GCCAACTTCA ACATTATTGG GTGATCGATC AAGCAGATAA TCAAACATTG
GCAGTGTGTT CTGCTTTACA GAAAAAACAT CCTGAGCAGC ACATCACGAT TATTCAAGTT
GCGCCAGATT GGGGCCAACG TGCTTCGAAG TTAGTGAAAT TGCAAGCTGT GCTGGCCCAA
GCCAACTCCG AGATCGTATG GTTTGTTGAT GATGATGTTA GCTTACCGCT TGATGGCTTG
AGCCAAGGCC TACCCTATTT GTTCCAGCCG CAGGTTGGGG CGATTTTTGG CTTGGCCTGC
TATGTGAATT GGCATAATTT CCCGTCGGCT TTGATGAGCA ATTTTGTCAA TGCCAATGCC
TTGCCCAGCT ATATCGGTTT GGCGGCTTTA ACCGAGCCAT ACACAATTAC TGGGCATCAA
TTTGCTTTGC AACGCAGCGT GTTTGAACAA ATTGGCGGGC TGAGTGGCAT GTACGGGCGG
ATTGACGACG ATCATGAGTT GGCGCGGCGG GTGCAAGCCC ATGGTCTGCG CAATTTGCAA
ATGCCATTAA TCTATCAGGT AGATAATTAT TTTGTGAATT TACCAGCCTA TTTTCAGCAG
ATGCAGCGTT GGTTTACGAT TCCCCGCGTG CTGATGCTGC CGCATCTCAG CCAATACGAC
CAGTTTGTGA CCATTTTGAG CAGCCTTGCC CAGCCGATTC CCACGATTTT GGCCTTGGCA
AGCATGCGCC AGCCAAAACT ACGGCCTTGG CTCGTGGCAT GTTTATTGGC GCAGTTGAGC
TTGCAGGGTT GGCAAATTCG GCGCTATTGC CAAACCAAAG TGCCGTGGTG GGCTTGGCCC
TTGAGTATCA TTGGTACGTT GATCGATCCC TTGCTGATGC TTTGGGGTTT GCTGGGCGAT
GATACGATTG TTTGGCGCGG TGAGCGGATT CGGCTGCGGC ATTCGGCCTC GGCTCAATGG
TTGGGCAAGG AGCAAGATCA TGATTAA
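
The record reports a GC content of 50% for this gene. A minimal sketch of how such a figure could be computed from a sequence printed in blocks of ten bases is shown below. The function name `gc_percent` is hypothetical, and the rounding the database applies is not stated in the record, so the function returns the unrounded percentage.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy inputs, not taken from the record.
print(gc_percent("ATGC"))       # 50.0
print(gc_percent("GGCC AATT"))  # 50.0
```

Pasting the full gene sequence above into `gc_percent` as one multi-line string should work unchanged, since all whitespace is stripped before counting.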
 
Protein sequence
MRWLAWWYLL DRCWRWLVIR RFFAKHQPPA PKHWPSIDLI QPLTHGVFDL AKTLQQRLDL 
RYAGQLQHYW VIDQADNQTL AVCSALQKKH PEQHITIIQV APDWGQRASK LVKLQAVLAQ
ANSEIVWFVD DDVSLPLDGL SQGLPYLFQP QVGAIFGLAC YVNWHNFPSA LMSNFVNANA
LPSYIGLAAL TEPYTITGHQ FALQRSVFEQ IGGLSGMYGR IDDDHELARR VQAHGLRNLQ
MPLIYQVDNY FVNLPAYFQQ MQRWFTIPRV LMLPHLSQYD QFVTILSSLA QPIPTILALA
SMRQPKLRPW LVACLLAQLS LQGWQIRRYC QTKVPWWAWP LSIIGTLIDP LLMLWGLLGD
DTIVWRGERI RLRHSASAQW LGKEQDHD
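
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is a plain-Python illustration and not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it assumes the sequence is given in reading frame on the coding strand. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above; both start on a codon boundary.
first_line = "ATGCGTTGGT TGGCTTGGTG GTATCTGCTT GATCGCTGTT GGCGTTGGCT GGTGATTCGA"
last_line = "TTGGGCAAGG AGCAAGATCA TGATTAA"
print(translate(first_line))  # MRWLAWWYLLDRCWRWLVIR
print(translate(last_line))   # LGKEQDHD
```

Both outputs match the corresponding start and end of the protein sequence listed above.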