Gene Haur_0234 details

Gene Information

Locus tag: Haur_0234
Symbol:
ID: 5732129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 269886
End bp: 271043
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 51%
IMG OID: 641277358
Product: glycosyl transferase group 1
Protein accession: YP_001543014
Protein GI: 159896767
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
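
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The variable names are made up for the example.

```python
# Values copied from the Gene Information fields above.
start_bp = 269886
end_bp = 271043
gene_length_bp = 1158
protein_length_aa = 385

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the last one is assumed
# to be the stop codon, which is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 1158 0 385
```

Under those assumptions the span matches the listed gene length, and 386 codons minus the stop codon gives the listed 385 aa.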


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCTATA ATGTGCTCAG CAATTATTTG AGCGCGAGTA TCACCCTAGC AATGCATATT 
CTTCATGTCT ATAAAGATTA TTTTCCAGTG CTTGGTGGCA TGGAAAACCA TATTCGGGTA
GTGGCTGAGG GCTTGGCTGA ACGCGGCCAT CAAGTTACGG TGGCAGTCAG CAATACCTAC
CCAAAAACCG AAATCGAGCG CCGTAATGGT GTAGCGATTA TTAAAGCAGC CCAATGGTTG
CGCAAGGCAT CAACCCCGAT TAGCCCGATG AGTTTGCCAT TAAGTTGGCG TGTACCCGCC
GATATTATCC ATTTGCATCA TCCCTTCCCA CCTGGCGATT TGCTGTATTG GCTGCGTGGT
GGCAAGGCTA AATTGGTGAT TACTTATCAA AGCGATATTG TGCGCCAACG CCGTTTGTTG
CAACTCTATC GACCATTGCT TACCCGTACT TTGAACGCCG CCGATCGAAT TATCGCGGCC
AGCCCGCAGT ATATCCAAAC CTCGCCATGG TTGGCTCCTC ATGCCGCCAA ATGCCGCGTA
ATTCCCTTGA GCGTCGATAC CGAGCGCTTC AATCAACTTG ATCATGCGGC GATTCAGGCG
TTGCGTGAGC AGGTTGCAGC ACCCATGGTG TTGTTTGTTG GGCGCTTCCG CCATTACAAA
GGCCTGCACT TTTTGCTCGA AGCCTTGCCA AAAATTCCCA AGGCCAAATT GGTGTTGGTC
GGCATTGGCC CTGAGGAAGC TCGTTTGCGC GAGTTGGCGC AACGCTTGGG TGTTGGCGAA
CGTATTATAT GGGCTGGCGA AGTCCCGGAT CAAGCCTTAC CAAATTACTA TGCCGCTGCC
GATGTATTTG TGCTACCATC TCATTTACGA GCAGAAGCAT TTGGCATCGT GCAACTCGAA
GCATTAGCCG CTGGAATTCC AATTGTCAGC ACTGAGTTGG GCACTGGCAC AAGTTTTGTC
AACGCCCACG GCCAAACTGG GTTTGTTGTG CCACCAGCCG ATCCGGCGGC ACTGGCGCGG
GCAATCACTG TGCTGTTGGA AAATCCAGGC TTGCGGGCGC AATTTGGAGC TAACGGTCGT
CAACGCGCGA GCAGCACGTT CAGTCCACAG CGCATGCTCG ATCAGATTGA AGAACTTTAT
CGTGAGATTG TGAGTTAG
 
Protein sequence
MFYNVLSNYL SASITLAMHI LHVYKDYFPV LGGMENHIRV VAEGLAERGH QVTVAVSNTY 
PKTEIERRNG VAIIKAAQWL RKASTPISPM SLPLSWRVPA DIIHLHHPFP PGDLLYWLRG
GKAKLVITYQ SDIVRQRRLL QLYRPLLTRT LNAADRIIAA SPQYIQTSPW LAPHAAKCRV
IPLSVDTERF NQLDHAAIQA LREQVAAPMV LFVGRFRHYK GLHFLLEALP KIPKAKLVLV
GIGPEEARLR ELAQRLGVGE RIIWAGEVPD QALPNYYAAA DVFVLPSHLR AEAFGIVQLE
ALAAGIPIVS TELGTGTSFV NAHGQTGFVV PPADPAALAR AITVLLENPG LRAQFGANGR
QRASSTFSPQ RMLDQIEELY REIVS
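
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not part of the original record. It assumes the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in which codons may act as start codons; this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last lines of the gene sequence are translated to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence above. The 19 preceding lines
# total 1140 bases, a multiple of 3, so the last line is in frame.
first_line = "ATGTTCTATA ATGTGCTCAG CAATTATTTG AGCGCGAGTA TCACCCTAGC AATGCATATT"
last_line = "CGTGAGATTG TGAGTTAG"

print(translate(first_line))  # MFYNVLSNYLSASITLAMHI
print(translate(last_line))   # REIVS
```

Both results agree with the start and end of the protein sequence listed above.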