Gene Haur_0362 details


Gene Information

Locus tag: Haur_0362
Symbol:
ID: 5732213
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 433120
End bp: 434373
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 53%
IMG OID: 641277485
Product: sterol 3-beta-glucosyltransferase
Protein accession: YP_001543141
Protein GI: 159896894
COG category: [G] Carbohydrate transport and metabolism
              [C] Energy production and conversion
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
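
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 433120
end_bp = 434373

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1254
print(protein_length)   # 417
```

Under these assumptions the results match the reported Gene Length (1254 bp) and Protein Length (417 aa).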


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0442187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATATTA GCATCATCAG TTATGGTTCG CGCGGCGATG TTCAACCATT TGTAGCGATT 
GCACGGGCAT TACGCCATGT AGGGCATCAA GTTCAACTGA TTGGCCCAGC CAATTTTGCT
GCGCTAAGCC ACGATGCAGG CGTGCCGTTC GTTTCGGTTG GAGTTGATAT TCAAGCCTAT
TTACGTGAAC GCATCGCTAG CTTATCTGGC TCGCGCAATG TGATAGGGCT GCTCAAAAGC
CTGCGTAACG AACTAAACGA ATTGATTGAA GGAATTGCCC AAGAAACATT GCAAGCCTGT
CAAGGAACCG ATCTGATTCT TGGGACTGGC CCCCAGACTG CTAGTTTTGC TGAACGACTG
GGGGTTCCAT TTATTGAAGC AGTGCTCCAA CCGCTAACCC CCACCCGCGC CTATCCCTCG
CCAATTGCCC CGGCGTGGCT CCAACTTGGC GGATTCGCCA ACTATCTCAC GCATCTTGGT
TTTGAGCAGA TTTTTTGGCA GATCTTTCGG CCTACGGTTA ATCGAGTTCG CAGCCATGTG
CTTGGCTTAC CATCCTATGG CTTTACCAGC CCGTTTGGCA AAATCCGCGA GCAGGTTCCG
TTGCGGCTAC ACGCCTATAG TGACTATGTT ATGCCAAGGC CAAACGATTG GGCCAAGCAA
CATCAGGTCA CAGGCTTTTG GTTTCTGCCA GCACCAGCCG ATTGGTCGCC ACCAGCTGAG
CTATGCGCCT TTCTGGCAGC TGGGCCGGCT CCAATTTATA TCGGCTTTGG CAGTATGATG
GGCGGTGATC CACAACAATT AACCAGCATT GTGAAAGAAG CCTTAGCTCG CTCTGGCCAA
CGGGGAATTT TGGCTGGCGG TTGGGGTGCA TTAGCCGAAA CCTCAGCGCC AAGCGATCAC
TTATGCTTTG TTGAAAGCGT GCCGCATCAA TGGCTTTTCC CGCAAACAGC GGCAATTGTG
CATCATGGCG GTGCTGGCAC CACTGGCGCA GCCTTACGCA GTGGCCGACC GTCAATCGTT
GTGCCCTTTG CCTTCGATCA GACTTTCTGG GGGCGACGGG TGGCTGAGCT AGGCGTGGGC
ACTGCACCCA TCGCACGTTC GCAAATCACG GTCGATCGGC TGACAGCAGC GATCAATCAG
GTAACAACCC AAACCGCAAT TCGTGAACAA GCAGCCCAGC TTGGCAGCCA AATTCAGCAA
GAATACGGCA CAGCCCAAGC GATTGACCAT ATTCATCGCG TATTTCGCCA TTAA
 
Protein sequence
MDISIISYGS RGDVQPFVAI ARALRHVGHQ VQLIGPANFA ALSHDAGVPF VSVGVDIQAY 
LRERIASLSG SRNVIGLLKS LRNELNELIE GIAQETLQAC QGTDLILGTG PQTASFAERL
GVPFIEAVLQ PLTPTRAYPS PIAPAWLQLG GFANYLTHLG FEQIFWQIFR PTVNRVRSHV
LGLPSYGFTS PFGKIREQVP LRLHAYSDYV MPRPNDWAKQ HQVTGFWFLP APADWSPPAE
LCAFLAAGPA PIYIGFGSMM GGDPQQLTSI VKEALARSGQ RGILAGGWGA LAETSAPSDH
LCFVESVPHQ WLFPQTAAIV HHGGAGTTGA ALRSGRPSIV VPFAFDQTFW GRRVAELGVG
TAPIARSQIT VDRLTAAINQ VTTQTAIREQ AAQLGSQIQQ EYGTAQAIDH IHRVFRH
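
The protein sequence corresponds to the gene sequence translated with translation table 11, and the GC content can be recomputed from the same sequence. The following is a minimal sketch rather than the database's own pipeline: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is built from the standard NCBI amino-acid string, which table 11 shares with table 1 (the two differ only in the permitted start codons). This gene starts with ATG, so alternative start codons are not handled here.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest),
# paired with the amino-acid string used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
first_block = "ATGGATATTA GCATCATCAG TTATGGTTCG"
print(translate(first_block))  # MDISIISYGS
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared against the protein sequence and the GC content reported in the record.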