Gene Haur_0373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0373 
Symbol 
ID5732224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp446120 
End bp447373 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content54% 
IMG OID641277496 
Productsterol 3-beta-glucosyltransferase 
Protein accessionYP_001543152 
Protein GI159896905 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTATT GTATCTTGGC GCTTGGTTCG CGCGGCGATG TGCAGCCGTT TATCGCCTTG 
GCGCTTGGGT TGCAAACTGA GGGTCATCAG GTCGTTATCG CCGCAGCTCA TGATTATCGT
AGCTTGGTTG AAAGCTATGG CTGCCGTTTT GCCCCGTTGG TTGGCTCAAT TTCGGCCTTG
CTCAACCCTG AACAAATGGC GGCGGGCTTG GCAGCAGGTC GCAGCGCAAT CATCAAACAA
TTTCTTCAAC AAACACCACC GATTATTCGT CAATTAATCG CTGATGCACT AGCGGCCTGC
CAAACTGCCG ATTGTTTGAT CGTTTCCAGT TTGGGGATGT GGCCTGCGCT GCATCTAGCC
GAGCATTTGC ACATTCCCGT AGTGTTGGTA CACCTGCACC CTTATGCTGC TAGCAGCCAA
ACCGCCCATC ATTTTGCGCC GCAACTGGCT TGGGCCAGTT ATCGCCGCAT GAGCTATCGC
GTGGCCGAGC AATTGCAATG GCAAGTCTTG CGCATGGCCT TCAATCAAGC ACGGCAGCAG
ATTTTGCAAC GCCCAAGCCT AAGCATTGGC CAGCTTTGGC AACGGAGTCG CAATTTTCAG
CCACCAACCT TGTATGCCTA TAGCGCGTTG GTTGCTCCGC CGCCAGCAAC TTGGTTTGAC
GATGGCGCGA TCACTGGCTA TTGGTCACTG CCCCCAGCGG CAGATTGGCA AGCACCAACG
GCATTACAGC AATTTCTGGC AGCAGGCCCA GCGCCAATCA CCATTAGTTT TGGCAGCATG
TTGCATGGTC AAAAGCGCGG CAATCAATTA AGCCAATTGC TAATTACCGC CAGCCAAAAA
GCCAAAGTAC GCATGATCAT CAACCAAGGC TGGGGCGATT TAGCTCAGGG CAAGTTGCCA
GCCAACTGTT TAGCGATCAA TGGCCTAGCG TATGCTTGGT TATTTGAGCG GGTAGCGGCA
GTTGTGCATC ATGGCGGGGC GGGCGTAACC GCTACAGCCT TAGGTGCAGG CAAGCCCGCC
TTGGTCACAC CATTTTTGGG CGACCAATAT TTTTGGGGCC AGCGGGTGTA TGATCTCAAA
GCGGGGCCAG CGCCTGTGCC AGCCAACCAA TTGCAAGTTG CACAGCTCGC TACTCTGCTG
TGTAGCTTGA TTGAGCGCGA TGATTATCAG GCGGCGGCGC AACAACTTGC GACCCAATTA
GCCCAAGAGC AAGGCGTAAC CAAGGCCATA GCTTGGTTAA AACAACGATT TTGA
 
Protein sequence
MNYCILALGS RGDVQPFIAL ALGLQTEGHQ VVIAAAHDYR SLVESYGCRF APLVGSISAL 
LNPEQMAAGL AAGRSAIIKQ FLQQTPPIIR QLIADALAAC QTADCLIVSS LGMWPALHLA
EHLHIPVVLV HLHPYAASSQ TAHHFAPQLA WASYRRMSYR VAEQLQWQVL RMAFNQARQQ
ILQRPSLSIG QLWQRSRNFQ PPTLYAYSAL VAPPPATWFD DGAITGYWSL PPAADWQAPT
ALQQFLAAGP APITISFGSM LHGQKRGNQL SQLLITASQK AKVRMIINQG WGDLAQGKLP
ANCLAINGLA YAWLFERVAA VVHHGGAGVT ATALGAGKPA LVTPFLGDQY FWGQRVYDLK
AGPAPVPANQ LQVAQLATLL CSLIERDDYQ AAAQQLATQL AQEQGVTKAI AWLKQRF