Gene Haur_4524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4524 
Symbol 
ID5736375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5790056 
End bp5791321 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content53% 
IMG OID641281686 
Productglycosyl transferase family protein 
Protein accessionYP_001547283 
Protein GI159901036 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.186857 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAATTT TAATTATTGC CTTTGGCACA CGCGGCGATG TTCAGCCGAT GGTGGCCTTG 
GGCTTGGCCT TGCAAGAGCG TGGGCATTCG ATAACCCTGT TGGTCAGCAG CAATTTTAAA
AGCTGGGTTG AGGAGTTTGG GTTACAAGTG GCGACTGCGC GGGTCGATAT TCAGCAGATG
ATGTTGAGCG ATCATGGCAA CGATTGGGTC AAACATGGGG CTAATCCAAT TAAACAGCGC
AATGCGATGC GCCGTTTGTT GAAGCAACAT GCCTTGACCA TGGTTGAAGA TGCTTGGCAA
GTCGCCCAAA ACTGCGATGT TTTGATCAGC AGTTTTACCT CGGATGTCTT TGCAGTGACG
TTGGCTGAAG TGCTGAATGT GGTGCATATT AGCACGCCAC TGCAACCAGC CATGTTGGCC
ACCCGTTGCG GCCCTGCTAG TGCGGCGGCA ATTCTACCAA ACCACGAGAG CATCATTAAT
TATTGGTTTG GGCGCTGGGT GCTTGAGCCA TTTATGTGGC AAGTTGGCGG CGATTTTATT
AATCAGTTTC GCCAGCAACA GCTCAAATTG CCAGCCCAAA GTGTGCGCGA ATATGCTCAA
CGCTTGCGTC AAACCACGAT TATTCAAGGC TATAGCCCGG CAATCATTCC GCATCCCAGC
GATTGGCCCG CCAATATTCA GACGGTTGGT TATTGGATGT TGCCGCCCGA TGAGGCTTGG
CAAATGCCGC CTGAGCTTGA GCAATTTTTG GCCGATGGCC CAACTCCAAT CTATATAGGC
TTTGGTAGCA TGACCGGAGC TAACCCTGAT GCTTTTACCG AATTGTTGCT CAAGGCGGTG
GCACATAGCG GCCAGCGGGC AATTATCCAA ACTGGTTGGG CTGGCTTGGG CCAAATCGAA
TTGCCCAAAA CTGTTTTTCG GATTGGCTCA GCGCCGCATG AACGGCTTTT TCGCCATGTC
AAAGCGGCGG TACACCATGG CGGGGCTGGC ACAACGGCTG CAAGCTTAGC GGCTGGTTTG
CCAACCGTCA TCGTGCCGCA CTTGGGCGAT CAACTGCGTT GGGGTCAGCG CGTGTTTGAT
TTGGGCTTAG GGCCAAAGGC GATTCCGCGC AACAAACTTA CGGTTGATCG GTTGGCTTGG
GCGATTTCGC AGGCCGCTAA CACGCCGAGC ATGCAACACA ATGCCCAAGC CATGGCCAAA
ACCCTGCAAG CTGAGCAGGG CATCAGCCGC GCGGTCGAAA TTATTGAACA ACGGATACAA
GCCTAG
 
Protein sequence
MRILIIAFGT RGDVQPMVAL GLALQERGHS ITLLVSSNFK SWVEEFGLQV ATARVDIQQM 
MLSDHGNDWV KHGANPIKQR NAMRRLLKQH ALTMVEDAWQ VAQNCDVLIS SFTSDVFAVT
LAEVLNVVHI STPLQPAMLA TRCGPASAAA ILPNHESIIN YWFGRWVLEP FMWQVGGDFI
NQFRQQQLKL PAQSVREYAQ RLRQTTIIQG YSPAIIPHPS DWPANIQTVG YWMLPPDEAW
QMPPELEQFL ADGPTPIYIG FGSMTGANPD AFTELLLKAV AHSGQRAIIQ TGWAGLGQIE
LPKTVFRIGS APHERLFRHV KAAVHHGGAG TTAASLAAGL PTVIVPHLGD QLRWGQRVFD
LGLGPKAIPR NKLTVDRLAW AISQAANTPS MQHNAQAMAK TLQAEQGISR AVEIIEQRIQ
A