Gene Haur_3826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3826 
Symbol 
ID5735690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4803723 
End bp4804952 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content55% 
IMG OID641280978 
Productglycosyl transferase group 1 
Protein accessionYP_001546590 
Protein GI159900343 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.762475 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATAC TCATGATTGC GTCGTCGTTT CCCAAATATC CAGGTGAGAT GACCGCGCCC 
TTTATCGAAG AAATTGCCGC CGCCGTGGTC GAGCGTGGCC ACGAGGTGCA TATGCTTTTG
CCCGATCACC CCGAACTCAA ACGTGGCGAT CAGGTACGGG GCATGCAGAT CCATCGCTAT
CGCTATGCGC CGCATCCGAG CTTAAATGTT TGGGGCTATG CTGGTGCGTT GCATAATGAT
GTACAAATGC GCAACGCCGC GCTTTTGGTT GCACCCTTGG CAGTTGCTTC AGCATGGCGC
ACCATGGCCC AGCTTACCGC CCAACAACCC TTCGATTTAA TTCATGGTCA CTGGTCGATT
CCGAATGGCT TTCCGGCTTG GTTGCTAGCG CGGCAACGAA AATTACCACT GATTATCAGC
ACGCATGGCT CGGATGTTTC GGTGGCTGAG CGCACTGCCC CAACTGGCTG GATCAATACA
GCGATCATGC GCTATGCCTC GGCAATCACT GCGCCATCGA GCGATCTGAC GACGCGGGCA
GCGGCTTTGG GCGCAGAACC TGCTAAATTG CATGTACTGC CCTATTGTGT TGATGCCGTT
GATTTTCGGC CCGATCCCGC CGTTGGCGCG GCCTTTCGCC AACAACATGG CCTCGATACT
GCTACACCAT TGTTATTTAC GGTTGGGCGC ATGGTCGAGA AAAAAGGCTT TCGCTATTTG
GTGCAGGCCT TTGCTCAGGT GCTAGCCCAG CATCCAACCG CCAAATTGAT GATCGGTGGC
TATGGCCCAG GCCTAGAGCA ACTTATGGCT CAAGCCGCTG ATCTAGGGAT TGGCGAGGCC
GTGCTATTTC CCGGGGCAAT TGGTCACGAT CTCATCAATA GTGCCTTGAA TGCTGCTACA
ATCTTCATCC TGCCTTCGGT GCGCGATCGC AGTGGCAACG TCGATGGCTT GCCCAATACC
CTGCTCGAAG CCATGGGCGC GGGTCGGCCA ATTATCGCCA GCAAGATTGC CGGAGTGCCT
GGAGTAATTA CTTCGGGCGA ACATGGCTTG TTGGTAGCAC CTGCCCAGCC ACAAGCACTG
AGTGCCGCGA TCAACGATCT ACTCAATCAA CCAGAACGGG CTAGGCTGCT AGGTAAAGCG
GCGCGGTTAC GGGTTGAAAC CGAATTAACT TGGAACCGTT ATGCCGCGCG GCTTGAACAG
CTGTATACTG CGGCGATACA ATCGTCATAA
 
Protein sequence
MRILMIASSF PKYPGEMTAP FIEEIAAAVV ERGHEVHMLL PDHPELKRGD QVRGMQIHRY 
RYAPHPSLNV WGYAGALHND VQMRNAALLV APLAVASAWR TMAQLTAQQP FDLIHGHWSI
PNGFPAWLLA RQRKLPLIIS THGSDVSVAE RTAPTGWINT AIMRYASAIT APSSDLTTRA
AALGAEPAKL HVLPYCVDAV DFRPDPAVGA AFRQQHGLDT ATPLLFTVGR MVEKKGFRYL
VQAFAQVLAQ HPTAKLMIGG YGPGLEQLMA QAADLGIGEA VLFPGAIGHD LINSALNAAT
IFILPSVRDR SGNVDGLPNT LLEAMGAGRP IIASKIAGVP GVITSGEHGL LVAPAQPQAL
SAAINDLLNQ PERARLLGKA ARLRVETELT WNRYAARLEQ LYTAAIQSS