Gene Haur_4282 details


Gene Information

Locus tag: Haur_4282
Symbol:
ID: 5736141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5468219
End bp: 5469436
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 54%
IMG OID: 641281442
Product: glycosyl transferase group 1
Protein accession: YP_001547042
Protein GI: 159900795
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.870736
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAATTG CTTATATTGC TTATCCAACC AGTTTGATGC TGGCTTCGGC CAATGCGATT 
CAAACCTGGA CAACCCTACG CGAATTGCGC CAACAAGCGC CCAACACCTT GATTATTATT
CCGCGCTGGT TGCGTGAACC AAGCCGATTT AAAGAGGTCG GCGCAACACA CTTGCAACGC
CCAGCAATTG GCAAGCTCTC GCGTTTTAAA AAATCGACTT TGTGGTATTA CGCTGAACGT
AGCGTTTTTG CCGCCATGAG TGCTGCTGTC GTAGCCAGCC AACGTTGGCG CGGCGAGGCC
GTCGATGTGG TCTATGTGCG CGAGGTGATT GCCGCTGGTT GGTGGGCCAC GCTTTGGGGG
CCGTTGCTCA ACATTCCGGT GATCTACGAG GCCCATGATC TGGAAAGCTG GAATCCGTCA
CGCGCCAAAG AATCATGGGT GCAGCCCGTG CTCAATTTGC TTGATCGCTT GACGCTTGGG
CGGAGCGCTG CTGTAGCTTC ATTGACCGAT GATTTTCGCC AACTCTTGGC ACGTTTGGGC
TGGCGTAAAC CCAGCGATGT GGCCGTAATC CCCGATGCGT TTGATGATTC GCTGTATCAG
CCCCACGATC GCCAACAGGC GCGGGCGCAG CTTGGGCTTG ATCCAACTGC ACCATTAATT
GTCTATGCTG GAATGACCTT CTCCTATCGT GGGATTGATC GCTTGATTGC TGCTTTTGCG
AGCCTACGCC AAGCGATGCC AAACGCTCAA TTATTGTTCA TCGGCGGGCG GCCAGCTGAA
ATTGCCCAAT TTAGCCAGCA GGCCAACCAT TTGGGGCTTG GCGAGAGCGT GCGTTTTCTG
GGAGCCTTGC CGCAAAGCGC CACGCCAGCC TATTTACATG CTGCCGATGT TTTGGTCATT
CCCGATACAG TCACCGACGT AACCGCCTCG CCGCTCAAAT TATTTGAATA TTTGGCGGTT
GAGCGGGCGG TCGTTTTGCC GAATATTCCA GCCTTGCGCG AAATTTTGCC CGAACAGATC
GGCTATTATT TTGAGCGTGG CAGCATCCAA GGCTTAGAGC AAGCTCTCGT CGATGCCTTA
ACCGATCCGC TGCGCCCTGA GCGTGAGCAG GCTGGCCGCC AATGTGTGCA AGAGCATACC
TATCGCGCCC GCGCTGGCAG GATCAAGGCC TTGTGCCAAC AAATTAGCCA AACAACCAGT
AGTAGTGCAT TAGATTAA
 
Protein sequence
MRIAYIAYPT SLMLASANAI QTWTTLRELR QQAPNTLIII PRWLREPSRF KEVGATHLQR 
PAIGKLSRFK KSTLWYYAER SVFAAMSAAV VASQRWRGEA VDVVYVREVI AAGWWATLWG
PLLNIPVIYE AHDLESWNPS RAKESWVQPV LNLLDRLTLG RSAAVASLTD DFRQLLARLG
WRKPSDVAVI PDAFDDSLYQ PHDRQQARAQ LGLDPTAPLI VYAGMTFSYR GIDRLIAAFA
SLRQAMPNAQ LLFIGGRPAE IAQFSQQANH LGLGESVRFL GALPQSATPA YLHAADVLVI
PDTVTDVTAS PLKLFEYLAV ERAVVLPNIP ALREILPEQI GYYFERGSIQ GLEQALVDAL
TDPLRPEREQ AGRQCVQEHT YRARAGRIKA LCQQISQTTS SSALD
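
The length fields in this record can be cross-checked against one another. The sketch below is a minimal illustration, not part of the record. It assumes that the Start bp and End bp coordinates are 1-based and inclusive, that the gene length includes the stop codon, and that the protein length excludes it. The function names are hypothetical. The GC helper accepts a sequence in the space-separated 10-base block layout used above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the gene length
    includes one stop codon that is not counted in the protein."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean_sequence(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# Values taken from the record above.
length_bp = gene_length(5468219, 5469436)   # 1218
length_aa = protein_length(length_bp)       # 405
```

Under these assumptions the coordinates give 1218 bp and 405 aa, matching the Gene Length and Protein Length fields. Passing the full gene sequence to gc_fraction would give a value to compare with the reported GC content.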