Gene Haur_3357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3357 
Symbol 
ID5735227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4235142 
End bp4236308 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content49% 
IMG OID641280504 
Productglycosyl transferase group 1 
Protein accessionYP_001546121 
Protein GI159899874 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.656502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAAAA TAACTGTCGC TCAAGTGATT ACCGGCTTTG CGAGTGCCGA AGGTGCTGGC 
GGCTCAGCCT TATTTGGCAT CGAAGTAGCA CGAGCTTTAG ATAAAAGCCG TTTTCGGCCA
ATTTTGTGTG GAATTCATCG GTTTAATGCA CCTTCGGAGC AGCGTTGGCT CAAAACCTTG
GCCGATGAGG GCATTGAAAC CAGAATTATG GTGCAAGAAC GCAGCAAATT GCGCTACGAT
ATGGTGCGCT TCAGTGCGTT GCTCAATCAA CTGATTCAAG CACAAGCCGT TGATATCATT
CACACCCATG TTGAGCGAGT TGAATTTTTC ATTAGTTTGC AAAAATTACT CCACCCCAGC
CACTATCCCA AACTTGTCCG CACCATTCAT GTCAATGCCA TGTGGGTTAC GCGGCCATTA
GTACGACGCT TGATGAACAT TGTCTACACC CAACTATTTG GCGAGGAAAT CGCAATTTCC
CAAGCCACCA AAACCATGCT TGATCAACGC ATGGCAGCCA AGGTCTTTGG GCGCTCGGCC
AGCTTAATTC AAAATAGCCT ACCGCTCGCA CGCCTGCAAA AATTCGATCT ACCCAAACAG
CACCAGCGAT TTAGCCCACC CCGTTTTTTA GTGATTGGCC GGCTAGAAAT CCAAAAAGCT
CAAGATATTT TTATTCAAGC GGCGGCGTTG GTGTTGCAAC AATACCCTGA AGCCGAGTTT
TGGTTGGCAG GCGAAGGCAC CCAAGAGGCC AATTTTCGCC AATTGACGGC CAATTTAGCG
ATTGAGCATG CAGTTAAATT CCTTGGGCCA CGCGGTGATA TTCCCGAAGT GTTGAGCCAA
GTCGATGTGC TGGTCTCAAC CTCACGCTGG GAAGGCTTTG CAACGGTAAT TTTAGAGGCA
ATGGCAGCAC GCACGCCAGT GATTGCTACC GATATTGGCG GCAATAACGA ACAAATCGTT
GATGGCGAAA ATGGGCGTTT GGTCGCAAGC GAAAATCCTA GCGCAGTCGC CGATGCCATG
ATCTGGATGC TTGAACATCC TCAAGCAACT GCGCTGATGG CACAGCGCGG CTACGAATGG
GGGCAGCAGT TTACGATGGA ACGCACTGCT GCCCAGTATG GCGAACTGTA CGAGCGTTTG
CTTAGGGAGC AAAAATATCG ACCTTAA
 
Protein sequence
MRKITVAQVI TGFASAEGAG GSALFGIEVA RALDKSRFRP ILCGIHRFNA PSEQRWLKTL 
ADEGIETRIM VQERSKLRYD MVRFSALLNQ LIQAQAVDII HTHVERVEFF ISLQKLLHPS
HYPKLVRTIH VNAMWVTRPL VRRLMNIVYT QLFGEEIAIS QATKTMLDQR MAAKVFGRSA
SLIQNSLPLA RLQKFDLPKQ HQRFSPPRFL VIGRLEIQKA QDIFIQAAAL VLQQYPEAEF
WLAGEGTQEA NFRQLTANLA IEHAVKFLGP RGDIPEVLSQ VDVLVSTSRW EGFATVILEA
MAARTPVIAT DIGGNNEQIV DGENGRLVAS ENPSAVADAM IWMLEHPQAT ALMAQRGYEW
GQQFTMERTA AQYGELYERL LREQKYRP