Gene Haur_3582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3582 
Symbol 
ID5735443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4503917 
End bp4505062 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content55% 
IMG OID641280731 
Productglycosyl transferase group 1 
Protein accessionYP_001546346 
Protein GI159900099 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACA CGTTACACGT AGCCATGATC ATCCAAGGCT ATTTTCCGCG CATCGGCGGC 
GCTGAACGCC AGCTTGCGGC TTTGGCTCCT TTTTTGGCCG CTGAAGGCGT GGCAATTAGC
GTGCTGACAC GACGGTATGC AGGTTTCAAG CCCTTCGAGA TAATCGACAA TGTGCCAGTC
CATCGCTTGC CAATTCCTGG GCCTGTGCCA ACTGCGGCCT TGGTTTTCAC CGCAGCAGCG
ATTCCCTTGC TTTGGCGTTT GAAGCCAAAT TTGGTCCATG CTCACGAAAT GTTTTCGCCT
GCCACCACGG CGATTGCGGC CAAACAAGCC TTGGGTCTGC CCTACGCCGT GACCGCCCAT
CGTTCAGGGC CACTGGGAGA TGTGCTGCGC ATGCAACAAC GTCCGTTTGG CAAAGGCCGG
CTGGAGCGCA TCACCAAAAC TGCCGATGCC TTTTTCACGA TCAGTCGCGA AATCGATCAA
GAATTTGAGC AGATTTTGGG CATTGAGGCC AAGCGGCGAC ATTATGTGCC CAACGGGGTT
GATCCTGAAA AATATTGCCC AATCGCTGCC GAAGCCAAAA CTGCGCTACG CCGCGAACTC
AATCTGCCAA CCGAAGGCAC AATTACGCTG TATGCTGGGC GGCTCTCTGA GGAAAAACGG
GTGCGCTATT TGGTTGAGGC CTGGCCAGCA ATTCGTGCCA AACATCCCGA TGCGACCTTA
TTGATTTTGG GACAGGGGCC AGAAGAAGCC AATCTCAAGG CCAAAACCAG CGATGGAGTG
ATTTTTGGCG GGGCGGTGCA TAATGTGCCA CCCTATTTGC AAGCCGCCGA TGTGTTCGTG
CTGCCATCGA TTGCCGAGGG TTTTTCGGTG GCAATGCTCG AAGCCATGGC CAGCGAATTA
GCCGTCGTGA TCACCGATGT TGGCGGTGCA CGCGATGCGA TTGACGATAC TGTGCATGGC
TTGGTGATTC CGCCCGACGA TCAGCCAGCG CTTGAACAAG CTTTGTTGGC AGTTTTGGGC
GATCAAGCCT CGCGCCAACG CATGGGCCAA GCCGCTCGCC AACGGGTGCA ACAAGAGTTT
GCCCTGAGCG TGATTGCCAA GCAATTGCGC GGATTGTATG AAAATATAGC CACGTTAAGA
ACATAG
 
Protein sequence
MNDTLHVAMI IQGYFPRIGG AERQLAALAP FLAAEGVAIS VLTRRYAGFK PFEIIDNVPV 
HRLPIPGPVP TAALVFTAAA IPLLWRLKPN LVHAHEMFSP ATTAIAAKQA LGLPYAVTAH
RSGPLGDVLR MQQRPFGKGR LERITKTADA FFTISREIDQ EFEQILGIEA KRRHYVPNGV
DPEKYCPIAA EAKTALRREL NLPTEGTITL YAGRLSEEKR VRYLVEAWPA IRAKHPDATL
LILGQGPEEA NLKAKTSDGV IFGGAVHNVP PYLQAADVFV LPSIAEGFSV AMLEAMASEL
AVVITDVGGA RDAIDDTVHG LVIPPDDQPA LEQALLAVLG DQASRQRMGQ AARQRVQQEF
ALSVIAKQLR GLYENIATLR T