Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3581 |
Symbol | |
ID | 5735442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4502784 |
End bp | 4503920 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280730 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001546345 |
Protein GI | 159900098 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGTTG TTCAGGTGAT CGATTCGCTC TATCGTGGTG GAGCACAACA ATTGCTGGTA ACCTTCGCGA TCGAAGCCCA ACGCCGTGGG ATCAAAACCA GCGTTGTTTG TCTCAAGGAT GAAGATCGCG GCAGCAACTT GGTTGAACGG CTGAATGGTT TGGGCGTTGA AGTGCTGCGT TTGGCTGCGC CCAAAATGCT GGCCCCTAAA CGAATCTGGC AATTAACCCG TTGGCTGCGG CGCAATCAGG TCAGTGTGGT GCATACCCAT TTGACCTATG GCAATGTCGT AGGTATTTTG GCAGCCCGTT TGGCTAATAT TCCTGTGGTG GCGACGATGC ACTTAGCGGG CTTCGATCCA TCAATTGCCA ATCGCCAACA GCAGTTTGAG GCCCAAGTGG TACAGCGTTT GGCGCAGCAA ATTATCGCTG TTGGCTATAC CACTCGCGAT GCCTACCAGC CAATTATGCC CAATCGCCAG CTGCATGTGG TGCATAATGC CGTGGTAGCC GTGCCAGAAA TTAGCCCTGA GCAACGCCAA ACCACGCGCG AAGCAGTGTT GGGCGACCCA AATTTGACCA TGTTGATCAA CGTTGGGCGT TTTGCGGCAA TCAAAGATCT GCCAACCTTG ATCGATGCCT TTGCCTTGGT GCATGCTCAA CATCCCCAGG CGCGGCTCGT TTTGGCGGGC GAAGGCGATC AACGACCCAA AATCGAAGCC AAAATTAACG CACACCAATT GCCTGCAGTG GTCAATTTGC TTGGCGCACG CGATGATATT CCAGTGCTAT TGCGTAGCGC CGATTTGTTT GTCAATTCGT CAGCCAACGA AGGATTGCCG ATCGCCGTGC TCGAAGCCAT GGCCGCAGGC TTGCCGATTA TTGCCACCAA AGTTGGCGAC GTGCCGCATG TGGTTCGCGA ACAAGCGGGC ATTACGGTTG CGCCGCATGA TCATCAAGCC TTAGCTGCTG CAATTAATCA AGTTTTGAGC GAGCCAAGCC AGATCCAGGC GATGCAACAG GCTGCTCAAC AAATTATCGA GCAATACCAT AGCCCTAGCG CGTGGGTTGA TCAATTATTA AGTTTGTATA CCGCCGCCCA CGAGGGCGTT GACCAGCGCG AGGCTGTCTC GGCATGA
|
Protein sequence | MHVVQVIDSL YRGGAQQLLV TFAIEAQRRG IKTSVVCLKD EDRGSNLVER LNGLGVEVLR LAAPKMLAPK RIWQLTRWLR RNQVSVVHTH LTYGNVVGIL AARLANIPVV ATMHLAGFDP SIANRQQQFE AQVVQRLAQQ IIAVGYTTRD AYQPIMPNRQ LHVVHNAVVA VPEISPEQRQ TTREAVLGDP NLTMLINVGR FAAIKDLPTL IDAFALVHAQ HPQARLVLAG EGDQRPKIEA KINAHQLPAV VNLLGARDDI PVLLRSADLF VNSSANEGLP IAVLEAMAAG LPIIATKVGD VPHVVREQAG ITVAPHDHQA LAAAINQVLS EPSQIQAMQQ AAQQIIEQYH SPSAWVDQLL SLYTAAHEGV DQREAVSA
|
| |