Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0821 |
Symbol | |
ID | 5732722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 929042 |
End bp | 930097 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277953 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001543597 |
Protein GI | 159897350 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.534399 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATCA CGATCTATCC GATTCCGCGT TTACTGGCAA CCAACCCATA TCTTGATTTG TTGTATGCGC CATTTGCCGC CTGGCCTGAT GTGCAGATTG CACGTCAGCC GTTTGCCCAA ACCTTGCGTC ATGGCTTGTT GCATGGCGGT CAGCGGGTGT TGCATTGGCA TTTTTTCGAT GAACTAACCC AGCAGCCTAA CCAAGCGGCA ACGGCATTGC GCAGCATAAG CTTTATTGCG CTATTGCGTT TGCTGGCTTT GCGTGAGGCA AAATTGGTTT GGACGGCTCA CAACTTGGAG CCGCATGAAT TGCGTTACCC TCAATGGGCG CAGCGTTGTT ATCAGGCGAT GGCGCAGGCG GCTGGGCAAA TTATTGTACA TTCGCAGCCC GCTGCCCAAC TGCTCGATCA ACGTTATCAG GTTGCTACAA AAACCCAAGT AATTGCCCAT GGCTCGTATA TTGGTGTCTA TGGCGAAGCT TGGGAGCAAG CGGCAGCCCG TAAGCATTTG CAACTAACGC CCGAAGGCTT TGTTGCGCTC AATCTTGGCA CATTGCGACC CTACAAAGGT GTGGAATTGT TGCTTGAAGC GTGGTCAGCC GAGCTTGGCC GTTTGCTGAT CGTGGGCGCG GCCAAAGATC AACGCTATGC TGAGCAATTG CAACAACAGG CAACCAACCC CAGCATCACG CTACGGCCCC AGTTTATTGC CGATTCTCTC TTGCCTGCTT GGTTTGCCGC CGCCGATGTG GTAGTGTTGC CCTATCGCAA AACCTTGACC TCAGGCATGT TGCTGGCAGC ACTTTCGTAT GCAGCACCTG TCGTTGCGCC TGATTTGCCG CCAGTCCGCG AATTGATTCG CGATGGAGAG AATGGCTTTT TGTTTGAGGT CAATAATGCG GCTAGTCTAA GGGCCGCGTT GCAACGAGCC GCTGCTCATC CCGACCGACA AGCTTTGCGC CAAAATGCGC TCCAAACAGT TCAAGCGCTT GATTGGGGCC AGATTGCCAA CCAAACAGCG GCAGTCTATC GCCGTTTATT TGAGAAACCC GCATGA
|
Protein sequence | MSITIYPIPR LLATNPYLDL LYAPFAAWPD VQIARQPFAQ TLRHGLLHGG QRVLHWHFFD ELTQQPNQAA TALRSISFIA LLRLLALREA KLVWTAHNLE PHELRYPQWA QRCYQAMAQA AGQIIVHSQP AAQLLDQRYQ VATKTQVIAH GSYIGVYGEA WEQAAARKHL QLTPEGFVAL NLGTLRPYKG VELLLEAWSA ELGRLLIVGA AKDQRYAEQL QQQATNPSIT LRPQFIADSL LPAWFAAADV VVLPYRKTLT SGMLLAALSY AAPVVAPDLP PVRELIRDGE NGFLFEVNNA ASLRAALQRA AAHPDRQALR QNALQTVQAL DWGQIANQTA AVYRRLFEKP A
|
| |