Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3898 |
Symbol | |
ID | 5735759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4889680 |
End bp | 4890981 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281049 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001546660 |
Protein GI | 159900413 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATAA CCCCAGTCGC CTCAATCGAT CACCAATTAC GTGTCGCCAT GCTTTGTCGG GCGGTTTTTC CCTTGCATGG CTTTGGCGGC ATCGAGCGCC ATGTCTTTCA TTTGGTGACC CATCTCAGCG ATTTGGGAGT TAAGCTCGAT CTTTGGACGC AGACAATTCC CCACGATGTG CCAACTGCTG GCGAGGCCTA TGCTCGTTTG TGTCAAAACC CGTTGATTGA ACTGCATGAA ACCCGTTATG ATCGCACCAG CCCATGGTTG CGACCGAATA GCATTATCGG GCGACAGTTT AATTATCCGA TTTTTACTTG GCAACAAGCC AGCGCTGTGG CCAAAACTGC TCAGCAAGGC CAGATTGATA TTGTGCATAC TCAAGGTTTA TGTGCATTGG GCTGGGGCTT GGTACGTCAA CAGCAACCCA GTTTGCGGCG GATTCCTCAA TTGGCCAACC CCCATGGCAT GGAAGAATAT AAGAATGTTG ATTGGCGCAA GCAACTGGCC TATGCCCCGT TTCGTGCCCA ATATTCGTGG AGTCATCGCC AAGCTGATTG TGCAATTGCC ACCGATGCCT GTACCGCCGA CGATCTACCA AATTTGTTGG GCGTTGATCC GACGCGGGTT GCGGTGTTGC CCTCGGCGAT TGATGTGGCT GAGGCCTTGG GCCAGGTTGA TGAGCAATTG GGCAACGAGT TGGTGCAGCG CTTGCAGCTC GCCGACCACG ATCTGGTTTT TTTGACCGTC AGCCGTTTGG AGCGCAACAA GGGCTATCAT CTGCTCTTGG CGGCCTTGGC CGAATTGCGC GATCTGCTGC CTGCAAGCTG GCGTTTGTTG ATGGTTGGCA CTGGCAAAGA GCAAGCAGCG CTTGAACAGC AAGCCCAAAG TCTAGGCTTG GCGCAACATG TCAGCCTGCT TGGTCGTCTG AGTGATCGTG AATTGCATTC ACTGTATGAA CATGTTGATT TGTTCATTCA TCCAACCTTG TATGAAGGTT CGTCGTTGGT CACACTCGAA GCCATGATTC ATCGCTTGCC AGTTGTGGCA ACTGCGGCTG GTGGCATTCC CGATAAAGTT ATCAGCGGCC ATAATGGCTT GCTTGTGCCA GCCAACAATC AGCGGGCCTT GGTCAATGCG CTGCGGTTAG CCCTCGATTT GCGCGAATAT TGGCCGCAAT GGGGTGCTGC TGGCGCAGCG ATTGTACGGC GCAGCTTCGA TTGGCCCGTT GTGGCGCGAC AAACCCTCGC CACCTACCGC GAACTATTGC AATCTCGCTC TTTGCGTGGA GGATTCCAAT GA
|
Protein sequence | MAITPVASID HQLRVAMLCR AVFPLHGFGG IERHVFHLVT HLSDLGVKLD LWTQTIPHDV PTAGEAYARL CQNPLIELHE TRYDRTSPWL RPNSIIGRQF NYPIFTWQQA SAVAKTAQQG QIDIVHTQGL CALGWGLVRQ QQPSLRRIPQ LANPHGMEEY KNVDWRKQLA YAPFRAQYSW SHRQADCAIA TDACTADDLP NLLGVDPTRV AVLPSAIDVA EALGQVDEQL GNELVQRLQL ADHDLVFLTV SRLERNKGYH LLLAALAELR DLLPASWRLL MVGTGKEQAA LEQQAQSLGL AQHVSLLGRL SDRELHSLYE HVDLFIHPTL YEGSSLVTLE AMIHRLPVVA TAAGGIPDKV ISGHNGLLVP ANNQRALVNA LRLALDLREY WPQWGAAGAA IVRRSFDWPV VARQTLATYR ELLQSRSLRG GFQ
|
| |