Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4512 |
Symbol | |
ID | 5736363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5777818 |
End bp | 5778945 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281675 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001547272 |
Protein GI | 159901025 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.115065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATTATTG AAGCCTTGGT TGCTATGAAA ATTTTGATTA TTTCGAAAGC CCTGACCGCC GCCACCTACC ACCGCAAGCT TGAATTGATT GCGGCAGAGC CAGATGTTGA ATTGACGGCG ATTGTGCCGC CGATGTGGTT TGAGCCTGGC GTTGGCGAGT ATCCGCTGGA AGTGCAAACG CCGCGCAATT ATCGCATGCA TGTTGTACCA CTGGGTCACA ACGGTCATCA TCATACGTTT TGGTGGCAGG GTTTGGGCAA ATTAATCGCC GCTGAGCAAC CAGGTATTCT GCATGCCGAT GAAGAAGCCT TTAATTTGGC AACATTTCAG GCTTTTTGGC ATGCCCGCAA AACCAACGCC AAACTCTGTT TTTATAATTG GGCTGATGTA GCGCGACGTT ATCCACCGCC ATTTAGTTTC TTCGAGCGCT ACAGCTATCG CCATGCAGCC CATGCAATTG CTGGCAACCA TTTGGCTAAG CAACTCATTC GCGATCACGG CTACCCTGGC CCAATCAGCG TCATTCCACA ATTTGGGGTT GATGAGGCGA TTTTTAGGCC TGCGCCTGAG CCATTGCCTG CAAAACCGTT TGTCGTGGGC TTTTTTGGGC GCTTGATGCG CTCGAAAGGC GTGCTCGATT TGTTGGCAGC CCTCGAACGC TTGCCCAGCG ATATTCATTG TCGATTAATT GGCAAAGGCG ATTTGAGCAG CGAAGTTGAA CAACGCATTG CCAAAGCACC GCTGGCTGGG CGGGTCACGC TTGAGCCGTT GATTCCATCC AGCGCCATGC CCGACGCTAT GCGCAGCGTC CATGCCTATG TGCTGCCCTC GCGTACCACC CCCAATTGGA AAGAGCAATT TGGGCGGGTC TTGATTGAGG CCATGGCCTG CGGCGTGCCA GTGATCGGCT CCAATTCAGG TGAAATTCCG CATGTAATTG ATACAGCAGG CTTAGTGTTT CCCGAAGGCG ATGTTGCGGC ACTGGCCGAG GCAATTCACA AACTCTATCT GGATGAAAAG TATCGCCAAA ATATCGCTGA AGCTGGTCGC CAACGAGCGT TGAGCCATTA CACCCAAGCT AGCATTGCCC AGCAACATTT AGCAGTTTAT CGCTCGATGG TTGAGTAA
|
Protein sequence | MIIEALVAMK ILIISKALTA ATYHRKLELI AAEPDVELTA IVPPMWFEPG VGEYPLEVQT PRNYRMHVVP LGHNGHHHTF WWQGLGKLIA AEQPGILHAD EEAFNLATFQ AFWHARKTNA KLCFYNWADV ARRYPPPFSF FERYSYRHAA HAIAGNHLAK QLIRDHGYPG PISVIPQFGV DEAIFRPAPE PLPAKPFVVG FFGRLMRSKG VLDLLAALER LPSDIHCRLI GKGDLSSEVE QRIAKAPLAG RVTLEPLIPS SAMPDAMRSV HAYVLPSRTT PNWKEQFGRV LIEAMACGVP VIGSNSGEIP HVIDTAGLVF PEGDVAALAE AIHKLYLDEK YRQNIAEAGR QRALSHYTQA SIAQQHLAVY RSMVE
|
| |