Gene Information |
Locus tag | Haur_3353 |
Symbol | |
ID | 5735223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4230172 |
End bp | 4231317 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280500 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001546117 |
Protein GI | 159899870 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
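
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon; the record does not state either convention explicitly, and the function names `gene_length` and `protein_length` are illustrative, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Amino-acid count for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and, when has_stop is True,
    that the last codon is a stop codon that is not translated.
    """
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(4230172, 4231317)
print(length)                  # 1146
print(protein_length(length))  # 381
```

Under these assumptions the results agree with the Gene Length (1146 bp) and Protein Length (381 aa) fields.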
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.207762 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTAT GTGTGGTTGG ACCAACCTAT CCCTACCGTG GGGGGATTGC TCATTACACC ACGCTGTTGG TCAAACATTT GCGCGAAGTT GGCCATCATG TACGATTTTA TTCGTATACC CGCCAATATC CGCGCTGGCT TTTCCCTGGT AAAACCGATA AAGACCCCAG TGCTACGCCG TTGCGGGTCG AATGCGAATA TGTGCTTGAC CCTACCAACC CAATTACCTG GTGGCGCTTG TGCCGCAAAA TTCGCGCCGA TAATCCAGAT TTGGTGGTAT TGCAATGGTG GGTTCCCTAC TGGACACCTT CGCTCAGCTA TATTTCGCGC TGGCTGAAAA AACACACCAA AGCCAAAATT GTCTATATTT GCCACAATGT CATGCCCCAC GATGGCGGTG GCTTTTTGGA TCGGCGCATG GCTTCAACGG TGCTCAAACA GGGCGATGCC TTGATTGTGC ATAGCGACCA AGATTTGCAT CGTGCCCAAG CATTGTTGCC GCAAGCAGCT GTGCTTAAAT CGCAACTGCC AACCTTTGAA GAAGTTGCCA AGCATACCGA TTCGGCGGCA ATTGAGCGCT TGCGTGGCCA ACTTGGCATC TCCAGCGATC ACGATATTTT GCTATTTTTT GGCTTTATAC GGCCTTACAA AGGCTTAGAA TATTTGATTC AAGCCTTGCC GTTGGTGGTG CAAGAGCGGC CTGTGCATTT GCTGGTGGTT GGCGAGTTTT GGGCTTCGCC AGAGTTTTAT CAGCGCTATA CCCGCGAATA TGGGGTTGAA GCCAATGTCA CCTTTGTCAA TCGCTATGTG CCCAACGAAG AGCTTGGCCC CTATTTCGAT TTAGCCGATG TGGTCGTGCT ACCGTACATT TCGGCGACCC AAAGCGCGGT CGTGCAATTG GCCTTTGGGC TAGGCAAGCC GGTCATCACC ACGCGGGTTG GCGGTTTGCA CGAAGTTGTG CGCGATGGCG TGAATGGCTT AGTCGTGCCG CCACAGGATG AAGTTGCCCT AGCCAAAGCG ATTCTGCGCT ATTTTCAGGC TGAATTAAAA GCCCCGATGA CTGCCGCCGT CCACGCTGAA CGCGGCCAAC AATTGCATGG CTGGGAACAT CTGATCAATT GCCTTGAACG AATTGGGGCC AAATAA
|
Protein sequence | MKLCVVGPTY PYRGGIAHYT TLLVKHLREV GHHVRFYSYT RQYPRWLFPG KTDKDPSATP LRVECEYVLD PTNPITWWRL CRKIRADNPD LVVLQWWVPY WTPSLSYISR WLKKHTKAKI VYICHNVMPH DGGGFLDRRM ASTVLKQGDA LIVHSDQDLH RAQALLPQAA VLKSQLPTFE EVAKHTDSAA IERLRGQLGI SSDHDILLFF GFIRPYKGLE YLIQALPLVV QERPVHLLVV GEFWASPEFY QRYTREYGVE ANVTFVNRYV PNEELGPYFD LADVVVLPYI SATQSAVVQL AFGLGKPVIT TRVGGLHEVV RDGVNGLVVP PQDEVALAKA ILRYFQAELK APMTAAVHAE RGQQLHGWEH LINCLERIGA K
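
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: `clean`, `translate`, and `gc_content` are hypothetical helper names, and the codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons (table 11 differs in the start codons it permits). The helpers strip the whitespace used above to display the sequence in 10-base groups.

```python
from itertools import product

# Standard codon assignment in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAACTAT GTGTGGTTGG ACCAACCTAT"))  # MKLCVVGPTY
```

Passing the full Gene sequence field to `translate` and `gc_content` would allow a comparison against the Protein sequence and GC content fields.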