Gene Information |
Locus tag | Haur_5094 |
Symbol | |
ID | 5737052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 121325 |
End bp | 122473 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641282259 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001547850 |
Protein GI | 159901604 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
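The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the convention the listed gene length implies) and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Haur_5094.
length = gene_length_bp(121325, 122473)
print(length)                     # 1149
print(protein_length_aa(length))  # 382
```

Both results match the Gene Length and Protein Length fields of the record.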
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAATTG CGCTTGTGTT ATCAACACCC TTACCGGCCT GTGAAGGTAT TGGGTTTTAT GTTTGGAATC TTGGACGCTT TTTGACCCAC CATGGGCATG AAGTGCATAT CATTACGCGT GGTGAACCAA CGAAACCGGC CTATGAACAG GTGCAGGCGA TCCATATCTG GCGGCCAGCA TTTTGGCGAA TCTATCCCTT CCATGTCGAT ATGCATGGCT ATTTTGTGAC CCAAACCCTC GAAACGATTG CGAATACCTA TGGACTTGAT TTAATTCATG TTCATACCCC CCTCGTTAAA ATTCCGAAGA GTGCCTATCC GGTCGTCGTT ACTGTGCATA CGCCAATGAA GACCGACACT GCCGCCATTC CATTGCGATC GGTGTTTGAT ATGCTGATTA AGCTGCAAAC GCCATTTAGC ATTCGTTTGG AAAAACGGCT TTTCCGACAA GCCACAACCA TCACAACTGT GGCGACGAGT GTTGCCTCTG AATTAGGGGC CTATGGGTTG CAACCACATC AGGTAGCCGT AGTTGGGAAT GGTGTCGATA CGGCGACCTT CTATCCGCCC GTTGATCTGC AAGCACGATT TCACCAGCGC TACTTTTTAA CCGTTGGGCG ACTCGCACCG CGAAAAGGAT TAGAAGATTT AATTGCGAGT GCAGCAGAAG TCGTTAAACG CTATCCTACC TATCGCTTTT TCATTGTCGG CCAAGGGCCG CTCGCCGCAG TCCTACAAAA ACAAATTACC CAGCTTCACC TTGATCAGCA TGTGCAATTA CTCGGTCATA TGGCGGATCG AGAACAGCTT GCCGATCTGT ATCGTGGGGC ATGGGCCTAT ATCCACCCTG CCCATTATGA AGGGTTACCG ACGGCGTTGT TAGAGGCGAT GGCATGTGGC TGTCCCGTGG TGGCAACGGC GGTGAGTGGT GCCCTTGATG TGATTACACC GCACAATGGG GTATTGGTGA ACCCTCATGC CCCAGTGCAA TTAACACAGG CAGTATGTCG CTTCATTGAA CAGCCACAGG TCGCACGGGA TCTCGGCCAG CAAGCAGCCT TGACGATCCA ACAGCAGTAT GGGTGGACTG CGATAGGCCA ACGCTATCTT GCGACCTATC ACCATGCTAT CCAAGGAGCA ACTGCATGA
|
Protein sequence | MRIALVLSTP LPACEGIGFY VWNLGRFLTH HGHEVHIITR GEPTKPAYEQ VQAIHIWRPA FWRIYPFHVD MHGYFVTQTL ETIANTYGLD LIHVHTPLVK IPKSAYPVVV TVHTPMKTDT AAIPLRSVFD MLIKLQTPFS IRLEKRLFRQ ATTITTVATS VASELGAYGL QPHQVAVVGN GVDTATFYPP VDLQARFHQR YFLTVGRLAP RKGLEDLIAS AAEVVKRYPT YRFFIVGQGP LAAVLQKQIT QLHLDQHVQL LGHMADREQL ADLYRGAWAY IHPAHYEGLP TALLEAMACG CPVVATAVSG ALDVITPHNG VLVNPHAPVQ LTQAVCRFIE QPQVARDLGQ QAALTIQQQY GWTAIGQRYL ATYHHAIQGA TA
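The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a string might be cleaned and its GC fraction computed follows; the helper names are hypothetical, and the sample is only the first two blocks of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove whitespace from a sequence shown in space-separated blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence in this record, used as a short sample.
sample = clean_sequence("ATGCGAATTG CGCTTGTGTT")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.45
```

Applied to the full gene sequence, the same function should give a value comparable to the GC content field of the record; that comparison is not carried out here.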