Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5098 |
Symbol | |
ID | 5737056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 124481 |
End bp | 125491 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641282263 |
Product | glycosyl transferase family protein |
Protein accession | YP_001547854 |
Protein GI | 159901608 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTGTC ACAATGCAAT TGTTCATCCC ACCGTCGCAC TTATTCCAGC TTTTAATGAA TCGCGATTCA TTGGGAGCTT GGTGCTTGCC GCAAAAGCGT ATGTTGATAT TGTGCTCGTG GTCGATGATG GCTCTACCGA TCATACGGTT GCTATTGCTC AAAAGGCTGG TGCATGTGTT TTACAGCATG CGGTAAATCA GGGGAAAGCC GCAGCGGTTA ATACGGGCTT TCGGTATATT GCAACGCTTA ACCCCTTTGC TGTCGTGATG CTTGATGGGG ATGGCCAACA CAAAGTTGAT GATATTCCAG CACTTTTAGC CCCTATCTGC CAAGGGCATG CCGATGTCGT GATTGGATCG CGCTATGGTG CGATTCACAG CGATATCCCC CTCTATCGAA AGGTTGGGCA GATGGGATTA ACCTCCTTAA CCAATTTGAT ATCAGGGGTG CAGGTTAGTG ACTCACAAAG TGGATTTCGC GCCTTTTCAG CCCATGCCAT CGCCGTGATG TCATTTACGG CGAATGGCGG ATTCTCAATC GAATCGGAAA TGCAGTTTCA TATTCATGAA CAGGCATTAC GGATCTGTGA GGTTCCCATT CATGTGTTGT ATGTGGAAAA AGCCAAGCGA AACCCCATTG GCCATGGCAT GCAAGTGGTG AAAGGTATTT TGGGCATCGC GACGACAATG CGTCCACTGC TCTTTTGGTG TGGCAGCGGC TTCGCGACGT TAATGATAAG CACCGCGCTG CTGGTCTTCT TGGCGGCCCA TACAACGATG GCATTGTCAC AGTTTGCCTG GCTGCTGAGC CTGCTCATGA TTGGGATGTT GTTGAGTATT GGATCGATTG GAACTGGGAT TATCTTACAG CGCCAGCGCG TTATGTTACA ACGAATGGAA ACGTCCTTAA AACAACAATT GATGCGTGCG CCCTCAGCGG CCTCCAACGA AACACTCTTC CTGACACCAC GGGAGCGGGT TTATGACGGG GTGAATCAAC CGCTTAATTA A
|
Protein sequence | MDCHNAIVHP TVALIPAFNE SRFIGSLVLA AKAYVDIVLV VDDGSTDHTV AIAQKAGACV LQHAVNQGKA AAVNTGFRYI ATLNPFAVVM LDGDGQHKVD DIPALLAPIC QGHADVVIGS RYGAIHSDIP LYRKVGQMGL TSLTNLISGV QVSDSQSGFR AFSAHAIAVM SFTANGGFSI ESEMQFHIHE QALRICEVPI HVLYVEKAKR NPIGHGMQVV KGILGIATTM RPLLFWCGSG FATLMISTAL LVFLAAHTTM ALSQFAWLLS LLMIGMLLSI GSIGTGIILQ RQRVMLQRME TSLKQQLMRA PSAASNETLF LTPRERVYDG VNQPLN
|
| |