Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3369 |
Symbol | |
ID | 5735230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4249064 |
End bp | 4250113 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641280516 |
Product | glycosyl transferase family protein |
Protein accession | YP_001546133 |
Protein GI | 159899886 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000732835 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCACGA TGCCAATGAG TGAAACCTAT ATTCAACCAA GCGAGCCAAC TCGTAGCCAT CCATGGCGGG TGATTATTCA GATTCCCTGT TTGAATGAAG AAGCAACCCT GCCCGATGTG TTGAATGATA TTCCCTTGGA TTTACCGAAT GTGCATTTTG AAACCTTGGT AATCGACGAT GGCTCAACTG ATCGCACGGT TGAGGTCGCT CGTGAACTAG GGGTTAACTA TATTGTACGC CATCGTGGAC GCAAAGGCCT ACCAGCCGCC TTTCAATCGG GCATGGATGC CGCATTAAAA CTTGGCGCTG ATATTATTGT CAACACCGAT GGCGATCATC AATACCCAGG CTCGGCGATT CCTGAGTTGA TCAAGCCAAT TTTAGAGGGC AAGGCCGATA TTGTGATTGG CGATCGCCAA ACCCAAAATG TTGAGCATTT CTCAACCCAA AAGAAGCTCT TGCAACGGGT TGGCAGTTGG GTTGTGCGCG TTGCTTCCGA TACCGATGTG CCCGATGCGC CCAGTGGTTT TCGAGCCTAC TCCAAAGAAG CTGGCTTGCG TTTATATGTC ACTAGCGAAT TTTCCTACAC GATTGAAAAT TTAATTCAGG CTGGCAAACG TCGCTTAAAT GTTGATCATG TGGCGATTAC CACCAAGCCA ACTCGCCCTT CACGCTTACA TCGCGGCAAT TTCAATTTCG TCAAACGCCA AGGCGCGACG ATTATCCGTA CCTATGCCCA ATATGAACCA CTCAAAACCT TCACCTACTT TGCCCTGCCA TTTTTGCTCA TTGGCGGCGG CCTGCTGCTA CGCTTTATGA TCTACTACCT GATCGATCCT AATCAGACCT ATACCCGCTA TTTGCAATCA GTTTTTATTG GCGGGGTGTT TATGATTGTC GGCATTTTGA CCTTCTTCAT TGGGATTTTG GCCGACCTCA GCGGCAACAA TCGCCGAATC AACGAAGAAA TTCTGTATCG TTTGCGTAAC CTTGAAGTCG AGGTCGCCCG CCGCTTGCCC AGCAACGATA GCGATGAAAT TCATGATTAA
|
Protein sequence | MATMPMSETY IQPSEPTRSH PWRVIIQIPC LNEEATLPDV LNDIPLDLPN VHFETLVIDD GSTDRTVEVA RELGVNYIVR HRGRKGLPAA FQSGMDAALK LGADIIVNTD GDHQYPGSAI PELIKPILEG KADIVIGDRQ TQNVEHFSTQ KKLLQRVGSW VVRVASDTDV PDAPSGFRAY SKEAGLRLYV TSEFSYTIEN LIQAGKRRLN VDHVAITTKP TRPSRLHRGN FNFVKRQGAT IIRTYAQYEP LKTFTYFALP FLLIGGGLLL RFMIYYLIDP NQTYTRYLQS VFIGGVFMIV GILTFFIGIL ADLSGNNRRI NEEILYRLRN LEVEVARRLP SNDSDEIHD
|
| |