Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2165 |
Symbol | |
ID | 5734052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2730886 |
End bp | 2732625 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279306 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001544933 |
Protein GI | 159898686 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGTTT CTGAACATCC CCGCTTACGA ATCGCGTGGT TGCTAACTGC GCCAATTGTC GGTTCTGGTG GATATACTAA TATTTTTCGA ATTATGAATC TCTTGGCGAG CTTTGGACAC GAGCAAGTTA TCTTTATGAA CCCGCACGAT TACCCAACCG ATGCGCTTGA TCGGCCTGAG CATTTTGTGC AACGCTATTT TGGCATGGTT CATGCGCCAA TTTATTTCTG GCCCCAAGAT ATTCGGGGTT TTGATGTGGT TTTAGCAACT CAATGGACGA CCACAATGGG CTTTGAGATG TGTGATCCGA CGATTCCACG CGCCTACTTT GTGCAAGATT TTGAGCCATT TTTCTATGCC ATGGGCGACG AATGGCTACG AGCTGAGGCA ACGTATAAAC AAGGTTGGCC ATGTATTACT TTAGGTCATT GGCTGGCCAA GCATCTACAC GAGCAATACA AGGCGACAAC CTATCCGTTT GATTTTGCAG TTGAACATGA GCGTTACTAT CCGCGCCCAC ATTTGTTAAC CAAAAAACCA CGGGTTATTT TTTATGCCCG GCCTTCAACG CCACGCCGTT GTTTCGATTT GGGGATGCGT GCCTTGGCGA TTGTCAAAGA GCAACGGCCT GATGTAGAAA TTGTGCTCTT TGGTGATAAA CATCTGAGCC ATCATTCTGC GCCGTTCAAA TTTACGGCCA TGGGCATTTT AAACCCCGAT GAATTAGCGG AATTGTATGC TTCGGCAACC TTGGGCTTAG TTATTTCCTC AACCAACCCT TCGCTGATGC CACCAGAAAT GATGGCCTCG GGTTGTCCAG TTGTTGATAT TGACTTGGCT CCGAACCATT TCTTAGTAAC CCATGGCAAA ACTGGCTTAC TCGCGACGGC TGAACCGAAA CCATTGGCCG AAGCTTTATT AAATGTGCTC AACGACGAAT CACTGCGCCA GCGCTTGATC AAAGCTGGCT TGGAGCATGC CCAAACATTA CGTTGGGAAA GCTCGGCGCG TGAGGTTGAA AAAGCCTTGT TGAAACTTGC CACAACCGCC TCTGGCAGCC TGATCCAACG GCTCTTGACT GATCATGAAC ATCCCAATGG CGATGGAGTT ACCGCCCCTT TGACTGGCGA TTTTCAAGTT GGGCAACGTT TGGTTTGTCA ACACGATGGA GTCTGTGCTT TTGAGGTGGC CTTGGCTCAA CCGGCAACTG GCCCTTGGCG CTTACAACTA TACGATGCCT TGATCAATCC GAATCGTACC TTGGTTGATG TTACTTCAAG CACAGTTAAC CAACAATGGC TGCATTTCGA ATTTCCGCCC TTGCCAGCGA GCCGCAACCA AGCCTTGCAT TTTGTGCTGA GCGCAACCAA TGGTTCAAGC TTGCGCTTTG ATTTCAAAGC ATCAGCAAAC GGTTCACTGA GTTTTAATGC AATGCCACAG TTGGGGCAAC TTTGCTGTCG CGTGTTATAT CAACCTGCCT ATGATTCAAC TGAACCGAGT GTAACGCCCG ATGTGGCCTA CCTAACAACC CAAAAAGCCC TGATTAGTAG TGAATATATT CAATTAAGCG AATTTGCCGA ACGGTTGCAC AACTTGATTG TGCCCAAAAA ATCGTGGCAG GAGCGTGGTG CCAAAACATG GCGCATGCTA CGCTCAGGAA ATTTCAACGG TTTAGCTGCT GAAGCCAAGC AATATGGCCG TTGGGTCTAC GATCGAGCTA AACAGCAATT GCGGCGCTAG
|
Protein sequence | MQVSEHPRLR IAWLLTAPIV GSGGYTNIFR IMNLLASFGH EQVIFMNPHD YPTDALDRPE HFVQRYFGMV HAPIYFWPQD IRGFDVVLAT QWTTTMGFEM CDPTIPRAYF VQDFEPFFYA MGDEWLRAEA TYKQGWPCIT LGHWLAKHLH EQYKATTYPF DFAVEHERYY PRPHLLTKKP RVIFYARPST PRRCFDLGMR ALAIVKEQRP DVEIVLFGDK HLSHHSAPFK FTAMGILNPD ELAELYASAT LGLVISSTNP SLMPPEMMAS GCPVVDIDLA PNHFLVTHGK TGLLATAEPK PLAEALLNVL NDESLRQRLI KAGLEHAQTL RWESSAREVE KALLKLATTA SGSLIQRLLT DHEHPNGDGV TAPLTGDFQV GQRLVCQHDG VCAFEVALAQ PATGPWRLQL YDALINPNRT LVDVTSSTVN QQWLHFEFPP LPASRNQALH FVLSATNGSS LRFDFKASAN GSLSFNAMPQ LGQLCCRVLY QPAYDSTEPS VTPDVAYLTT QKALISSEYI QLSEFAERLH NLIVPKKSWQ ERGAKTWRML RSGNFNGLAA EAKQYGRWVY DRAKQQLRR
|
| |