Gene Information |
Locus tag | Haur_3622 |
Symbol | |
ID | 5735483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4552399 |
End bp | 4554483 |
Gene Length | 2085 bp |
Protein Length | 694 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641280771 |
Product | glycosyl transferase family protein |
Protein accession | YP_001546386 |
Protein GI | 159900139 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
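The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 4552399
END_BP = 4554483


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon
    that is not counted as a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                    # gene length from coordinates
print(protein_length(length_bp), "aa")    # protein length from gene length
```

Under these assumptions the results agree with the Gene Length (2085 bp) and Protein Length (694 aa) rows.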
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCAT CAACCACGAC GAGCAATCTG CGCCGCCCAT TGGCTTGGCG GATTGCCCAA TTAACCCGTA GCCAAGCGTT GGTCCTGCTG AGTATCATTT TGCTATGTGC TGGCTGGCTG CGTTTGCAGA ATTTCGCTGC TGTAGCTGAA GGCAATACCT ACTACACCGC CGCCACCGTT GCCATGACCC AATCGTGGCA TAACTTCTTC TTTGCAGCAG CCGAGCCTGG TGGCTCAGTC ACAATTGACA AGCCAGCGCT AGGGTTATGG ATTCAGGCGA TTTTTGGCAA GTTGTTTGGG GTTAGTGGCA CAGTGGTGGT GCTGCCGCAA GTTTTGGCTG GGCTAGCCAC AATTGGCCTG CTATATTGGA TTGTGGCGCG GCGCTGGGGG CGGTCGGCAG GCTTATTGGC GGCAGCGATT CAGGCGATTA GCCCAATTAG CATCGCCGTC GAACGTACTA ACAATCTTGA TGCGTTGTTG ATCGTGACTT TGGTTGGGGC GATGGCCTTG TTTTTGGTGG CAACCGAACG TGCCAGCACT AAATATTTGT TATTGGCAGG TGCAGTGGTT GGCCTCGGCT TTAATATCAA AATGCTACAG GCCTTTTTGC CCTTGCCAGC CTTTTATGCG ATGTATTTTT TTGCTGCCAA AACTGGCTGG TGGCGTAAGC TTTGGCAACT TGGACTGACC ACACCTGTGC TGTTGGCGGT CAGCTTTTCG TGGGCAATCG CGGTTGATCT GGTTCCAGCC AGCGAACGGC CATACATCGG TAGTAGCGAT ACCAACTCGG TTGTCAACTT GATTTTGGGC TACAACGGGG TTGAGCGCTT GACTGGACGC GAAGGCCAAG CCATGGGCGG CGCAATTCCA ACAACTGATG ATCGCCAGCG ACCCAATGCC ACCGATGGTA ATGCTCAAAT GCCTGCAATG CCCAATGGCA ACGCACAAAC TCCGCCCAAC GGTATGACCG ATGAAGGCAT GCGCGGCCAA TTTGGCGGCC AAACTGGCCG CACTGGTGGC GGTGGCCCAG GCATGGATAG CGGCGAATCG GGTTTATTCC GAATGTTCAG TTCACCGATG AACACTAACA TTGGTTGGTT GCTAGGAGCG GCACTCTTCG CCGTGGTTGG CTTGGCTGCC CACTACATCA AGCAACGCCG CTGGCCCGAT GCCGATGTTT GGGGCTGGGC TGGTTGGCTG GTTACTGCCT TTGTCGTGCT GAGTTTTGCT GGTTTTTCGC ATGGCTACTA TAGCGCTACG ATTGCTCCGG CGATTGCCGG CACACTGGCA ATTGGCATCA CGGTTTGGCG ACGCAGCGCC AGCAAGATCG TTGGTTTATG GCTCATTGGT TTGGTAGCAG CGGCCTTGGT AGTGCAAGTG ATTGCTGCGC AGCCCAGCGT CAGCGGCTGG CTGATTCCCT CAGTGGCACT TGGGTTGGCG CTGGTGGCGG CTGGCTTGGC ATTCCGAGCT TCGTGGCGGG TGGCAGCAAC GGCGGTTGGC ATCGCCGCGA TTCTGCTCAT CCCTAGTGAA TGGGCCTACA AAACGTCGGC GATGGAGCAA ATGAATACAA CCTTGCCCAG CGCCGCTGCC CCAACCGATA ATGCCACTGG CTTTGCGGCG GGCTTTGCTG GCAATCGTAA CCGCTCGGAT AGCAGCAGCC CCAGCGCTTT GGCAACCTAT CTACAAGAGC GTACTAGCGA TACTTACTAT ATGCTCGCCG TGCCAAGCTC GATGATGGGT TCGTCGTTGG TGATTGAAAC TGGGCGGCCT GTTTTGATGA TGGGCGGTTT CTCCGGCAGT 
GATCCGGTGA TTGATGCCGC TGATATTGCC CAGTTGGTAG CCGACGGCAA ATTGCGCTAC ATCATGACTG GCGGTATGGG CGGCGCTGGC CGTGGTGGCA GTTCAACGGT GCAAACTTGG GTCCAGCAAA ACTGTACGGC AGTCACTGAT GCACCGAACA GTCAAGCAGG CTTTGATTTG CCAAATGGCC AAATGCCAAA TGCTCAAGCA GCCCCCACCA ATGGCAACAC TGGTGGCGCA CAATTTGCTC AAAACAATTC ATCATTGTAT CGTTGTGGCG AATAA
|
Protein sequence | MSASTTTSNL RRPLAWRIAQ LTRSQALVLL SIILLCAGWL RLQNFAAVAE GNTYYTAATV AMTQSWHNFF FAAAEPGGSV TIDKPALGLW IQAIFGKLFG VSGTVVVLPQ VLAGLATIGL LYWIVARRWG RSAGLLAAAI QAISPISIAV ERTNNLDALL IVTLVGAMAL FLVATERAST KYLLLAGAVV GLGFNIKMLQ AFLPLPAFYA MYFFAAKTGW WRKLWQLGLT TPVLLAVSFS WAIAVDLVPA SERPYIGSSD TNSVVNLILG YNGVERLTGR EGQAMGGAIP TTDDRQRPNA TDGNAQMPAM PNGNAQTPPN GMTDEGMRGQ FGGQTGRTGG GGPGMDSGES GLFRMFSSPM NTNIGWLLGA ALFAVVGLAA HYIKQRRWPD ADVWGWAGWL VTAFVVLSFA GFSHGYYSAT IAPAIAGTLA IGITVWRRSA SKIVGLWLIG LVAAALVVQV IAAQPSVSGW LIPSVALGLA LVAAGLAFRA SWRVAATAVG IAAILLIPSE WAYKTSAMEQ MNTTLPSAAA PTDNATGFAA GFAGNRNRSD SSSPSALATY LQERTSDTYY MLAVPSSMMG SSLVIETGRP VLMMGGFSGS DPVIDAADIA QLVADGKLRY IMTGGMGGAG RGGSSTVQTW VQQNCTAVTD APNSQAGFDL PNGQMPNAQA APTNGNTGGA QFAQNNSSLY RCGE
|
| |
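The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. It assumes the amino acid assignments of translation table 11, which are the same as the standard code for internal codons. Alternative start codons are not handled, which does not matter here because the displayed sequence begins with ATG. Although the record lists the strand as "-", the sequence as displayed already reads in the coding direction, so no reverse complement is applied. The helpers `clean`, `translate`, and `gc_percent` are hypothetical names introduced for illustration.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence from the record.
print(translate("ATGAGCGCAT CAACCACGAC GAGCAATCTG"))  # MSASTTTSNL
# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("TATCGTTGTGGCGAATAA"))                # YRCGE
```

Both fragments reproduce the corresponding ends of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content rows to be checked the same way.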