Gene Information |
Locus tag | Anae109_4199 |
Symbol | |
ID | 5375214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. Fw109-5 |
Kingdom | Bacteria |
Replicon accession | NC_009675 |
Strand | + |
Start bp | 4924378 |
End bp | 4926183 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640845726 |
Product | glycosyl transferase family protein |
Protein accession | YP_001381361 |
Protein GI | 153007036 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
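The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 4924378, 4926183
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 1806 601
```

Both results match the Gene Length (1806 bp) and Protein Length (601 aa) fields of the record.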
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.458284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATCC CGACTCAGCC GCTTCGCCGC GAGCCGACCG CCGGCCCGCG GCCCGCGGCC ACGACGGCGG ACGAGCCGTG GTCCTCCGCC GAGCGGCGCT GGTACGCGGG CGCGATGGCG CTCGCCCTGC TCGTCATGGC GGCGGGGCTC GTCTTCCCGG ACCTCATGGG CGGCGACGCC GCGCAGGACG CGGTGATGGC GCTGCGGATG TACCTCGCCG ACGACTGGGT CAACCTCGTC AAGAACGGGC GCGACTACCT CGACAAGCCG CACCTCCTCT TCTGGTCGGC GCGGGCGAGC TACGAGCTGT TCGGCGTGCA CGACTGGGCC TACCGGCTCC CGTCGGCCCT GGCCTCCCTG CTCGGCGCGT GGGCGGCCTA CGGCATGGCG AGGCGCCTCC ACGGCGAGAC GGCCGGCCGG CTCGCGGCGC TCATGGTGGT CACCGCGTAC GCGATCGTGC TCGGCAATCA CGACGTCCGC ATGGACGCCC TGCTCATGGG CTTCACCGCC TTCGGGACCT GGCAGCTCCT CGAGTACCTG GAGACCGGCC GCGCCCGAGC GGCGGTCCTC GGCGGCGCCG GCGTGGCGCT CGGGGTCTCC GCGAAGGGCA TGGTCGCGGT GGCGGTGAGC GGCTGCGTGC TCTTCTTCTA CGTGTGGGGC CGCGGCCGCT GGCGGCGGCT GTGGAGCTGG AAGATCGCGC TGGGGATCGC CGTGTTCGTC CTCGCGCTCT CGCCGGTGCT CTTCGCCTAC TACCAGCAGT ACGATCTCCA TCCCGACAAG GTCGTGAACG GTCGCACCGG CGTCTCGGGC GTGAAGTTCA TCCTGCTCGG GCAGAGCCTG GAGCGCTTCG GCGGGGGCCG CGGGCACAAG ATCGCCGACG ACCACCTCTT CTTCTTCCAC ACGCTGGCCT GGGCGTTCCT GCCCTGGAGC CTCCTGACCT ATGCCGCGTG GGCCGAGCGG TTCCGGGAGC TGTTCCGGCG GCGCTGGGCC GCGTTCCGTG AGCGCGAGCA GCTCACCTTC CTGGGCCCGT TCGCGTTCCT CGCCGTCCTG GGCTTCTCGC AGTTCAAGCT GCCCCACTAC CTGAACGTCG TGCTGCCCTT CCTCGCCGTG TTCACGGCGA GCTACCTCGC CGACCTGCGC CGCGAGGGCC GGCTGCGGGC GCTCGCGCGT CTCCGGTGGG TGCAGCTCGT CGTCATCGCC GCGCTCCTCG CGCTCGTCGT GGTCCTGAAC GCGTGGGCGT TCCCGGTCGA GCGCGCCTGG ATCGTGCTCG CCGCGCTCGC GCTGCTCGCC GTCCTCGTCG CGAGCCTCCG CGTCCGCGAG CCGCTCGCGC GCGTGTGGGC GCCCTCCGCC GTCGCCATCC TCCTCGCGGA GCTCCTGGCG AACACCAGCT TCTACCCGCG CCTCGGCCGC TACCAGCCGG GGAGGGACCT CGCGGCCGCC GCGGAGGCGA GCGGCGTCGA CTGGGAGCGC ACGTTCTTCC TGGAGACCGT CTACCAGCCG TTCCAGTTCT ACGCGGGGCG TGTCATCCCG CAGCTGGACT TCGCCGGCCT GCACCGCGAG GTCGCCGCCG GACGAGAGCT CTTCCTCGCC GTGTCCGCGG AGGAGGAGCG CCGCCTGCGC GACGAGGGCA TCCCGCACGA GGTGCTCGCC ACGAGCCCGA GCTGCCGGGT CCTCAACCTC ACGGGAAAGT TCGTGAACCC GCGCACCCGC GACGGCACGT GCAAGACGGT GTTCCTGGTC GCCGCGGGGG CGAGCGCCCC GGACCACCGG CGCGTCGAGC CCGGTCGAAC CGGCGACCGT 
CCGTAG
|
Protein sequence | MRIPTQPLRR EPTAGPRPAA TTADEPWSSA ERRWYAGAMA LALLVMAAGL VFPDLMGGDA AQDAVMALRM YLADDWVNLV KNGRDYLDKP HLLFWSARAS YELFGVHDWA YRLPSALASL LGAWAAYGMA RRLHGETAGR LAALMVVTAY AIVLGNHDVR MDALLMGFTA FGTWQLLEYL ETGRARAAVL GGAGVALGVS AKGMVAVAVS GCVLFFYVWG RGRWRRLWSW KIALGIAVFV LALSPVLFAY YQQYDLHPDK VVNGRTGVSG VKFILLGQSL ERFGGGRGHK IADDHLFFFH TLAWAFLPWS LLTYAAWAER FRELFRRRWA AFREREQLTF LGPFAFLAVL GFSQFKLPHY LNVVLPFLAV FTASYLADLR REGRLRALAR LRWVQLVVIA ALLALVVVLN AWAFPVERAW IVLAALALLA VLVASLRVRE PLARVWAPSA VAILLAELLA NTSFYPRLGR YQPGRDLAAA AEASGVDWER TFFLETVYQP FQFYAGRVIP QLDFAGLHRE VAAGRELFLA VSAEEERRLR DEGIPHEVLA TSPSCRVLNL TGKFVNPRTR DGTCKTVFLV AAGASAPDHR RVEPGRTGDR P
|
| |
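The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal sketch of how the GC content and the translation could be rechecked from the gene sequence. It assumes the standard codon assignments, which translation table 11 shares for sense and stop codons; the helper names are hypothetical, and only the first 30 bases of the record are used as sample input.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in the record above.
sample = clean_sequence("ATGAGGATCC CGACTCAGCC GCTTCGCCGC")
print(translate(sample))  # MRIPTQPLRR
```

Pasting the full Gene sequence field into `clean_sequence` would allow the same functions to be compared against the GC content and Protein sequence fields of the record.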