Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2839 |
Symbol | |
ID | 5900294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3078070 |
End bp | 3079302 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641563334 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001684464 |
Protein GI | 167646801 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.000929043 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0434623 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTGA GGTTTGATCC GCTTTCGGCG CTCAGCGAAG CAGGAGAACC CCTGTTCAAG GCTGTCCGTC CGGGTGGAAA GCCGGTCAGG CTGGTCGATA CGACCATGCT CTATGCCCCC CGTAGCGGCG GGGTTCGCCG CTATCTGAAC TCCAAACGAG CCTGGATCGC GGCGAATCGC CCACAGGTCC GCCACACCCT CGTGGTGCCC GGCCCCCGCG ACGCGCACGA CGGCCATGGA CGCGTCTCGA TCTACGCCGC GCCGTTGCCC TTCGGCGACG GCTATCGCTG GCCGGTGGTC AAGAACGCCT GGATGGAGCG ACTGATTCGT CAGCGGCCGG ACATCATCGA GGCCGGCGAT CCCTATACCC CGGGTCTGGC GGCCCTGAAG GCTGGCGACG CTCTGGGCGT GCCGGTGGTC GGCTTCTGCC ACACCGACCT GGGCGCCTTG GCGGCCCTGC ACATCGGCGA ATGGGCCGAA AAGCCCGTAC AGAAGCGCTG GGCGGCGATC TACAGCCAGT TCGACCAGGC CGTCGCCCCC AGCCAGTTCA TCGCCGGGCG CCTGATCGAG GCCGGGGTCA AGAACGCCAT CGGCCTGCCG CTGGGCGTCG ACACCGAGAT TTTCCGTCCG GGCCGCGGTG ACCGAGAGGC GCTACGTCGA CGGCTCGGCC TGACCAGCCG CCATCGCATC CTGGTGTTCG CCGGCCGGCC GGCCAAGGAG AAGAAGCTCG ACGTGCTGGT CGAGGCCGTG GAGCGGCTGG GCGATCCCTA TGTGCTGCTG TTTGTCGGCG CGGGGGCGGG GGCGCCGTCC AGCGACCGGG TGATCTGCAT GGACTATCAG CGCGATCCGC AGGGCCTCGC CGCGGTGCTG GCCGGCTGTG ACGCCTTCGT GCACGCCAAC GACAACGAGC CGTTCGGCCT GATCGTGCTC GAGGCCATGG CCTGCGGCCT GCCGGTGATC GGCGTGGCGG CCGGCGGGGT GGCCGAATCG GTCGATGAGA CGGTCGGAGC CCTGGCCACG GCTTCGGAAG CCCGCGCCTT CGCCGAGGCC GTGGAATCGG TGTTCGCACG CGACGTCATC GCCCTCGGCC AGGCCGCGCG CCTGCGGGCC GAGCAGCGGC ACGGCTGGGA CCCGGTGTTC CGCAAGCTTT CGGCGATCTA CGGCCGGCTG ACCGGCTGCG CCGCGTTCGA GGACGCGCCC GCGCCGGTCG CCGAACCGCC CGGCTGGAAC TAG
|
Protein sequence | MNLRFDPLSA LSEAGEPLFK AVRPGGKPVR LVDTTMLYAP RSGGVRRYLN SKRAWIAANR PQVRHTLVVP GPRDAHDGHG RVSIYAAPLP FGDGYRWPVV KNAWMERLIR QRPDIIEAGD PYTPGLAALK AGDALGVPVV GFCHTDLGAL AALHIGEWAE KPVQKRWAAI YSQFDQAVAP SQFIAGRLIE AGVKNAIGLP LGVDTEIFRP GRGDREALRR RLGLTSRHRI LVFAGRPAKE KKLDVLVEAV ERLGDPYVLL FVGAGAGAPS SDRVICMDYQ RDPQGLAAVL AGCDAFVHAN DNEPFGLIVL EAMACGLPVI GVAAGGVAES VDETVGALAT ASEARAFAEA VESVFARDVI ALGQAARLRA EQRHGWDPVF RKLSAIYGRL TGCAAFEDAP APVAEPPGWN
|
| |