Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3234 |
Symbol | |
ID | 5900689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3491711 |
End bp | 3493741 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641563739 |
Product | glucosyltransferase MdoH |
Protein accession | YP_001684859 |
Protein GI | 167647196 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2943] Membrane glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.519144 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTGA TGAGCGAACG CGGCGCTTTC TTCGACCTGA CCTCCTCGCG GAGCGTCGCC TATGACGTGG CGTTCGCCCC CAGCGAGCGG GTCGAGATCC CGACCGACGC GCCGCTGGCC ATGCCCGTCC AGCCCCTGGC TTTCTGGCAG GGCGCGGCCA AGCCGGTCGC CACGGCCCCC GCCAATTCCC GCTGGGACAG CCTGACCCTG CGCCGTTGGG GCGTGTTCGT CGCCACCGCG ATCATGGGCT TCGCCGGGTG GAGAGCCACC TACGACACCA TCGCCCTGGG CGGCGTGACG CGCCTGGAAG CCGTCTGCAT CACCCTGCTG GCCCCGCTGT TCCTGGCCCT GGCCCTGTGG TTCTGCACCG CCGTGGTCGG CTTCGTCGTG CTGCTGGGCA AGCCGAAGGA TCCGCTGGGC ATCGACGATT CCGAGTCCTC GCCGATGCCT CGTCCCAAGG CCCGCACCGC CATACTGATG CCGGTGCACA ACGAGGACGC CGCCGAGGTG TTCGCCCGCC TGCGCGCCAT GGACGCCTCG ATCGCCGAAA CCGGCAACAG CCGCGCCTTC GACATCTTCA TCCTCAGCGA CACCCGCGAC GCCGCCGTGG CCCTGGCCGA GCAAGCCTGC TTCGCCCGCT TCCGCCGCGA GGCCAACAGC CACGTCTTCT ACCGCAAGCG CACCCAGAAC ACGGGCCGCA AGGCCGGCAA CGTCGCCGAC TGGGTGGCGC GCTGGGGCGG CGACTACGAG CACATGCTGA TCCTCGACGC CGACAGCCTG ATGACCGGCG AGGCCATGGT CCGCCTGGCC GACGCCATGG AGCGTCACCC GCGCGTGGGC CTGATCCAGA CCATGCCGAT GATCATCAAC GGCCAGACGA TCTTCGCCCG CACCCTGCAG TTCGCCACGC GCCTGTACGG CCGCGTCGCC TGGACGGGCC TGGCCTGGTG GTCGGGTTCG GAGAGCTCGT TCTGGGGCCA CAACGCCATC GTCCGCACCC GCGCCTTCGC CGAGACCTGC GGCCTGCCCA GCCTGGACGG TCCCAAGCCG TTCGGCGGCG AGGTGATGAG CCACGACGCC CTGGAAAGCG CCCTGCTGCG CCGCGGCGGC TGGGGCGTGC ACCTGGCCCC CTATCTGGGC GGCTCCTATG AGGAGAGCCC CTCCAACCTG CTCGACTTCG CCACCCGCGA TCGCCGCTGG TGCCGGGGCA ATGTGCAGCA CATCCCGCTG ATCGGCCTGC CCGGCCTGCA CTGGATGAGC CGCCTGCACC TGGTGATCGG TGTGCTGAGC TACGTGCTCT CGCCCCTGTG GTTCGTGGCC CTGTCGTCGG GGATCATCTC GCGGATGCTG ATGCCCGAGC TGAAGAAGGC CGCCTTCACC ATGGCCGACC TGCAGGCCGC GGCCCACGCC CTGATCGACT GGCGCGAGAT CCAGGCCACC GCCTGGGCGA TGATCATCAC CTTCGTGCTG CTGTTCGGCC CCAAGATCCT GGGCTCGATC CTGGTCTTCT CGCGCAAGAG CGAGATCAAG GGCTTCGGCG GCCGCCGCCG GATCCTGGCC GGCCTCGCCG TCGAGATGCT GCTCTCGGCC CTGGTGGCCC CGATGCTGAT GTTCACCCAG ACCCGCGCCC TGGTCGAGAT CCTGGCCGGC AAGGTCGGCG GCTGGGCCAC CCAGCGCCGC GACGCCGACA AGGTCACCGG CAAGGAAGCC TTCGCGGCCA TGGGCTGGAT CAGCGTCACC GGCCTGGTGC TGGCCGTGGC CTTCTGGTTC ACGCCGGACC TGCTGACCGC CACCCTGCCC ATCCTGGCCG GCCTGATCCT GGCCGTGCCC CTGACCATGC TGGGCGCCCA CAAGATCGCC GGCCTGGGCG TCAAGGCCAA CGGCCTGTTC ATGACCCCGG AAGAGCGCCG CCCGCCGGCC ATCGTCCGCG CGGCCCTGGG CCTGGCCTGC GAACCTCCCG CCGCCTGGTC TACCCGCCAA CGCCACACCG CGATGGTCGC CGAACAGCCG GCGGAACAGA ACGCCGCCTA G
|
Protein sequence | MDLMSERGAF FDLTSSRSVA YDVAFAPSER VEIPTDAPLA MPVQPLAFWQ GAAKPVATAP ANSRWDSLTL RRWGVFVATA IMGFAGWRAT YDTIALGGVT RLEAVCITLL APLFLALALW FCTAVVGFVV LLGKPKDPLG IDDSESSPMP RPKARTAILM PVHNEDAAEV FARLRAMDAS IAETGNSRAF DIFILSDTRD AAVALAEQAC FARFRREANS HVFYRKRTQN TGRKAGNVAD WVARWGGDYE HMLILDADSL MTGEAMVRLA DAMERHPRVG LIQTMPMIIN GQTIFARTLQ FATRLYGRVA WTGLAWWSGS ESSFWGHNAI VRTRAFAETC GLPSLDGPKP FGGEVMSHDA LESALLRRGG WGVHLAPYLG GSYEESPSNL LDFATRDRRW CRGNVQHIPL IGLPGLHWMS RLHLVIGVLS YVLSPLWFVA LSSGIISRML MPELKKAAFT MADLQAAAHA LIDWREIQAT AWAMIITFVL LFGPKILGSI LVFSRKSEIK GFGGRRRILA GLAVEMLLSA LVAPMLMFTQ TRALVEILAG KVGGWATQRR DADKVTGKEA FAAMGWISVT GLVLAVAFWF TPDLLTATLP ILAGLILAVP LTMLGAHKIA GLGVKANGLF MTPEERRPPA IVRAALGLAC EPPAAWSTRQ RHTAMVAEQP AEQNAA
|
| |