Gene Information |
Locus tag | Caul_3563 |
Symbol | mdoG |
ID | 5901018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3847059 |
End bp | 3848600 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564071 |
Product | glucan biosynthesis protein G |
Protein accession | YP_001685188 |
Protein GI | 167647525 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
|
|
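The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information table above.
START_BP = 3847059
END_BP = 3848600
REPORTED_GENE_LENGTH = 1542      # bp
REPORTED_PROTEIN_LENGTH = 513    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches record:",
      computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches record:",
      computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, the computed values agree with the Gene Length and Protein Length fields in the record.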
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCGA GTTCCGTGAA CCGGCGCACG GCGCTCGGCG GGTTGTCGCT GTCGGCGATC CTCGCCCTGG CGCCCAGGCT GGCGGCGGCC CAGACGCCGG CCGTCGACGC CGCCTTGAGC GAGCCGTTCT CGGTCGAGCT TCTGCGCGAC CGGGCCCGCC GCCTGGCCGC CGCGCCCTAC GCCAAGCCGC CGGCCGGGCC GACGATGACC GGCCTGTCCT ACGACGACTA TTTCCAGATC CGCTTCCGCA AGGACGCCGA CCTCTGGAAG GATCGCCGGC CGCCATTCCG GGCCGAGTTC TTCCCGCCGG CCTATCTCTA CCCGCGTCCG CTGCCGATCT ACGAGGTCGC CGCGGGCCTG GCCCGGCCGA TCGCCTTCGA CCCGGCGATG TTCGACATCC CCGGCGAGCA CGCCGACGCG GCCAAGGGCT GGAGCGGCTT TTCGGGGTTC CGGCTGCTGT GGCCGCTGAA CGAAGCCGAG AAGATGGACG AGATCGCGGT GTTCCAGGGC GCGAGCTATT TCCGCAGCCT GGGGCGCGGC CAGCGCTACG GCCTGTCGGC GCGGGGCCTG GCGATCGGGG CGGGCGAGCC GAACGAGGAG TTTCCGGACT TCACCGCCTT CTGGCTGGAA CGGCCCCCCA TCGAGAGCGA GGCCATCGAT GGTGATGCGG TGACGGTCCA TGGCCTGATG GACAGCCCCA GCTGCGCGGG CGCCTACAGC TTCACGATCC GTCCGGGCGA GGACGTGGTG TTCGACGTCG CCGCCGCCGT CTTTCCCCGG ACGACCCTGG CCAAGGCCGG CGTCGCGGCG ATGAGCAGCA TGTTCCTGTT CAATGTCGCC GACGCCTCGC CGGTCGATGA CTTCCGCACC GCCGTGCACG ACAGCGACGG CCTGGCGATC TGGACGGGGC GGGGAGAGCA CCTGTGGCGA CCGCTGCGGA ACCCCAAGGT CCATCGCCTG AGCCACTTCC CGGACGACAG CCCCAAGGGC TTTGGCCTGG TGCAGCGGGC GCGGGCGCTG GAGGACTTCG GCGACCTGCA GGCCCGCTAC GACCTGCGGC CCAGCCTGTG GGTCGAGCCG CTGGACGCCT GGGGCAAGGG CGCGGTGCAT CTGGCCGAGT TGGCGACCAC CAAGGAAACC GACGATAACA TCGCCGTCTT CTGGCGGCCC GAACAGTCCT GGGCCGCCGG GTCTCAGGTC GACCTGCGCT ATCGCCTCTA CTGGGGCCAG GAACGGTTCG CCAGTCCCGA CGCCCGGGTG TTCCGCACCC GCACGGGCGC CGACGCCCAG GCCGGCCTGC GGCTGTTCAC GGTCGACCTG CAGGGCGGCG CCCTGGCCGA GGGGCTGGAC GGCGTGGCGC TGGAGATCCA GGCCGACGGC GGGACGATCG CCTGGAAGGA CCTGTCGCCC TATCCCGACG AAGCCTCGGC CCGGGCCGCC TTCGGCCTGC GCCCGACCCA GGCCAGCGTG GATCTGTCGC TGCGGCTGGT CCGCGGTGAT CAGCCGATTT CCGAGACCTG GCGCTATCCC TGGACGGCTT AA
|
Protein sequence | MFSSSVNRRT ALGGLSLSAI LALAPRLAAA QTPAVDAALS EPFSVELLRD RARRLAAAPY AKPPAGPTMT GLSYDDYFQI RFRKDADLWK DRRPPFRAEF FPPAYLYPRP LPIYEVAAGL ARPIAFDPAM FDIPGEHADA AKGWSGFSGF RLLWPLNEAE KMDEIAVFQG ASYFRSLGRG QRYGLSARGL AIGAGEPNEE FPDFTAFWLE RPPIESEAID GDAVTVHGLM DSPSCAGAYS FTIRPGEDVV FDVAAAVFPR TTLAKAGVAA MSSMFLFNVA DASPVDDFRT AVHDSDGLAI WTGRGEHLWR PLRNPKVHRL SHFPDDSPKG FGLVQRARAL EDFGDLQARY DLRPSLWVEP LDAWGKGAVH LAELATTKET DDNIAVFWRP EQSWAAGSQV DLRYRLYWGQ ERFASPDARV FRTRTGADAQ AGLRLFTVDL QGGALAEGLD GVALEIQADG GTIAWKDLSP YPDEASARAA FGLRPTQASV DLSLRLVRGD QPISETWRYP WTA
|
| |
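The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The sketch below is one possible way to do that and to compute a GC percentage comparable to the GC content field in the record; the helper names are hypothetical, and only the first five blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one upper-case string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First five 10-base blocks of the gene sequence shown above.
sample_blocks = "ATGTTTTCGA GTTCCGTGAA CCGGCGCACG GCGCTCGGCG GGTTGTCGCT"

sample = clean_sequence(sample_blocks)
print(len(sample), "bases in sample")
print(f"GC content of sample: {gc_percent(sample):.1f}%")
```

Passing the full Gene sequence field through `clean_sequence` and `gc_percent` would give a figure to compare against the GC content reported in the record.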