Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1539 |
Symbol | |
ID | 5898994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1629174 |
End bp | 1630694 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641562026 |
Product | undecaprenyl-phosphate glucose phosphotransferase |
Protein accession | YP_001683167 |
Protein GI | 167645504 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCCG CTATCGCACA TTCGCCGGAA CTCGACGGCC TCATCGTGAC CGACGTTCAG CCCGCGCCGC CGCGCCTGGT GATCGAGGAC GACGAGGCCG TCGGCCTGGG CCGGCGAGGA CCGTTCCGGC CTGGTCGGCT GGTTCCGGCG CGAGCCCGGA TGCAGGTGCG CAGCCTCTCC CGCCTGTTCC GCGTCGTGGA CGGGATCGCC TTCGCGGCCG TCACCGTCGC CACGATCCTT GTCGCCCGCC CGACCCCCGC TGCGTTCGCG CCGCTGATCC TCGGCGCCCT GACCCTGCTG CCCGCTCTCT ACCTGCTGGA GGCCTATGCC TTTCATCGGC GCGAGACCCT GGGACGGCAG GCCCTGCGCG TGCTCGGCGC GTTCGGCGTC GTCGGCGTGG TCGTCTTGCT GGCCACCTTG ATCTTCGGTC ATACGCACGT CGAGCCGGTG ATCCTGGCGG GATGGGCGGG CGCCCTGGTC GCCACGACCG CCGCGCTGCA CGCCCTCTGG TGGCGCGCCA TCGACCATGG CCGCCGCCAA GGCAGCCTGA CCCCCAACGT CGTGGTGGTG GGCGCCACGG TCAATGCCGA GCGCTTCATC CGCGGCGCCT TGGCGACCGG CGACGTCAAC GTGCTGGGCG TGTTCGACGA CCGCGCCGAT CGCGCCCCGC CCCAGGTGCT GGGCGTGCCG GTGCTGGGCG ACACCAACGC CCTGATCGAC CACCGCATCA TGCCCTATGT CGACCGGGTG ATCATCGCGG TGAGCTCCAG CGCCCAGGCT CGCGTCAGCC AGCTGGTCGA GCGCCTGGAG GTGCTGCCCA ATCCCGTCAG CCTGTTCGTC GACCTGGGCC GCCAGGCCCA GCGCGACGCC TCTCTGGCCC GCTTCGTCGA CCTCTCGGGC GCGACCACCG ACGCCCGCCG GGCCATCGCC AAGCGCGCCC AGGACCTGGT GGTCGGAACC GTCGGCCTGA TCGTCGCGGC GCCGATCATG CTGCTGGTCG CCATCGCCAT CCGGCTGGAC AGCCCCGGCC CCGTCTTCTT CCGCCAGCGC CGCCACGGCT TCAACAACGA GGCGATCCTG GTCTGGAAGT TCCGCTCGAT GCGCCACGAG GTCGCCGACG CCAAGGCGTC GCGCCAGGTC AGCGCCAACG ACGATCGCGT CACCAAGGTC GGCAAGTTCA TTCGCAAGAC CAGCCTCGAC GAGCTGCCCC AGTTGTTCAA CGTGCTGAAG GGCGAGATGT CGATGGTCGG CCCCCGCCCC CACGCCATCG GCATGAAGAG CGGCGACGTC GAGTCGGCCA AGCTGGTCGC CCACTACGCC CACCGCCACC GCATGAAGCC GGGCGTCACT GGCTGGGCGG CGATCAACGG CTCGCGCGGC CCGGTCGACA CGGCGCAGCT GGTGCAGGAG CGCGTCGCCC TGGACGTCGA CTACATCGAG CGCCAGTCGT TCTGGCTGGA CCTCTACATC ATCGCCATGA CCATCCCCTG CCTTCTGGGG GACAGGTCGG CGGTGCGCTA G
|
Protein sequence | MTSAIAHSPE LDGLIVTDVQ PAPPRLVIED DEAVGLGRRG PFRPGRLVPA RARMQVRSLS RLFRVVDGIA FAAVTVATIL VARPTPAAFA PLILGALTLL PALYLLEAYA FHRRETLGRQ ALRVLGAFGV VGVVVLLATL IFGHTHVEPV ILAGWAGALV ATTAALHALW WRAIDHGRRQ GSLTPNVVVV GATVNAERFI RGALATGDVN VLGVFDDRAD RAPPQVLGVP VLGDTNALID HRIMPYVDRV IIAVSSSAQA RVSQLVERLE VLPNPVSLFV DLGRQAQRDA SLARFVDLSG ATTDARRAIA KRAQDLVVGT VGLIVAAPIM LLVAIAIRLD SPGPVFFRQR RHGFNNEAIL VWKFRSMRHE VADAKASRQV SANDDRVTKV GKFIRKTSLD ELPQLFNVLK GEMSMVGPRP HAIGMKSGDV ESAKLVAHYA HRHRMKPGVT GWAAINGSRG PVDTAQLVQE RVALDVDYIE RQSFWLDLYI IAMTIPCLLG DRSAVR
|
| |