Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4459 |
Symbol | |
ID | 5901920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4826532 |
End bp | 4828457 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641564978 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001686077 |
Protein GI | 167648414 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCGAG CGGGAAAGAT CGGGGCGCAT GTGGCGCTGG CCTTCGTGGC CATGCTGCTG GCTCAACTGC TGACCGCCGG CGTCGAACCC TTCGCCCTCT CCGCCCTGGT CCACGCCGCG CTGTTCGCCG CCTCGGCCCT GGCCGTCGAG ATGCTGTTCC AGGTCGAGCG CTCGCCCTGG CGATTCTTCG CGGCCAGCGA CGGTCTGCGC CTGGCCCGGT CGGCCCTGCT GTCAGTGCTG ACCTTCGCGC TTCTGGCCCG CATGGCCGAC GTGCGCCAGC CCGGCGGCCC GCGCACCCTG ATCGCCGCCT TCCTGCTGCT GCTGACCCTG TTGGCCGGAC TACGCATCGT TCGCCGCACG ATCCACGACA AGGTGCTGGT CGAATCCCTG ACCCGCCTGC GTCCGGCCCC CGCCTCGCCG TCCCTGCCGC GCCTGCTGAT CGTCGGCTCG GCCAGCGAGG CCGAGGCCTT CCTGCGCGCG CCCCTCGCCC TGGGCGAACG CTACGCCCCG GTCGGCGTGC TGACCCCCGA GGCCCGCGAA ACCGGCGACG AGTTGGGCGG AGTCTGCATC CTCGGGGTGA TCGACGATTT CGACGCCGTC ATGGCCCAGC TGCGGGACGG GGGCCTGCAG CCGTCGGCCC TGCTGTTCCT GACCGACGGC CCGCTCAGCG CCTTCGGGAC CGAACGGCTG GGTCGGCTGA AGACCGAGGG CGTGCGCCTG CTGCGGCGCC AGGGCCTGGT CGATATGAGC CCCATCGGAG GATCCGCCAG CGCGGCCCTG CGCGAGATCA GCCTCGAGGA ACTGCTCAGC CGCGCGCCCG TGCGGCTGGA TCCCGAGCCG GTCCGCGCCC TGGTCGCCGG GCGGCGGGTG CTGGTCACCG GGGCCGGCGG CAGCATCGGC TCCGAGCTGG CCCGCCAGAT CGCCGCCAGC GGCCCGGCCC ACCTGACCCT GCTGGACTCG GCCGAGGCCA ACCTGTTCCA CATCGACCGC GAGCTGGGCG AGGCCTGGCC CTCGCTGGCT CGCCGCGACG TGCTGTGCGA CGTGCGCGAC GCCGACCGCG TGGCCCTGGT GTTCACCGCC GAGAAGCCCG AGCTGGTCTT CCACGCCGCC GCCCTCAAGC ATGTCACCCT GGTCGAGAAC CACCCCTGCG AGGGCGTGCG CACCAACGTG CTGGGCAGCC GCAACGTCGC CGCCGCCGCC AAGGCCTGCG GCGCGGCCCA CCTGGCCCTG ATCTCCACCG ACAAGGCCGT GGCCCCGGCC AGCGTGATGG GCGCGACCAA ACGGGTGGCC GAGGCCGTGG TGCGGCAATT CGGGGCCAGC GAGGCGACCC GCGTCAGTGT CGTGCGGTTC GGCAACGTGC TGGGCTCGGC CGGCTCGGTG GTGCCGATCT TCCAGCACCA GATCGCCCGC GGCGGCCCGG TCACCCTGAC CGACGCCGAG GTCGAGCGCT ATTTCATGAC CATTCCCGAG GCCGTGCAGC TGGTCCTGCG GGCCGTGGCC CTGTCGGCCG GCGATCTGGA GCCCCCCACC GGCGTCTTGA CGCTTGAAAT GGGCGAGCCG GTCAAGATCA TCGACCTGGC CCGGCGGATG ATCGAGCTGC AGGGCCTGAC CCCCAGCCGC GACATCGAGA TCAAGATCAC GGGCTTGCGG CCCGGCGAAA AGCTCACCGA GGCCCTGGTC GATGTCAACG AGACCACCCG GCCCAAGGCC GACGGGGTCA CCGAGGCGAC GCCGCTCACG CCCCTGCCGG TGATCGCCGC CGCCGCCCTG GCCGCGCTGG AAGCCGCCGC CCTGGCCGGC GATGAGCTGA TGGTCCGTGA GCGGCTGTTC GCCCTGGTCG AGAGCCTGCG CGGCGCGCCC CCCTCCAATC CGGTTCGGGC CAAGCCAGTT CGGGGCAAGA CGACCCGGGC GCGCACCCAG GTCTAG
|
Protein sequence | MGRAGKIGAH VALAFVAMLL AQLLTAGVEP FALSALVHAA LFAASALAVE MLFQVERSPW RFFAASDGLR LARSALLSVL TFALLARMAD VRQPGGPRTL IAAFLLLLTL LAGLRIVRRT IHDKVLVESL TRLRPAPASP SLPRLLIVGS ASEAEAFLRA PLALGERYAP VGVLTPEARE TGDELGGVCI LGVIDDFDAV MAQLRDGGLQ PSALLFLTDG PLSAFGTERL GRLKTEGVRL LRRQGLVDMS PIGGSASAAL REISLEELLS RAPVRLDPEP VRALVAGRRV LVTGAGGSIG SELARQIAAS GPAHLTLLDS AEANLFHIDR ELGEAWPSLA RRDVLCDVRD ADRVALVFTA EKPELVFHAA ALKHVTLVEN HPCEGVRTNV LGSRNVAAAA KACGAAHLAL ISTDKAVAPA SVMGATKRVA EAVVRQFGAS EATRVSVVRF GNVLGSAGSV VPIFQHQIAR GGPVTLTDAE VERYFMTIPE AVQLVLRAVA LSAGDLEPPT GVLTLEMGEP VKIIDLARRM IELQGLTPSR DIEIKITGLR PGEKLTEALV DVNETTRPKA DGVTEATPLT PLPVIAAAAL AALEAAALAG DELMVRERLF ALVESLRGAP PSNPVRAKPV RGKTTRARTQ V
|
| |