Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4941 |
Symbol | |
ID | 5902403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5338696 |
End bp | 5339940 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641565461 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001686559 |
Protein GI | 167648896 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.421428 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAGC TCAGGGCGTT GGCTTGGCGC GTCGCGCCCG ATCTTTGCCA TGTCATTTCG CTCCGCCTCC CCGGCCTGCG CGCCTACTGG TTTCATGGAA GACGCCGGCG GCGCGGGCGG TCCTTGTCCG AAGGTCCGGT GATCGTCGCG GGATTTCACG GCGCGGTCCT GGGTCTGGGC GAGGCCGCCC GGGGCACCGT CACGGCGCTG GCCGCGACCG GCATCGAGGC CCAGGCTTGG GACGTGTCCG CCCAGCTGGG CCACGTGCGC CGCTTCGATA TCGGCGAGGT GGCGACCCCC CCGCCCGGCC CCGGCACGAT CATCACCCAG ATGAACCCGG CGGAGCTGAT TCGCCTGGTC AGCGCGACGC GCGGCGCGCC CTTCGAAGGA AAGCGCTCCA TCGGCTACTG GGCCTGGGAA TTGATGGACA TTCCCGAGGC CTGGAAGCCG GCCTTCCGCT ATGTGGACGA GATCTGGACG CCGTCGAACT TCTGCGCCGA GGCCATTCGC CGTTCCGCGC CTCGCGACCT GCCGATCAAG GTCGTCCCTC ATCAGGCTCC CCTGAACCAC GCCGCGCCCA ACCGGGAGCG GTTTGGCCTG TCGCCGGACC ATGTCGTCGT GCTCTGCGCC TTCGATCTGA GATCCACCCT GGCCCGCAAG AATCCGCTGG GCGCGCTGGA GGCCTTCCGG ATCGCCGCGG CCAAGGCCAA GCGGCCGGTG ACCCTGGTGT TCAAGACGGT CGGCGGCGCC GACGCTCCCG ATAGCCTGGC GACGCTGCGC GCGGCGATCG GCGACACCCC CGACGTGCTC GTGCTGACCG AGTCGTTGAG CATGGGCGCT CGCGACCAGC TCATGGCCAG CTGCGACATC TTTCTTTCGC TGCACCGATC GGAGGGCTTC GGCCTGCTGC TGGCCGAGGC CATGGCCGCC GGCAAGGCCG TGGTGGCGAC GGGCTGGTCG GCCAACATGG ACTTTATGGA CGCGGAGTCG GCGATGCTCG TGCCCTACGC CCTTTGCCCC GTCCGCGACC CCCAGGGTCT GTACCAAAAA GGCGTCTGGG CCGAGCCCGA CACAGAGGCC GCCGGCCGGG CCCTCGCGGA ACTGATCAAC AACCCCGATC AACGCGCCGA ACTCGGCGCC AAGGCCCTGG CCGCCGTCCG CCAACGTCTG AGCCCGCCGG CCATCGCCGC GATCATGCGA CGGGCCTTTG ACGGGTCGCC CGTCCGCAAG GGGGCCAACG GGTGA
|
Protein sequence | MKQLRALAWR VAPDLCHVIS LRLPGLRAYW FHGRRRRRGR SLSEGPVIVA GFHGAVLGLG EAARGTVTAL AATGIEAQAW DVSAQLGHVR RFDIGEVATP PPGPGTIITQ MNPAELIRLV SATRGAPFEG KRSIGYWAWE LMDIPEAWKP AFRYVDEIWT PSNFCAEAIR RSAPRDLPIK VVPHQAPLNH AAPNRERFGL SPDHVVVLCA FDLRSTLARK NPLGALEAFR IAAAKAKRPV TLVFKTVGGA DAPDSLATLR AAIGDTPDVL VLTESLSMGA RDQLMASCDI FLSLHRSEGF GLLLAEAMAA GKAVVATGWS ANMDFMDAES AMLVPYALCP VRDPQGLYQK GVWAEPDTEA AGRALAELIN NPDQRAELGA KALAAVRQRL SPPAIAAIMR RAFDGSPVRK GANG
|
| |