Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2842 |
Symbol | |
ID | 5900297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3080572 |
End bp | 3081840 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641563337 |
Product | glycosyl transferase family protein |
Protein accession | YP_001684467 |
Protein GI | 167646804 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.142528 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0240997 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGGT TTCTGTTCAC GACCTGGGAA GGCGGCGGTC ACGTCCAGCC GATGCTGCTG GTCGCCCGCG AGTTGGCTGC GCGGGGCCAT CAGGTGCTGG CGCTCAGCGA TCCCTGCAAC GCGCCGGACG CCGCGGCGCT GGACGTGCCG TTTCGCGCCT GGACGACGGC CCCCTTCCAG ACCGGCAAGA CCCGCGATGA CGATCGGCTG AAGGACCACC AAGCCGATAG TCCGCTAGAG GTGATCCAGC GCCTGCTGGA CCGGGTCATG ACCGGCCCCG CCCTGGCCTA TGCCCGCGAC ACCCTGGCCG CGATCGACGC CTTCGCGCCC GACGTCGTCG TGTCGCAGGA ACTGTTGTTC GGACCCATGG CGGCGGCGGA AGCGCGCGGC CTGCCGCTGG CCTTGCTCGC CGCCAACGTC TGGTCGCTGC CGACGATGTC CGGCGCGCCG CCGTTCGGGG CCGGCATGCT CCCGGCGGCC AATGACGAGG AACGGGCCAT GCACGCCATG GTCGGCCAGA TGAGCCGCGG GTTCTACCAG GCGGGCCTGC CAGACCTCAA CGCCGCACGC GCCGTGCTGG GCCTGGTGCC GTTATCAGAC CTGTTCGACC AACTGGGCGC GGCTCGGGCG ATCCTGCTGG CCACCAGCCG GGCCTTTGAT TTCGCCCCTT CCCCCCTGCC CGCGCCATTC GCCCACGTCG GCCCCTATTT GACCGATCCC GCCTGGGTCG AGGCGTTCGA GCCGCCGAAG GGTGAGGCCC CCTTGGTGGT GGTTTCGTTC TCCAGCCTCT ACCAGGCCCA GGAACCGGTC CTGCGATCGG TCATCACCGC CCTGTCCGAC CTGGAGGTCC GCGCGGTGGT CACGACCGGT CCCACGATTG ATCCCGAACA ATTCCAAGCG CCGCCCCACG TGATGGTGGT CCGCAGCGCT CCGCACGGCG CCCTGCTCGA CGACGCCGCC GTCTTCATCA CCCATGCCGG CCATGGTTCG ACCCTGCGTC CGCTGATGGC CGGAGTTCCC TTGCTCTGCC TACCCATGGG CCGCGACCAG CACGACAACG CGGCGCGCGC CATCCATCGC GGCGCCGGCC TGACCTTGCC CGCGGATTCC AAGCCGGAAC TCATCGGCGC GGCGGTTCGC CAGCTGCTCG ACGAGCCGCA CTTCAAGATT GCCGCCAGCG CCTTAGGCTC GGCGATCCTG GCTGAAGCCA GGCCGGATTC CGCATCCACA ACGCTGGAGG CGCTTCTGTT CGAAGCTCAA AAAAACTGA
|
Protein sequence | MARFLFTTWE GGGHVQPMLL VARELAARGH QVLALSDPCN APDAAALDVP FRAWTTAPFQ TGKTRDDDRL KDHQADSPLE VIQRLLDRVM TGPALAYARD TLAAIDAFAP DVVVSQELLF GPMAAAEARG LPLALLAANV WSLPTMSGAP PFGAGMLPAA NDEERAMHAM VGQMSRGFYQ AGLPDLNAAR AVLGLVPLSD LFDQLGAARA ILLATSRAFD FAPSPLPAPF AHVGPYLTDP AWVEAFEPPK GEAPLVVVSF SSLYQAQEPV LRSVITALSD LEVRAVVTTG PTIDPEQFQA PPHVMVVRSA PHGALLDDAA VFITHAGHGS TLRPLMAGVP LLCLPMGRDQ HDNAARAIHR GAGLTLPADS KPELIGAAVR QLLDEPHFKI AASALGSAIL AEARPDSAST TLEALLFEAQ KN
|
| |