Gene Information |
Locus tag | Caul_2081 |
Symbol | |
ID | 5899536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2228539 |
End bp | 2230992 |
Gene Length | 2454 bp |
Protein Length | 817 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562570 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001683707 |
Protein GI | 167646044 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
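The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 2228539, 2230992

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2454, matching "Gene Length"
print(protein_length(length_bp))  # 817, matching "Protein Length"
```

Under these assumptions the reported gene length (2454 bp) and protein length (817 aa) agree with the coordinates.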
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.994259 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCTAG GTTTGAAGCT CAAGAAGTCC GCCGGCGTAC CGGTGACCGC TGCGGTCCAC CGCCTGATTG CCGCCGCCAA CGCCGCGCGC GCCCGGTCGG ACTGGCGCGC CGCAGCCAAG CATTACGACG CCGCCCTCAG GCGCGACCCG CACCTTGGGC ACGTCTGGAT CCAACTGGGG CATGCCCTGA AGGAAAGCGG TGAGTTGTCG GCGGCCGACC GAGCCTATCA TCGTGCTGAA AGCCTGCGGC CGGACGACGC CGACGCACAC CTGCACCTGG GCCATGTGGC CAAGCTACGC GGCGATGTCG CGGGCGCGAT TCGCAGCTAT CTGACCGCTG CGCGGCTGGC GCCCAAGGCC CCCCACGCCA TCGGCGAACT GCACACCCTC ATCGCCAACG GAGCCAATGT GCCAATCGAA GCCATCCAGG GCCTGATCGA CCTCGAGGAT GACCCGATCA CGCGCTCGCC GCCAATGAGC AGCGCCATCG CGGCAGCCCA GACCGCGATG ACTGCCCTGG TGACAGCCCT CAAGCAGCAG GGCGGTCAGC CCGGCGCGCT GGAACGCGCG ACCTCGGCCG CCCACCTGAT CGCCGATCTC GCCAGTGATC CGGTCTCGTC CGGCTCCCAA GACTCGGGGC CAGCCCTGAT CTTCGACGTC TCGGACCTGC TGACCTATTT CCGTAACGTA CGTTCGCCAA CGGGCATTCA GCGCGTCCAG ATCGAGATCA TCCTCAGCAG CCTGCAGTCC GGCAATACGG CGGTTCGCAT CTGCTGTTTC CTGGAACAAC GCGACGAATG GGTCGAGATC CCCGCCCCGC TTTTCCTGCG GCTGAGTTGG CTCAGCCTGG GCGACGCCGA GGACGATGGC GGCGAATGGA CCGCCGCCCT GACCCGCACG CTGTTGCTGC TTAACATGGC GCCGCCGCTT GATTTTCCGA GGGGGGCCTT CCTGATCAAT CTGGGCACCT CCTGGTGGCT CCAGAACTAT TTCCTGTTCG TGCGGCGGAT CAAACGCGAG CGCGGTGTCC GTTACATCCC GTTTGTGCAC GACATGATTC CGGTCCTGCA CGGGGAGTTC TGTCCCAAGG TGCTGACCCA GGACTTCATC TCCTGGGCGA TCGGCGTCTT CGAGCATGCC GATTTCTTCT TCGTGAACTC GCAGTCCACC CGACGCGACC TGATCAAGGT GGGCGCGTTC CTGGGGCGCG AAATCGATCC GCTAGCCATC TCGGTCGTGA CGCTGGACGC CGACACACGC AAGCCCGACG CGCCTGCGCC GCGCGGGAAG ATATTGCGGC GTTGGGGGCT CAACGCCATC CCCTACGTGC TGTTCGTCTC CACCATCGAG CCACGGAAAA ACCACCTGCG TGTCTTCGAG GCGTGGATCG CGCTCCTCAA GCGCCATGGC TCGCGCAAGA CGCCCAAACT GGTCTGTGTG GGCCATCCGG GTTGGCTCAA CGACAGCATA CACGATCAGT TGAACGCCCA TGAGGACCTG CGCGCTCACG TCCAGGTGTT GCGTTTCGTG TCGGACGCCG ACCTGGCCGA ACTCTACAGC GGCTGCCTGT TCACCCTCTA TCCCAGCCAT TACGAGGGTT GGGGCCTGCC GGTCACCGAA TCCCTCTGCT ACGGCAAGGC GCCCTTGGTC GCCAACACCT CGTCCCTGCC CGAGGCCGGA GGCCGCTTCG CGGTCTATTT CAACCCGGAT TCCACGGTCG AACTGATCGC CGCGTTGGAG ACCCTGGCCT TCGACCACGA GGCGCGGCGC GCGCGCGAAC GGCTGATTAC GGCCGAGTTC 
AAGCCCCGCG GCTGGGCCGT GCTGGCGCGG CAGATGGCTG ACGACCTCGT CGCCTGGGAA GGCGTCGGTC GCCCCGTCCT CGGCGCCGAG GCGCCGGCCG CGCTTGTGGG CGCCTACCAC TCGCTCGGCC GCAATCTGAA AACCAGGGTC TGGCCGGGCA TGCGGTCGGG CGAGGTATAT CGCAGCGGTC CCAATTGGTG GGGTCCCGAC AACTGGGGGT GCTGGACCAA GCCGGGCGGT TCGACCCTCC GCATGACCGT GCCGCAGCCG GGACCCATAA TCGCCTATCT GCACCTGCAG GGGCTACCCG CTCAACGCTG CGGTTTTGTC GTCAAGACGA CGGGCGACGC GATCGTGCGA CACGGCGAGA TCGACCGCGG CCAGCACAAA TGGCTGGCGA TCGAGATCGC CCCCGACGAG TCCGAACCGC GCACCGTGAC GCTGGAAATC GAGGGAACGG CTTGCGAGAG CCTGGCGAAC GTCACCGACA ATTCGGACGC CCGCGTGGTC TCGCTCGGTG TGGCGGGCTT CTTCCTTTGC CGGGCCGATG ATCCGGCCGC CCGGGCCGCT TTCCTTGAAG CGGTAGCGAT CGGGAACATT CACGATCTCG ACTTCAGTCG GGAGCCCCTC GAATACACCC CCTTGATCTC GTGA
|
Protein sequence | MVLGLKLKKS AGVPVTAAVH RLIAAANAAR ARSDWRAAAK HYDAALRRDP HLGHVWIQLG HALKESGELS AADRAYHRAE SLRPDDADAH LHLGHVAKLR GDVAGAIRSY LTAARLAPKA PHAIGELHTL IANGANVPIE AIQGLIDLED DPITRSPPMS SAIAAAQTAM TALVTALKQQ GGQPGALERA TSAAHLIADL ASDPVSSGSQ DSGPALIFDV SDLLTYFRNV RSPTGIQRVQ IEIILSSLQS GNTAVRICCF LEQRDEWVEI PAPLFLRLSW LSLGDAEDDG GEWTAALTRT LLLLNMAPPL DFPRGAFLIN LGTSWWLQNY FLFVRRIKRE RGVRYIPFVH DMIPVLHGEF CPKVLTQDFI SWAIGVFEHA DFFFVNSQST RRDLIKVGAF LGREIDPLAI SVVTLDADTR KPDAPAPRGK ILRRWGLNAI PYVLFVSTIE PRKNHLRVFE AWIALLKRHG SRKTPKLVCV GHPGWLNDSI HDQLNAHEDL RAHVQVLRFV SDADLAELYS GCLFTLYPSH YEGWGLPVTE SLCYGKAPLV ANTSSLPEAG GRFAVYFNPD STVELIAALE TLAFDHEARR ARERLITAEF KPRGWAVLAR QMADDLVAWE GVGRPVLGAE APAALVGAYH SLGRNLKTRV WPGMRSGEVY RSGPNWWGPD NWGCWTKPGG STLRMTVPQP GPIIAYLHLQ GLPAQRCGFV VKTTGDAIVR HGEIDRGQHK WLAIEIAPDE SEPRTVTLEI EGTACESLAN VTDNSDARVV SLGVAGFFLC RADDPAARAA FLEAVAIGNI HDLDFSREPL EYTPLIS
|
| |
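The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, using only the Python standard library, that removes the grouping and computes a GC fraction. The helper names are hypothetical, and the example runs on the first two groups of the gene sequence only, so its result is not expected to match the 66% GC content reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a grouped sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two groups of the gene sequence from this record (illustration only).
first_groups = "ATGGTCCTAG GTTTGAAGCT"

seq = clean_sequence(first_groups)
print(len(seq))                    # 20
print(round(gc_fraction(seq), 2))  # 0.45
```

Applying the same two functions to the complete gene sequence would give a value to compare against the GC content field.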