Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4805 |
Symbol | |
ID | 5902267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5198326 |
End bp | 5199345 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641565325 |
Product | glycosyl transferase family protein |
Protein accession | YP_001686423 |
Protein GI | 167648760 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACTC AAAAATCGAC CTTCGACGAC GTGCTGATCG TCATTCCCTG TCTCAACGAG GCCCGGCACC TGCCCGGACT GTTGACGGTG CTGGGGCGGG AGGCGCCGGC GGCGCTGATC GTCGTGGCGG ATGGCGGCAG CACCGACGGC AGCCTCGACA TCGTTCGGGA CTTCGCCGCG CGCGGCGCCC GAGTCCAGCT AATGGAGAAC CCGCGCCGCA TCCAGAGCGC CGGCGTCAAT CTGGCCGCTC GCCGGTTCGG GGCCGGTCGC AGCTGGATGA TCCGCGTGGA CGCCCACTGC GGTTATGGTC CCGGCTTCCT GACCGGGCTG CTGGCGGCCG CGGACCGGAC AGGAGCCACC TCGGTGGTCG TGCCGATGGC GACGGAGGGC GAGACCTGCT TCCAGAAGGC CTGCGCCGCG GCCCAGAATT CGGTGCTGGG CACCGGCGGT TCGGCGCACC GGCGCCTGGG CGACGGGCAG TTCGTCGACC ATGGCCATCA CGCCCTGTTC CGGCTGGAGG CGTTCCTGGC GGCGGGCGGC TACGACGAGA CCTTCAGCCA CAACGAAGAC GCCGAACTGG ACGCCCGACT GGTCCAGGCG GGCGCCCGGA TCTGGCTCGA GCCGGCCGCG GCGATCGTCT ACTACCCGCG CCGGACGCCG GGGGCGCTGT TTCGGCAGTA CATCAAATAC GGCGAGGGGC GGGCGAAGAC CATCCAGCGT CACCGGCCAA AACTGAAGGT CCGGCAGATG TTGCCCCTGG TCGTGGCCCC GGCGGTGCTG GTCGCCCTGG CCGGGTTCGC CTGGCCGCCG CTGGCCCTGC CGGCCCTGAT GTGGGCCGCG CTCTGCCTGG GATTCGGCGT CCTGCTCGGC GTGCGCCAAC GCAGCCCTTG CGCGGCGCTG GCCGGCGTGG CGGCGATGAT CATGCACTTC GCGTGGTCGG CCGGTTTCCT GCGCCAGATG CTGCTGGGCC GCCGCCCCGG CGCGACGCCC GTCGCGCTGA GCACGGAGCC CGCCGGATGA
|
Protein sequence | MTTQKSTFDD VLIVIPCLNE ARHLPGLLTV LGREAPAALI VVADGGSTDG SLDIVRDFAA RGARVQLMEN PRRIQSAGVN LAARRFGAGR SWMIRVDAHC GYGPGFLTGL LAAADRTGAT SVVVPMATEG ETCFQKACAA AQNSVLGTGG SAHRRLGDGQ FVDHGHHALF RLEAFLAAGG YDETFSHNED AELDARLVQA GARIWLEPAA AIVYYPRRTP GALFRQYIKY GEGRAKTIQR HRPKLKVRQM LPLVVAPAVL VALAGFAWPP LALPALMWAA LCLGFGVLLG VRQRSPCAAL AGVAAMIMHF AWSAGFLRQM LLGRRPGATP VALSTEPAG
|
| |