Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4299 |
Symbol | |
ID | 5901760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4671764 |
End bp | 4672804 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641564817 |
Product | flagellin modification protein FlmD |
Protein accession | YP_001685917 |
Protein GI | 167648254 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3980] Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase |
TIGRFAM ID | [TIGR03590] pseudaminic acid biosynthesis-associated protein PseG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCCT TCCCCCGCAT CCTGTTCCTG GCCGACGCCG GGCCGCGCGT CGGGGGCGGC CACGTCATGC GCTGCCTGAC CCTGGCCCAG GCGCTCGAGG CCGAGGGCGC GACCTGCACC TTCCTGGTCA CCCCGGCGGG CAAGGCGTTG CTCGAGGCCT TCGGCGGCGG GATGCCCTAT GTCGAGACCG ACGACATCAC CCTGGCCGAG CTGATCGAAT GCGCCGCCGA GGTCGCCCGC GACCACGACG CCGTGGTCAT CGACCACTAT GGCCTGTCGG CCCTGGATCA CCTGGCGATC GCCGACGGCC GCCCGACCCT GGTCCTCGAC GACCTGGCCA ACCGGCCGCT GGCCGCCGAT CTGGTGCTGG ACTCCGGTCC GGCCCGCGAG GCCGAGGATT ACGTGGACTT GGTCGGACGG GAGACCGAGC TGCTGCTGGG TCCCGACAAC GCCCCGGTCC GGCCAGACTT CGCCGCCCTG CGCCGCGAGG CCCTGGCGCG TCGGGCCGGC GCGCCGCCGG TGCGGCGGAT CCTGGTCTCG CTGGGCCTGA CCGACCTGGA CGGGATCACC GGCCGCGTCG TGGACCTGAT GCTGCCGATC ACCGGAGATA GGACGCTGGA CGTCGTGCTC GGCTCCGGCG CGCCCAGCCT GCACCGCCTG CGGGGCCTGG CCACGCACGA ACCGCGCCTG AGGCTGCACG TCGACAGCCA GGACATGCCG CAACTCACCC TGGAGGCCGA CCTGGCCGTC GGGGCCGGCG GCTCGACCAG CTGGGAGCGT TGCGTCCTGG CCCTGCCGAC CCTGCTGCTG GTTCTGGCCG CCAACCAGCG CGAGGCCAGC CAAGCCCTGG CAGAAGCCGA CGCCGTCGTG GCCCTGGATG TCGCCGCGCC CGACTTCGAC GCCGCCTTCG CCGCCGAGCT CCGCCGCCTG ATCAATGACC CGCTGCTGCG CGACCGCCTG TCCTCGGCCT CGGCGGCGGT GTGCGACGGC CGCGGCGCGG CGCGGGTCGC GGCGCGGTTC CTGGCGCTGC TGCGGCGTTA G
|
Protein sequence | MTPFPRILFL ADAGPRVGGG HVMRCLTLAQ ALEAEGATCT FLVTPAGKAL LEAFGGGMPY VETDDITLAE LIECAAEVAR DHDAVVIDHY GLSALDHLAI ADGRPTLVLD DLANRPLAAD LVLDSGPARE AEDYVDLVGR ETELLLGPDN APVRPDFAAL RREALARRAG APPVRRILVS LGLTDLDGIT GRVVDLMLPI TGDRTLDVVL GSGAPSLHRL RGLATHEPRL RLHVDSQDMP QLTLEADLAV GAGGSTSWER CVLALPTLLL VLAANQREAS QALAEADAVV ALDVAAPDFD AAFAAELRRL INDPLLRDRL SSASAAVCDG RGAARVAARF LALLRR
|
| |