Gene Information |
Locus tag | Caul_4541 |
Symbol | |
ID | 5902002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4914822 |
End bp | 4916072 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641565060 |
Product | 3-deoxy-D-manno-octulosonic-acid transferase domain-containing protein |
Protein accession | YP_001686159 |
Protein GI | 167648496 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1519] 3-deoxy-D-manno-octulosonic-acid transferase |
TIGRFAM ID | |
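The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the usual convention for such records, not stated on this page) and that the final codon is a stop codon that does not contribute a residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4914822
end_bp = 4916072

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# The gene is not spliced, so the CDS is one contiguous run of codons.
# Assumption: the last codon is a stop codon and adds no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1251 bp
print(protein_length, "aa")   # 416 aa
```

Both results match the Gene Length and Protein Length fields of the record.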
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.200224 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGGCTCTCT ACCGCGCGGC CACCGGCGCG CTGGAGCCGT TCGCGCCCTT CCTCCTGGAA CGCCGCGCCA AGGCCGGCAA GGAGGACCGC GCGCGGCTGA ACGAGCGCCT GGCCCGGCCG ACCACGCCGC GACCGGACGG TCCGCTGGTC TGGCTGCACG GGGCCAGCGT CGGCGAGAGC CTGTCGATCC TGCCGCTGGT CGACCGCCTG CGCGCCGAGC GGCCGGACGT CCAGGTACTG GTCACGTCCG GCACGGTGAC CTCGGCCGAG CTCTTGGCAC GGCGCCTGCC GGCCGGGGCG ATCCACCAAT ATCTGCCGGT CGACACCCCC CGCGGCGCCC GGCGGTTCCT CGACCACTGG CGGCCCAGCC TGGCGGTCTT CGTCGAGAGC GAGCTGTGGC CCAACCTGCT GCTGACCGCC AAGGCGCGCG GCGTGAAGCT GGCCCTGGTC TCGGCCAAGC TGTCGGACAG GAGCTACGCC CGCTGGCGAG CCCGGCCGTT CGCGGCCCAT GAACTGTTCA GCGGCTTCGA CCTGATCCTG GCCCAGGACG CCCGCGCCGC CGAGCGTCTG GCCAGCCTGG GCGGCGCGGT GGGCGGCGAG GCCGACCTGA AGTTCGGCGC CGCGCCCCTG CCCGTCGATG AGGCGGCGCT GACCAGCCTG CGCGTGCGGC TCAGCGACCG GCCCGTCCTG CTGGCCGCCA GCACCCATCC GGGCGAGGAC GAGATCGTGC TGCGGGCCTG GGGCGCCCTG GCGAGCCGCC CGCGCCTGGT GGTCGTCCCG CGCCACCCCG AACGCGGCCC GGCCATCGCC GACCTGGCGC TGGCGACCGG CACCACCGTC TGCCTGCGCA GCCTGGAGCC GGACGACTCC GCCGACATCA TCGTCGCCGA CACCCTGGGA GAGCTGGGCC TGTGGTACCG CCTGGCCGAC CTGGCCCTGG TGGCCGGCAG CCTGGTGGCC GGGATCGGCG GCCACAATCC GCTGGAACCG GCCCGCCTGG CCTGCCCGAT CGTCTCGGGG CCGCATATCG AGAACTGGCT GACCGCCTAT GCCGACCTGC GGGCCGAGGA CGCCGTGGCC TTCGCCGACG CCTCGGTGCT GGGCGCGCGC CTGGCCGACC TGCTGGCCGG GCCGGAGATC ATGCGGCTGC AGGCGGCTCG CGCCCAGGCC TTCGTCGCCC GCCGCGACGC CGAGGCCCGC GCCGGACTCG ACCGGATCCT GGAGCTTCTC GACGCGGAAG GCGGGGCATG A
Protein sequence | MALYRAATGA LEPFAPFLLE RRAKAGKEDR ARLNERLARP TTPRPDGPLV WLHGASVGES LSILPLVDRL RAERPDVQVL VTSGTVTSAE LLARRLPAGA IHQYLPVDTP RGARRFLDHW RPSLAVFVES ELWPNLLLTA KARGVKLALV SAKLSDRSYA RWRARPFAAH ELFSGFDLIL AQDARAAERL ASLGGAVGGE ADLKFGAAPL PVDEAALTSL RVRLSDRPVL LAASTHPGED EIVLRAWGAL ASRPRLVVVP RHPERGPAIA DLALATGTTV CLRSLEPDDS ADIIVADTLG ELGLWYRLAD LALVAGSLVA GIGGHNPLEP ARLACPIVSG PHIENWLTAY ADLRAEDAVA FADASVLGAR LADLLAGPEI MRLQAARAQA FVARRDAEAR AGLDRILELL DAEGGA
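The gene sequence begins with TTG rather than ATG, yet the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the accepted initiation codons, and an initiation codon is translated as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence only. It is not the database's own pipeline; the names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical helpers, and the code assumes the input starts at the first base of the CDS.

```python
from itertools import product

# Codon-to-amino-acid mapping for translation table 11, written in the
# compact NCBI layout (bases ordered T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS fragment that starts at the first base of the gene.

    Spaces (the 10-base grouping used in the record) are ignored, and any
    trailing partial codon is dropped.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    # An initiation codon is read as methionine regardless of its usual meaning.
    first = "M" if codons[0] in START_CODONS else CODON_TABLE[codons[0]]
    return first + "".join(CODON_TABLE[c] for c in codons[1:])


# First three 10-base groups of the gene sequence in this record.
prefix = "TTGGCTCTCT ACCGCGCGGC CACCGGCGCG"
print(translate(prefix))  # MALYRAATGA
```

The output matches the first ten residues of the protein sequence listed above. Within a coding region TTG would normally be read as leucine, which is why the start position has to be handled separately.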