Gene Information |
Locus tag | Caul_4294 |
Symbol | |
ID | 5901755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4668262 |
End bp | 4669632 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641564812 |
Product | hypothetical protein |
Protein accession | YP_001685912 |
Protein GI | 167648249 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3660] Predicted nucleoside-diphosphate-sugar epimerase |
TIGRFAM ID | |
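The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4668262
END_BP = 4669632
REPORTED_GENE_LENGTH = 1371      # bp
REPORTED_PROTEIN_LENGTH = 456    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under those assumptions the computed values agree with the reported 1371 bp and 456 aa.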
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.954414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCGGGA TCACCGACGT GGTTGAAAAG AAGACATCCG AAGCGGCGAA GCCCAAGGTG AGGGCCGCTC GCAAGGCTCC GGCCAAGGCT TCGGCCAGGA CGGCGCCGAA GGTCGCCGTC GCCTCGAAGG ACGCCAAGCC GGTTGAAGCC GTGACCGCGC CTTCCGCCGA AGCCAAGATC GCGCCCAAGC CGCGCACGGC CGCCAGACCG AAGTCGCGCC CAGGCGCCGT CGAGACGCCG CCCGCGCCGA CCGGCGATGC GCGCCCCTCG GCGACAGTCC CCGAGACGAA GCCCAAGCCC GCGCCGCGCC GCGAGGTCGC CCAGAAGTCC GAGCCCGCCG CCGCCCCCGT TCCCGAGACT CCCGCCCGCG AACCGATCGT CATCTGGGCG ATCTCGGACG GCCGGGCCGG CATCGAGGCC CAGGCCGTCG GCCTGGCCGA GGCCGTGGCC CGTCAGGTCC CCGCCCAGAT CGTGATCAAG CGCGTCGGCT GGAGCGGCCG GACCGGTCGC CTGCCCTGGT GGGCCAACTG GCTGCCGCGC CGCTGGCTGA CCCCCGACAG CGGCATAGAG CCGCCCTGGC CCGACCTGTG GATCGCCGCC GGCCGCGCCA CCCTGCCCCT GTCGATCCGC GCCAAGCGCT GGTCGTCCGG CAAGACCTAT GTCGTGCAGA TCCAGGACCC GCGCGTCCCG GCCACCATGT TCGACCTGGT GATCCCGCCC AAGCACGACC GACTGTCCGG CGACAACATC CTGCCGATCA CCGGCTCGCC GCATCGCGTG ACCAGCCAGC GGCTCGAGAC GGAATACGAG AAATTCAAGG ACCAGATCGA CGCCCTGCCC CGCCCGCGCG TGGCGGTGCT GTTGGGCGGC AAGTCCAGGG CCTTCGACCT GTCGGCCCTG CGCGCCGCCG AGATGGCCCA CCAGATCCAG CTGCCGCTGG AGCAGGAGGG CGGATCGCTG CTGATGACCT TCTCGCGGCG CACGCCCGAC CAGGCCAAGG CCCTGCTGAC CGCCCGCCTG CGCCACCTGC CCGGGATCAT CTGGGACGGC GAAGGCCCAA ACCCCTATTT CGCCTTCCTG GCCGCCGCCG ACTACATCCT GGTCACCGAG GACTCGACCA ACATGGCCAC CGAGGCCGCC TCGACGGGCA AGCCGGTGTT CATCCTCAAG ATGGACGGCC AGAGCCTGAA GTTCCGGCTG TTCCACCAGG AGCTGGAGAG CATGGGCGCC GCCCGCCCCT ACGGCGGGGC CTTCCACGGC TGGACCTACG AGCCGGTCGA CGAGACCGGC CGCGCGGCGG CCGAGGTGGT GGCGCGGATG GACGGGCGGG GGGCGCGCTC TGGCGTCCCG CCACAACTTC CCCTAGCTTG A
Protein sequence | MAGITDVVEK KTSEAAKPKV RAARKAPAKA SARTAPKVAV ASKDAKPVEA VTAPSAEAKI APKPRTAARP KSRPGAVETP PAPTGDARPS ATVPETKPKP APRREVAQKS EPAAAPVPET PAREPIVIWA ISDGRAGIEA QAVGLAEAVA RQVPAQIVIK RVGWSGRTGR LPWWANWLPR RWLTPDSGIE PPWPDLWIAA GRATLPLSIR AKRWSSGKTY VVQIQDPRVP ATMFDLVIPP KHDRLSGDNI LPITGSPHRV TSQRLETEYE KFKDQIDALP RPRVAVLLGG KSRAFDLSAL RAAEMAHQIQ LPLEQEGGSL LMTFSRRTPD QAKALLTARL RHLPGIIWDG EGPNPYFAFL AAADYILVTE DSTNMATEAA STGKPVFILK MDGQSLKFRL FHQELESMGA ARPYGGAFHG WTYEPVDETG RAAAEVVARM DGRGARSGVP PQLPLA
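The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction comparable to the "GC content" field; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The first two blocks of the gene sequence, used here only as a short example.
example = "ATGGCCGGGA TCACCGACGT"
joined = clean_sequence(example)
print(joined, len(joined), gc_fraction(joined))
```

Pasting the full gene sequence into `clean_sequence` and checking `len(...)` against the reported gene length is a quick way to confirm that no blocks were lost in copying.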