Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4033 |
Symbol | |
ID | 5901495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4370273 |
End bp | 4371154 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641564554 |
Product | NmrA family protein |
Protein accession | YP_001685656 |
Protein GI | 167647993 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0702] Predicted nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.027293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.618163 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCC TCGTTACCGG CGCCACCGGC CGTGTCGGCG GCCACGTCGT CCAGCAACTC GTCAATCGCG GCGCCGATGT GCGCGTCCTT GTCCGCGATC CCTCGAAGGC AGACTTCCCG GCCAACGTGG GGGTCGCCCA GGGCGACCTT CTCGACATCG ACGCGCTGCG CGCCGCCTTC ACCGGCGTCA AGACGCTGTT CCTGCTCAAC GCCGTGGCGG GAGACGAATT CACCCAGGCG CTGATCACCC TGAACGTCGC CCGCGAGTCC GGCGTCGAGC GGGTCGTCTA CCTGTCGGTG ATCCATGCCG ATCGCTTCGT GAACGTGCCG CACTTCGCGG TGAAGTTCGG CGCCGAGCGG ATGATCGAAC AGATGGGCTT TTCCGCCACG ATCCTGCGTC CCGCCTACTT CATCGACAAT GACCTGACGA TCAAGGACGT CATCCTCGAC CACGGCGTCT ATCCGATGCC GATCGGCGGC AAGGGTCTGG CCATGGTCGA CGCCCGCGAC ATCGCCGAGG TCGCGGCGAT CGAGTTGGTC CGCCGCGATC AGGCCCCGGG CAAGCTGCCG ATCGAAACCA TCAATCTGGT CGGCCCCGAG ACCCTGACGG GCTCCGACGT GGCCGCGATC TGGTCGGACG TCCTGGGCCG CCCCGTCGCC TATGGCGGCG ATGATCCCGC CGGGTTCGAG CAGAACCTGG CGAGCTTCAT GCCCAAATGG ATGGCCTATG AAATGCGCCT GATGGCCGAG CGTTTTGTCA GCGACGGCAT GATCCCGGAG GTCGGCGACG TCGAGCGCCT GACCAAGATC CTGGGCCGCC CTCTGCACCC GTATCGCGAC TTCGCGACCC AGATCGCCGC GGCCGCCTCG AAACCGGCCT GA
|
Protein sequence | MTILVTGATG RVGGHVVQQL VNRGADVRVL VRDPSKADFP ANVGVAQGDL LDIDALRAAF TGVKTLFLLN AVAGDEFTQA LITLNVARES GVERVVYLSV IHADRFVNVP HFAVKFGAER MIEQMGFSAT ILRPAYFIDN DLTIKDVILD HGVYPMPIGG KGLAMVDARD IAEVAAIELV RRDQAPGKLP IETINLVGPE TLTGSDVAAI WSDVLGRPVA YGGDDPAGFE QNLASFMPKW MAYEMRLMAE RFVSDGMIPE VGDVERLTKI LGRPLHPYRD FATQIAAAAS KPA
|
| |