Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2521 |
Symbol | |
ID | 5899976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2735008 |
End bp | 2736126 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641563012 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_001684146 |
Protein GI | 167646483 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCGCC CGCGCCCTCC CGCCAATAAA GGGGCGCCGT CCAGGGCGCC GTCGCCCCGC AAGGCGCCTC GCAAGCCGAG CCCACGCCGG CGGGAGTCCG CGCCCATCGG TCGCCGCCTG GTCGCCATGG GCGGCGGCCT GCTGACCATG CTGGGCGTGG CCGCCCTGGC GGTCGTGCTG GGGGCGGTGT GGCTCTACCA GGGGCCCGGC CCGGCGGCGC GTTCGGGCGA GGTCACCACC GTCGTCCTGC GTCGCGGCGC CAGCCTGCCC GAGATCGCCT CGACCCTGGA GCAGGCCGGA GTGATCCGCT CGTCCTCGAT CTTCCTGACC GCCGCCCAGA CCACCGGCGC GGCGCGGCGG CTGAAGGCCG GCGAATATGA GTTCCCGTCG CGCGCTTCGC TGCGCCAGGT TCTGGGCAAG ATCCGCGACG GCAAGATCGT GCGCCACCAC GTGACGATCG CCGAGGGCCT GACCTCGGAC ATGGTGGTCG ATATTCTGAT GCGCGCGCCT GAGTTGACCG GCACCGTGCC GACCCCGCCG GAAGGCTCGA TCCTGCCCGA GACCTATCAG GTCCAGCGCG GCGAGGACCG CGCGGCGGTG CTGCAGCGGA TGATGGACGA CCGCGACGCC CTGCTGGACA AGCTGTGGGC GCAGCGCCAG CCGGGCCTGC CGTTCGAGAC CAAGGATCAG GCCGTGACCA TGGCCTCGAT CGTCGAGAAG GAAACCGGCC TGGCCGCCGA GCGTCCGCAT GTGGCGGCGG TGTTCATCAA CCGCCTGCGC CAGGGGATCC GCCTGGGCAG CGACCCGACC ATCATCTACG GCCTGACCCG CGGTCGGCCG CTGGGCCGCG GCATCCTGCA GTCGGAACTG CAGCGCCAGA CGCCCTACAA CACCTATCTG ATCGAGGGTC TTCCACCGAC CCCGATCGCC AATCCCGGCA AGGCCGCTCT GGAAGCCGTG CTCAATCCGA TGAAGAGCAA TGACCTCTAC TTCGTCGCCG ACGGCACGGG CGGCCACGTC TTCGCCTCGA CCTATGCGGA GCACGAGCGC AATGTCGCCA GGTGGCGGCA GGTCGAGCGC TCGAAAGCGG TCGCGAAGAT CCCGTTGGCC GGAGGCTAG
|
Protein sequence | MSRPRPPANK GAPSRAPSPR KAPRKPSPRR RESAPIGRRL VAMGGGLLTM LGVAALAVVL GAVWLYQGPG PAARSGEVTT VVLRRGASLP EIASTLEQAG VIRSSSIFLT AAQTTGAARR LKAGEYEFPS RASLRQVLGK IRDGKIVRHH VTIAEGLTSD MVVDILMRAP ELTGTVPTPP EGSILPETYQ VQRGEDRAAV LQRMMDDRDA LLDKLWAQRQ PGLPFETKDQ AVTMASIVEK ETGLAAERPH VAAVFINRLR QGIRLGSDPT IIYGLTRGRP LGRGILQSEL QRQTPYNTYL IEGLPPTPIA NPGKAALEAV LNPMKSNDLY FVADGTGGHV FASTYAEHER NVARWRQVER SKAVAKIPLA GG
|
| |