Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3705 |
Symbol | |
ID | 5901161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4000253 |
End bp | 4001554 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564216 |
Product | allantoate amidohydrolase |
Protein accession | YP_001685330 |
Protein GI | 167647667 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGGAGG CCTTCGAGAT CGGCGACTTC AAGCCAGGGA TCCGGGCCAA GGCCCGCTGC GACCTGATGG GCCTGGCGCC CTACAGCGAG GCCGACGGCA TGCTGGTGCG CCGCTTCCTG ACCCCCGCCC ATGATGAAGC CCTCAAGACC CTAGCCTTCT GGATGGAGCA GGCCGGCATG AGCGCCCGCC GCGACCACGC TGGCAACCTG GTCGGTCGCT ATGAGGGCGA GACGCCCAAC GCCCCGGCCC TGCTGATCGG CTCGCACATC GACAGCGTCC GCAACGGCGG CCGCTACGAC GGCGCACTGG GCGTGATGCT GGGGATCGAC CTGGTCGAGG CCTTGAGCAT CGCCGGCCGC CGCCTGCCCT TCGCCGTCGA GGTGATCGCG TTCGGCGACG AGGAGGGCTC GCGCTTCCCC GCCTCCATGA CCTGCAGCCG CGCCGTGGCC GGGACCGTCG ATCCCATGGT CATGGAGATG ACCGACGGCG ACGGCGTTTC GCTGGCCGAA GCATTCGCCG GGTTTGGCCT GGATCCGACG CGCCTGGAGG AGGCCGCCCG CAAGCCGGGT GAGATCTTCG CTTTCCTCGA GGCCCATATC GAACAGGGAC CCGTCCTCGA GGCCGAGGGC ATGGCCCTGG GCGTGGTCAC CGCCATCGCC GCCCAGAAGC GGCTGATGGT GCGGTTCACG GGAATGGCCG GCCATGCGGG CACCACGCCA ATGAACCTGC GCAAGGATCC CGGCCCCGCC GCCGCCGAGG CGATCCTGGC GCTCGAGCGG ATCTGCGCGC CCCAAGGCGA TTTTGGCGGC AAGGACGGGC TCGTGGGCAC CGTCGGCCGG ATCACCGCCC TGCCCGGCGC CTTCAACGTC ATTCCCGGCG CGGTCGAATA TTCGATGGAT GTCCGGGCCG AGGTCGCGGC CACGCGTGAT GCTGCCATCG ACGCCGTCAC CACCGAGATC CAGGCCATCG CCGCGCGGCG CGGCCTGGAG GTCTCGGTCA CCCTGATGCA GGACTTGGCC GCCAGTCCCT GCGACCCCGG CCTGACCGCC CTGTTGGAAG CCGCCGTCGC CGCGACGGGC CAGGCGCCGC GCCGCCTGCC CAGCGGGGCC GGCCACGACG CCATGGTCAT CGCCGACCTC TGCCCCACCG CCATGCTGTT CATCCGCTGC GAGGGCGGGA TCAGCCACAA CCCGCGCGAG GCCGTGACCG AGGCCGACTG CGCGGTCGCG GCCGAGGCGA TGTTGGGGTT CGTGGAGCGG CTGGCGACGC GAAATCCTCG TCCTTCGACA AGCTCAGGAT GA
|
Protein sequence | MAEAFEIGDF KPGIRAKARC DLMGLAPYSE ADGMLVRRFL TPAHDEALKT LAFWMEQAGM SARRDHAGNL VGRYEGETPN APALLIGSHI DSVRNGGRYD GALGVMLGID LVEALSIAGR RLPFAVEVIA FGDEEGSRFP ASMTCSRAVA GTVDPMVMEM TDGDGVSLAE AFAGFGLDPT RLEEAARKPG EIFAFLEAHI EQGPVLEAEG MALGVVTAIA AQKRLMVRFT GMAGHAGTTP MNLRKDPGPA AAEAILALER ICAPQGDFGG KDGLVGTVGR ITALPGAFNV IPGAVEYSMD VRAEVAATRD AAIDAVTTEI QAIAARRGLE VSVTLMQDLA ASPCDPGLTA LLEAAVAATG QAPRRLPSGA GHDAMVIADL CPTAMLFIRC EGGISHNPRE AVTEADCAVA AEAMLGFVER LATRNPRPST SSG
|
| |