Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4125 |
Symbol | |
ID | 5901587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4481356 |
End bp | 4482990 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641564646 |
Product | amidohydrolase 3 |
Protein accession | YP_001685747 |
Protein GI | 167648084 |
COG category | [R] General function prediction only |
COG ID | [COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.101406 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.239825 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCCGCA TCCTCACTCT CGCCGCCCTG ATGGCCTCGG CCAGCCTGGC TCCGGCGTTC GCCGGCGACA TCCTGATCCA CGGCGGCCCG ATCCACACCG GCGTCGCCGC CGCGCCCACG GCCCAGGCGG TGCTGATTCG CGATGACCGC ATCCTGTTCG TCGGGGATCT CTCCGCCGCC AAGGCCAGGG CCGCCAAGGG CGCCCGCGAC GTCGACCTGA AGGGCGCCGC CGCCTTCCCC GGCTTTGTCG ACGCCCACGC CCACCTGACC GGCATCGGCC TGCGCGAACT GACCCTCAAC CTGGACCGGA TCCAGTCGGT CGAGGCCCTG GTGGCCGCCG TGAAAGCCTA TGCCGACGCC CATCCGGATG GCCCGATCTA CGGCAGGGGC TGGATCGAGA CCCACTGGCC GGAAAAGCGC TTCCCCAACC GCGCCGACCT CGACCGGGCC GCGCCAGGCC GCGTCGTCGT GCTGGAACGG GCCGACGGCC ACGCGGTGGT CGTCTCCACC GCCGCCCTGG CCAAGGCCGG CGTCACCCAG GACACCGCCG CCCCGGCCGG CGGCCAGATC CTCAAGGGCC AGGACGGCGC GCCAGACGGC ATGCTGATCG ACCACGCTCA AAGCCTGGTG GCCGGGGTGA TCCCGCCGCC GTCCGACGCC CTCAAGCGCC AGGCCCTGGA GAAGGCCGGC GCGCTCTACG CCTCGCGCGG CTGGACGGGC CTGGGCAATA TGAGCGTCGA GGGGCCGGAT CTGGCGATCC TCACCAGCCT GGCGGCCGAC AAGACGTTCA GCCTGCGCGT CGATAACTAC ATGGATCCCA GCGGCGCGGC CGAGGTGCTG GCCAAGGGGC CATCGACCGA CGCCACGGGC CTGATCCGGG TGCGGGGGAT CAAGCTCTAC ATGGACGGCG CCCTGGGCTC GCGCGGCGCG GCGCTGCTCG AACCCTACAG CGACGCCGAG GGGCTGGGCC TGCAACTGAC CCCGCGCGAC AAGGGGCTGG CGCTGATGAA GGCCGCCAAG GCCGCCGGCG CCCAGGTGGC CATGCACGCC ATCGGCGACC GCGGCAATCG CATGACCCTG GACTGGTTCG AGGAGAGCCT GGCCGGGGAC ACCAAGGCCC GCTGGCGGAT CGAGCACGCG CAGATCGTCG CCGACACCGA CGTGCCGCGC TTCGCCAAGC TGGGGGTGAT CGCCTCGATG CAGCCCAGCC ACGCGATCGG CGACCTCTAT TTCGCCCCGG CCCGCCTGGG CAAGGATCGG CTGCACGAGG GCTATCGCTG GAAGGATTTC CTGGCCAGCG GCGCGGTGAT CGCCGCCGGC TCGGACGCCC CGGTCGAGGT CGGCGACCCG CGCATCGAGT TCTACGCCGC CGTCTATCGC CACAGCCTGG ACGGCTTCGC GGGCGCCGAC TGGCATCTGG ACGAGGCCGT CACCCGCGAT CAGGCCCTGC GCACGCTGAC CTGGGCCCCG GCCTACGCCG CCTTCGCCGA GCAGGATCGC GGCACGCTCG AGGCCGGCAA GAAGGCGGAC GTGACGGTGT TTTCGAAGGA CCTGATGACG GTGGCCCCGG CGGAGATCCT CAAGGCGCAG GCGGTGCTGA CGATGGTCGA CGGCAAGGTG GTGTTCGAGA AGTAG
|
Protein sequence | MRRILTLAAL MASASLAPAF AGDILIHGGP IHTGVAAAPT AQAVLIRDDR ILFVGDLSAA KARAAKGARD VDLKGAAAFP GFVDAHAHLT GIGLRELTLN LDRIQSVEAL VAAVKAYADA HPDGPIYGRG WIETHWPEKR FPNRADLDRA APGRVVVLER ADGHAVVVST AALAKAGVTQ DTAAPAGGQI LKGQDGAPDG MLIDHAQSLV AGVIPPPSDA LKRQALEKAG ALYASRGWTG LGNMSVEGPD LAILTSLAAD KTFSLRVDNY MDPSGAAEVL AKGPSTDATG LIRVRGIKLY MDGALGSRGA ALLEPYSDAE GLGLQLTPRD KGLALMKAAK AAGAQVAMHA IGDRGNRMTL DWFEESLAGD TKARWRIEHA QIVADTDVPR FAKLGVIASM QPSHAIGDLY FAPARLGKDR LHEGYRWKDF LASGAVIAAG SDAPVEVGDP RIEFYAAVYR HSLDGFAGAD WHLDEAVTRD QALRTLTWAP AYAAFAEQDR GTLEAGKKAD VTVFSKDLMT VAPAEILKAQ AVLTMVDGKV VFEK
|
| |