Gene Caul_4125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4125 
Symbol 
ID5901587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4481356 
End bp4482990 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content72% 
IMG OID641564646 
Productamidohydrolase 3 
Protein accessionYP_001685747 
Protein GI167648084 
COG category[R] General function prediction only 
COG ID[COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.101406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.239825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCCGCA TCCTCACTCT CGCCGCCCTG ATGGCCTCGG CCAGCCTGGC TCCGGCGTTC 
GCCGGCGACA TCCTGATCCA CGGCGGCCCG ATCCACACCG GCGTCGCCGC CGCGCCCACG
GCCCAGGCGG TGCTGATTCG CGATGACCGC ATCCTGTTCG TCGGGGATCT CTCCGCCGCC
AAGGCCAGGG CCGCCAAGGG CGCCCGCGAC GTCGACCTGA AGGGCGCCGC CGCCTTCCCC
GGCTTTGTCG ACGCCCACGC CCACCTGACC GGCATCGGCC TGCGCGAACT GACCCTCAAC
CTGGACCGGA TCCAGTCGGT CGAGGCCCTG GTGGCCGCCG TGAAAGCCTA TGCCGACGCC
CATCCGGATG GCCCGATCTA CGGCAGGGGC TGGATCGAGA CCCACTGGCC GGAAAAGCGC
TTCCCCAACC GCGCCGACCT CGACCGGGCC GCGCCAGGCC GCGTCGTCGT GCTGGAACGG
GCCGACGGCC ACGCGGTGGT CGTCTCCACC GCCGCCCTGG CCAAGGCCGG CGTCACCCAG
GACACCGCCG CCCCGGCCGG CGGCCAGATC CTCAAGGGCC AGGACGGCGC GCCAGACGGC
ATGCTGATCG ACCACGCTCA AAGCCTGGTG GCCGGGGTGA TCCCGCCGCC GTCCGACGCC
CTCAAGCGCC AGGCCCTGGA GAAGGCCGGC GCGCTCTACG CCTCGCGCGG CTGGACGGGC
CTGGGCAATA TGAGCGTCGA GGGGCCGGAT CTGGCGATCC TCACCAGCCT GGCGGCCGAC
AAGACGTTCA GCCTGCGCGT CGATAACTAC ATGGATCCCA GCGGCGCGGC CGAGGTGCTG
GCCAAGGGGC CATCGACCGA CGCCACGGGC CTGATCCGGG TGCGGGGGAT CAAGCTCTAC
ATGGACGGCG CCCTGGGCTC GCGCGGCGCG GCGCTGCTCG AACCCTACAG CGACGCCGAG
GGGCTGGGCC TGCAACTGAC CCCGCGCGAC AAGGGGCTGG CGCTGATGAA GGCCGCCAAG
GCCGCCGGCG CCCAGGTGGC CATGCACGCC ATCGGCGACC GCGGCAATCG CATGACCCTG
GACTGGTTCG AGGAGAGCCT GGCCGGGGAC ACCAAGGCCC GCTGGCGGAT CGAGCACGCG
CAGATCGTCG CCGACACCGA CGTGCCGCGC TTCGCCAAGC TGGGGGTGAT CGCCTCGATG
CAGCCCAGCC ACGCGATCGG CGACCTCTAT TTCGCCCCGG CCCGCCTGGG CAAGGATCGG
CTGCACGAGG GCTATCGCTG GAAGGATTTC CTGGCCAGCG GCGCGGTGAT CGCCGCCGGC
TCGGACGCCC CGGTCGAGGT CGGCGACCCG CGCATCGAGT TCTACGCCGC CGTCTATCGC
CACAGCCTGG ACGGCTTCGC GGGCGCCGAC TGGCATCTGG ACGAGGCCGT CACCCGCGAT
CAGGCCCTGC GCACGCTGAC CTGGGCCCCG GCCTACGCCG CCTTCGCCGA GCAGGATCGC
GGCACGCTCG AGGCCGGCAA GAAGGCGGAC GTGACGGTGT TTTCGAAGGA CCTGATGACG
GTGGCCCCGG CGGAGATCCT CAAGGCGCAG GCGGTGCTGA CGATGGTCGA CGGCAAGGTG
GTGTTCGAGA AGTAG
 
Protein sequence
MRRILTLAAL MASASLAPAF AGDILIHGGP IHTGVAAAPT AQAVLIRDDR ILFVGDLSAA 
KARAAKGARD VDLKGAAAFP GFVDAHAHLT GIGLRELTLN LDRIQSVEAL VAAVKAYADA
HPDGPIYGRG WIETHWPEKR FPNRADLDRA APGRVVVLER ADGHAVVVST AALAKAGVTQ
DTAAPAGGQI LKGQDGAPDG MLIDHAQSLV AGVIPPPSDA LKRQALEKAG ALYASRGWTG
LGNMSVEGPD LAILTSLAAD KTFSLRVDNY MDPSGAAEVL AKGPSTDATG LIRVRGIKLY
MDGALGSRGA ALLEPYSDAE GLGLQLTPRD KGLALMKAAK AAGAQVAMHA IGDRGNRMTL
DWFEESLAGD TKARWRIEHA QIVADTDVPR FAKLGVIASM QPSHAIGDLY FAPARLGKDR
LHEGYRWKDF LASGAVIAAG SDAPVEVGDP RIEFYAAVYR HSLDGFAGAD WHLDEAVTRD
QALRTLTWAP AYAAFAEQDR GTLEAGKKAD VTVFSKDLMT VAPAEILKAQ AVLTMVDGKV
VFEK