Gene Caul_4450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4450 
Symbol 
ID5901911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4817600 
End bp4819474 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content72% 
IMG OID641564969 
Productallophanate hydrolase 
Protein accessionYP_001686068 
Protein GI167648405 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID[TIGR02713] allophanate hydrolase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCGACT TCCAGCGCCT GTCGGTCAGC GCCATCGCCG ACGCCGTGAA CGGCGGGGCC 
AGCGCGGTCG AGGTCGCGCG GGCCGCGCTC GACGTCGTCG CCGCCTATGA TGAGATCCAG
CCCCAGGCCT GGATCCTGCG TCTGCCCGCC GAGGCCGTCC TGGCCCAGGC CCGCGTCGTC
GAGGCCCGGA TCGCGGCTGG CGAGGCCCTG CCCCTGGCCG GCGTCCCGTT CGCGGTGAAG
GACAATATCG ACGTCGCCGG CTGGCCGACC AGCGCCGCCT GCCCGGCCTT CGCCTATGTG
CCCGAGCGCT CGGCGACGGT CGTCGAGCGG CTGGTCGCCG CGGGAGCGGT GCTGGTCGGC
AAGACCAATC TGGACCAGTT CGCCACCGGC CTGGTCGGCG TCCGCAGCCC CTACGGCGCG
CCGCTTTGCG TGTTCGACCA GGCCTATGTC AGCGGCGGGT CCAGCTCCGG CTCGGCGGTG
GCCGTCGCGG CGGGGCTGGT GGCGTTCTCG CTGGGCACCG ACACCGCCGG CTCGGGCCGC
GTGCCGGCCG CGTTCAACCA CCTGATCGGA TTGAAGCCCT CCAAGGGCCG CTGGAGCACG
CGCGGGCTGG TCCCCGCCTG CCGCTCGCTG GACTGCATCA GCGTGTTCGC CGCCGACCTG
GAGGGGGCGG CGCTGGTCGA CGAGGTGCTG ACGGGGTTTG ATCCCGAGGA CGACTACTCG
CGGCGTGCGC CGGAAGACCC CCTCCGACCC TCTGGGCCAC CTCCCCTAGA GGCGGAGGAT
CTGCACGGTG CGCCGGCCGA TGGATGCTCC CCCTCTGGGG GACCTGTCGC GGAGCGACTG
AGGGAGTCCT TCCGCTTCGG CGTTCCCAAA CCCGACCAAC GCCTCTTCCT CGGCGACCAC
CAATCGGCCG CCCTCTACGC CGCCGCCATC GCGCGTCTGA CGACCGCCGG CGGAACACCC
GTCGAGATCG ACATCGCCCC GCTGTTCGAT TGCGCCAAGC TGCTCTACAG CGGGCCATGG
GTCGCCGAGC GAACGGCCGC CGTGGAGACC CTGCTGCGCG ACACGCCCGG CGCGATCCAC
CCCACCGTGC GCGCCATCGT CCAGGGCGGC CTGGCGGTCA CCGGCGTCGA GACCTTCAAG
GGCTTCCATG CGCTGGAGGC CCATCGCCGC GCGGCCGAGG CGATCTGGGA CGCCGTGGAC
GTGATGCTGC TGCCCACCGC CCCGACCATC TACCGCCTGA AGGCGGTGCA GGCCGAGCCG
ATCGCGCTGA ACGCCAATCT GGGCCTCTAC ACCAACTTCG TGAACCTGCT CGACATGAGC
GCCCTCGCCG TCCCCGCCGG CTTTCGTGAG AACGGGACGG GCTTTGGCGT CACCCTGATC
GGGCCGGCCT TCGCCGACCG CGCGCTGCTG GCCCTGGCCG AGCGCTACCT GGAGACCTTC
CCCATGGCCG ACATGCCCCC GCTCGACCTG ACGCCCAAGA AGCCCGGCGT GAAGCTGGCC
GTGGTCGGCG CCCACCTGGC CGGCATGCCG CTGCACTGGC AGCTGACCTC GCGCGAGGCG
CGCCTGGTCA GCGCGACCAG GACCGCCCCG ACCTACAAGC TCTACGCCAT GGCCGAAACG
ACGCCGCCCA AACCCGCCCT GATCCACGTC GGCGAGGGCG GCGCGGCGAT CCTGGTCGAG
GTCTACGAGC TGGACTTCGA GGCCTTCGGA TCCTTCGTCG CCGAGGTCCC CGCCCCGTTG
GCCATCGGCA CGGTGACCCT GGAGGATGGG ACCTTGGTCA AGGGCTTCGT CGCCGAACCC
CGCGCCTTGA ACGGGGCCAC TGACATCACC GAACTGGGCG GCTGGCGGGC CTATATCGCC
TCGTTGGCGG CCTGA
 
Protein sequence
MTDFQRLSVS AIADAVNGGA SAVEVARAAL DVVAAYDEIQ PQAWILRLPA EAVLAQARVV 
EARIAAGEAL PLAGVPFAVK DNIDVAGWPT SAACPAFAYV PERSATVVER LVAAGAVLVG
KTNLDQFATG LVGVRSPYGA PLCVFDQAYV SGGSSSGSAV AVAAGLVAFS LGTDTAGSGR
VPAAFNHLIG LKPSKGRWST RGLVPACRSL DCISVFAADL EGAALVDEVL TGFDPEDDYS
RRAPEDPLRP SGPPPLEAED LHGAPADGCS PSGGPVAERL RESFRFGVPK PDQRLFLGDH
QSAALYAAAI ARLTTAGGTP VEIDIAPLFD CAKLLYSGPW VAERTAAVET LLRDTPGAIH
PTVRAIVQGG LAVTGVETFK GFHALEAHRR AAEAIWDAVD VMLLPTAPTI YRLKAVQAEP
IALNANLGLY TNFVNLLDMS ALAVPAGFRE NGTGFGVTLI GPAFADRALL ALAERYLETF
PMADMPPLDL TPKKPGVKLA VVGAHLAGMP LHWQLTSREA RLVSATRTAP TYKLYAMAET
TPPKPALIHV GEGGAAILVE VYELDFEAFG SFVAEVPAPL AIGTVTLEDG TLVKGFVAEP
RALNGATDIT ELGGWRAYIA SLAA