Gene Caci_4726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4726 
Symbol 
ID8336080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5390698 
End bp5392029 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content70% 
IMG OID644957826 
Productallantoinase 
Protein accessionYP_003115428 
Protein GI256393864 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type
[TIGR03178] allantoinase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.16823 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGGC CCTTCGATCT CGTTCTGCGT TCGACGCGTG CTGTGCTGCC TGATGGTGTG 
CGGCCTGCTG CCGTTGCGGT TCGGCGTGGG CGTATTGCGG AGGTCGGCGA TCATCGGGCT
GCTTTCGGTG CGCGCGTTGA TCTCGATCTC GCCGATACCG CGTTGCTGCC CGGGTTGGTG
GATACGCACG TCCACGTCAA CGAGCCGGGC CGGACGCGCT GGGAGGGCTT CGCCTCCGCG
ACGCGTGCGG CGGCGGCGGG GGGTGTCACG ACACTGATCG ACATGCCGCT CAACTCGATT
CCGCCGACGG TGGATGTGCC GGCGTTGGCG GTCAAGCGGA AGGCGGCTGA GGGGAAGTGC
TTCGTCGATG TCGGGTTCTG GGGTGGTGCG ATTCCCGGTA ATGGCGCGGC GTTGCGTCCG
TTGCATCGCA GTGGGGTGTT CGGGTTCAAG TGCTTCCTTG CCGATTCCGG GGTTGAGGAG
TTTCCCGAGC TCAGTGTCGC GGAAATGCGG CTGGCAATGC GGGAGATCGC GCGGTTCGGC
GGACTTCTCA TCGTGCATGC CGAAAACGCC GAAGCGCTCG GTGCCGCGCC TTCCAGTGTC
CGCTATCGCG ATTTCCTCGC CTCGCGTCCG GCGGTGGCTG AGGACAGTGC CATCGCCGAT
GTCATCGATG CCGCCCGGGC CTACCGCGCG CGTGTCCACA TCCTGCATCT CGCCGCTGCC
GAAGCGTTGC CGCGGCTGGC CGCTGCCAAG GCGGACGGCG TGCGCATCAG CGCCGAGACG
TGCCCGCACT ACCTCACCTT CAGCGCCGAC GAGATCCGGG ACGGCGCCAC GCAGTTCAAG
TGCTGCCCGC CGATCCGGGA CGCCGCCGAC CGGGAGGCGT TGTGGGCGGC GCTCGCCGAC
GGCTTGATCG ACGTCGTCGT GTCCGACCAC TCGCCCTCCA CGCCCGACCT CAAGCGCCTG
GACTCCGGCG ACTTCGGCGC GGCGTGGGGC GGGATCTCCT CGCTCCAGCT CGGGCTGGCG
GCGGTGTGGA CCGGTGCGCG GGCGCGCGGG TTCGGGCTGG CCGACGTCGC GCGCTGGATG
GCCGCGCGTC CCGCCGAGCT GGTCGGGCTG GCGGGCAAGG GCCGCATCGC CGTGGGCTAC
GACGCCGACC TGGTGGCCTT CGACCCCGAA GCGGCCTTCA CCGTCGACCC CGCGAACCTG
CACCACAAGA ACCCTGTCAC GCCCTACGCC GGACGCGAGT TGCACGGCGT CGTGCGTGCC
ACCTACCTGC GCGGCGAGCC GGTGACCGAC GTCCCGCGCG GAGGATTCCT CACGCACCCG
GAGGTCCGAT GA
 
Protein sequence
MSGPFDLVLR STRAVLPDGV RPAAVAVRRG RIAEVGDHRA AFGARVDLDL ADTALLPGLV 
DTHVHVNEPG RTRWEGFASA TRAAAAGGVT TLIDMPLNSI PPTVDVPALA VKRKAAEGKC
FVDVGFWGGA IPGNGAALRP LHRSGVFGFK CFLADSGVEE FPELSVAEMR LAMREIARFG
GLLIVHAENA EALGAAPSSV RYRDFLASRP AVAEDSAIAD VIDAARAYRA RVHILHLAAA
EALPRLAAAK ADGVRISAET CPHYLTFSAD EIRDGATQFK CCPPIRDAAD REALWAALAD
GLIDVVVSDH SPSTPDLKRL DSGDFGAAWG GISSLQLGLA AVWTGARARG FGLADVARWM
AARPAELVGL AGKGRIAVGY DADLVAFDPE AAFTVDPANL HHKNPVTPYA GRELHGVVRA
TYLRGEPVTD VPRGGFLTHP EVR