Gene Caci_4734 details


Gene Information

Locus tag: Caci_4734
Symbol:
ID: 8336088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5401183
End bp: 5402550
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 74%
IMG OID: 644957834
Product: hydroxydechloroatrazine ethylaminohydrolase
Protein accession: YP_003115436
Protein GI: 256393872
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
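
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the reported 1368 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5401183
END_BP = 5402550


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1368 455
```

Both results agree with the Gene Length and Protein Length fields.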


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGA TCGTCATCCA GAACGCCGCG ATCGCCACGG TCGACGCCGC GAGCACCGAA 
CACGCGACCG GACATGTGGT CATCGAGGAC GGCCGGATCA CCGCCGTGGG TTCGGGTCCC
GCGCCGCGCG ATATCGAGGG CGCGCGCGTC GTGGACGGCA CCGGCTGCCT GGCCACCCCC
GGCCTGGTCA ACACCCACCA CCACCTCTAC CAATGGATCA CCCGCGGGCT GGCGCAGGAC
GCGGCGCTGT TCGGCTGGCT CGTCGAGCTG TACCCGATCT GGGCCAGGCT GACCGAGGAC
ACCCTCGGCG CGGCGGCGCG CGGCGGTCTG GCCTGGCTGG CCAAGACCGG CTGCACCACC
TCCGCCGACC ACCACTACGT CTTCCCGCGC GACGGCGGCG ACCTGCTCGG CGCGGAGATC
GAGGCGGCGC GCGAGATCGG TCTGCGCTTC CAGCCGACGC GCGGCTCGAT GGACCGCGGC
CGCAGCGACG GCGGGCTGCC GCCGGACGAG GTGGTCGAGC GGCTCGACGA CATCCTGGCG
GCGAGTCAGG ACGCGATCAC CCGGTACCAC GACCCGTCCT TCGACTCGAT GCTGCGCATC
GGCCTGGCGC CGTGCTCGCC GTTCTCGGTC AGCTCCGACC TGATGGTGCA GAGCGCGTCG
CTGGCGCGCG AACACGGCGT CCGGCTGCAC ACGCACCTGG CCGAGACCCT CGACGAGGAA
AAGTTCTGTC TGGAGACCCA CGGCTGCACC CCGGCCGAAT ACGCCGACAA GCTCGGCTGG
CTCGGACCGG ACGTGTGGCT GGCGCACTGC GTGCACCTGT CGGACGCGGC GATCCGCCGC
TTCGCCGACA CCCAGACCGG GACCGCGCAC TGCGCGTCCT CCAACGCCCG CCTCGGCTCG
GGCCACGCCC GTATCAAGGA CCTGCTCGCC GCCGGTGCGC CGGTCGGACT CGGCGTGGAC
GGCGCGGCGT CGCAGGAGTC GGGGATGCTG GTGGAGGAGC TGCGGCAGGC GATGTACGTC
TCGCGGCTGC GCGCGCTGTC CTCCGATCCG GCCGAGGCGC TGAACGCCCG CGCCGCGCTG
CGCTTGGGGA CCGCCGGGGG AGCGCGGGTG CTGGGCCGCG AGGCCGAGAT CGGCTCGCTG
GAGGTCGGCA AGCTCGGCGA CGTCGCGCTG TGGGACCTGA CCGGGCTCGG CCACGCCGGC
ATCGCCGACC CGGTCGCCGC CCTGGTCTTC GGCCCGCCGG CGCCGCTGCG GCTGCTCGCG
GTCGGCGGGC GCCCGGTCGT CGAGGACGGC GCGCTGACCA CCGCCGACGA GGACGCGCTG
GCGCGCGCCT GCCGGACCGC CGCCCGCTCG CTGACGGAGG TGTCCTGA
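
A GC content figure like the one reported for this gene can be recomputed from a nucleotide listing. The sketch below is a minimal, hypothetical helper that ignores the spaces and line breaks used to group bases in blocks of ten. How the database rounds its value is not stated, so this is only an approximate check.

```python
def gc_content(seq: str) -> float:
    """Percent of G and C among all non-whitespace characters in seq."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Short demo input: the first ten bases of the gene sequence above.
demo = "ATGAGCACGA"
print(f"{gc_content(demo):.1f}%")  # 50.0%
```

Passing the full gene sequence as one string (line breaks included) gives the value for the whole CDS.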
 
Protein sequence
MSTIVIQNAA IATVDAASTE HATGHVVIED GRITAVGSGP APRDIEGARV VDGTGCLATP 
GLVNTHHHLY QWITRGLAQD AALFGWLVEL YPIWARLTED TLGAAARGGL AWLAKTGCTT
SADHHYVFPR DGGDLLGAEI EAAREIGLRF QPTRGSMDRG RSDGGLPPDE VVERLDDILA
ASQDAITRYH DPSFDSMLRI GLAPCSPFSV SSDLMVQSAS LAREHGVRLH THLAETLDEE
KFCLETHGCT PAEYADKLGW LGPDVWLAHC VHLSDAAIRR FADTQTGTAH CASSNARLGS
GHARIKDLLA AGAPVGLGVD GAASQESGML VEELRQAMYV SRLRALSSDP AEALNARAAL
RLGTAGGARV LGREAEIGSL EVGKLGDVAL WDLTGLGHAG IADPVAALVF GPPAPLRLLA
VGGRPVVEDG ALTTADEDAL ARACRTAARS LTEVS
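
The protein sequence can be checked against the gene sequence by translating it. The sketch below builds the codon table from the compact NCBI-style amino acid string. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed. This gene starts with ATG, so no special handling of alternative start codons is included. The helper name `translate` is illustrative.

```python
from itertools import product

# Codons in TCAG order, third base varying fastest, paired with amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop block spacing and newlines
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGAGCACGA TCGTCATCCA GAACGCCGCG ATCGCCACGG TCGACGCCGC GAGCACCGAA"
print(translate(first_line))  # MSTIVIQNAAIATVDAASTE
```

The output matches the first twenty residues of the protein sequence listed above.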