Gene Caci_8056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_8056 
Symbol 
ID8339434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp9349908 
End bp9351194 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content72% 
IMG OID644961141 
Productamidohydrolase 
Protein accessionYP_003118720 
Protein GI256397156 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGACCC TCCACGCTGC CCCGCTCGTC CTGCCGATGA CCGCCGCCCC GATCCCGGAC 
GGCGCCGTCG TCGTCGACAA CGACCGCGTC GTCGCCGTCG GCACCCGTGC CGACCTCGTC
GCCGCGCATC CCGAGGCGCG CGTCCGCGAA TGGCCGGGCG TCATCACGCC GGGACTGGTC
AACAGCCACG CGCACACGCA GTACTACGAC TTCGGGGACC TGGCGTCCTC CGGACTCCCG
TTCCCGGAGT GGCTGCACCA GATGGTCGCG CGCCGGGCGA CGTTCACCGA CGCGATGTGG
CAGGAGTCCA CACGCCGCGG GCTGCACGCG TATCTGAAGA CCGGGACCAC GGCTGTCGCG
GACATCGTGA CCGAGCCGGC GGTGCTGTCC GCGATCGCCC GCAGCGGGAT CCGGGGTGTG
GCCTACATCG AGGCGGTCTT CGCCGACGAC GCCTCGTGGG CGGCGGGCAA GCGGGCCGAT
CTGGTCACGG CGGTCGACGG CTCCGGAGGA CCGGTCCGGG GCGTCTCGCC GCACACGCCG
TTCACCATCA GCACGGGTGT GTACGAGGAC TGCGTCACCA TCGCGCACGA GCGCGGCAAG
CGCCTGCATC CGCACATCGC GGAGACCATG CAGGAGTCGG AGTACGTGCT GACCGGTACC
GGTCCCTTCG CCGACAACGC CAAGCAGTTC GGGCTCGACT TCTCCGACAT CCTCGACCAC
GGCACCGGCC TCACTCCCGT GGAGTGGGCC GACGCGCGCG GTGCGCTCGG CGACGATTGC
CATATCGCGC ACGGCATCCA CGTCAGCGCC TCCGACCGCG CGCTGCTGCG CGAGCGGGGG
ACCGCGGTGG CGTTGTGCGT GCGCTCGAAC CGGATCCTGG AGGCCGGCGA GCCGCCGGTC
GCCGCGTATC TGGAGGAGGG ATCGCCGATC GGCATCGGTA CCGACTCCGC CGCCTCCTCG
CCCTCGCTCG ATCTGCTGGA GGAGGCGCGG GCTTTGCGCG CCGTGGCCCG GAACCAGGGA
TACACCGCCG AGGATCTGGA CCGGCGCATC GTGGAGGCGG CGACGCTCGG CGGCGCGGCG
GCGCTCGGGC TGACCGAGGG ACCGGATCGG GTCGGACGTC TGGAGCCCGG GGTCCGTGCC
GATTTCGCGG TGTTCTCGGT GGCGGACGGC AGCGGTGACA GCAGCACTGA CGGCGACGCT
GACGGCGACC CCTACAAGCG GCTGATCGAC CGCGGCGCGT GTGTGGCGAC GGTGCTCGCC
GGCAAAATCG TGCACCGTAC GGTCTGA
 
Protein sequence
MLTLHAAPLV LPMTAAPIPD GAVVVDNDRV VAVGTRADLV AAHPEARVRE WPGVITPGLV 
NSHAHTQYYD FGDLASSGLP FPEWLHQMVA RRATFTDAMW QESTRRGLHA YLKTGTTAVA
DIVTEPAVLS AIARSGIRGV AYIEAVFADD ASWAAGKRAD LVTAVDGSGG PVRGVSPHTP
FTISTGVYED CVTIAHERGK RLHPHIAETM QESEYVLTGT GPFADNAKQF GLDFSDILDH
GTGLTPVEWA DARGALGDDC HIAHGIHVSA SDRALLRERG TAVALCVRSN RILEAGEPPV
AAYLEEGSPI GIGTDSAASS PSLDLLEEAR ALRAVARNQG YTAEDLDRRI VEAATLGGAA
ALGLTEGPDR VGRLEPGVRA DFAVFSVADG SGDSSTDGDA DGDPYKRLID RGACVATVLA
GKIVHRTV