Gene Caci_7068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7068 
Symbol 
ID8338435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8211873 
End bp8214896 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content72% 
IMG OID644960149 
ProductEndonuclease/exonuclease/phosphatase 
Protein accessionYP_003117739 
Protein GI256396175 
COG category[R] General function prediction only 
COG ID[COG3568] Metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.332521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.746742 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTACGA GCCAGGAGAT CTACCACCGC GTGCGGTGGG ACGCGCGCTT CGACGCGTCG 
CGGTTCGTGC TCGGCGTGGA AATGCGCGGG CGGGAGCCGA AGCGGGTGCC GCTGCCCTCG
TTCGATCCGC ACGGCGACAT TCCTTGGCAC CGGGTGCTGT TCTTCGAGGC CGACGGTCGG
CTGGTGTGGG ACCGGGCTTC CGGGCTGGAC GCCCTGGACG CCCTGGACGC CTCGGGGGCG
GGTCTGGCGC AGCGCGTCCG GTTGCTGGCC GCCCCGTTCT TCGAGCCGCG TACGCCGCAC
GCTTTCATCG ACGGGCGCTG GCAGCCAGGG ACCAACGCCT CCTCTCCTGC CATCGGGTTG
GACCTGCGAA TACTCACGTG GAACACCCTG TGGGACCGCT ACGACAAGGA CCTGATTCGC
ACCGCCGAAC GCCGCCCGAT GCTGCTCGCG GCACTGCGCG CTGCGGACGT TGATGTGATC
GCACTGCAGG AGGCTGAGCC CGCGCTGGTG AAGATGCTGC TGGCCGAGGA TTGGGTGCGG
CGCGAGTGGA CGCTCGGGGG CGATCCGCGC TCGTCGGACG TCGCGGACAG CGGGGTGCTG
GTGCTCAGCC GGCTGCCGGT GGTCGAGGCC GGGTGGCATG CGCTGGGTCG GTACAAGGCG
GTCGCGGCGG TGGTCGTGGA GGGCGGCGCC GGTCCGGTCG TCGTCGCGAA CACCCACCTG
AGCAGCGACC ACTCAGCCGA CGGCGCCAGC CTGCGCACCG AGCAGCTCGG GCAACTAGCC
GACGGCCTGC GAGCCATCGA CGCGGTCCCC GTGGTTCTGG TCGGCGACTT CAACGACGAC
ACCGACGCCC CCGCCTCCCG CCTCGGCATG ACGGACGCGT GGACGCAGGT CCACGGCGTA
GCGGACGACA CCGCGACCTT CGATCCGTCG GCGAACCCCC TGGCAGCGGT GTCTTCGCTG
ACAGGCGACG CCAAGCGTCT GGACCGCGTG CTGCTGCTAG GCGCCACCGC TTCGGACGTA
CGCCTGATCG GCGAGGTTCC GAACGCCGAC GGCCTGTTCG TGTCGGACCA CTACGGAATC
ACCGCACTGG TAACGGCGAC CGCCTCAGCC ACGCCAACCT CCCCTATCGC AAGCGCCACC
GCCTCCCCCG CACTCGACGC GACACCAACC GCCCGAACCG CCCTCGCCTG GATCCCGCCA
ACCCCGGCAT GGGAGCCGAT CCAGGAGATC CGCCGCCGCC TCGACCCGCA GGTCCACCGC
TGGCCGCCGC ACGTCAACCT CCTGTTCGGC TTCATCCCCG AGTCAGAGTT CGACGCCGCG
ATCCCCCTGC TGAGCAAGGC CGCCGCCACC GTCGCACCCT TCGAGACAGA ACTCACCGAA
GTCCGCCACT TCACCCACCG CACCGACAGC ACCCTCTGGC TCCACCCCAC CGGCACCGCA
TGGCAAACCC TGCACACAGC CCTCCTCGAA GCCTTCCCCA CCTGCCGCAA CCGCGAGACC
TACACCCCGC ACCTGACCAT CGCCAAGGTC CCCAACCCAC CCCGCACCCC ACCCAAGATC
GCCCCCACCA CCACGCCAGT CACCGAACTG GCCCTCCTCT CCCGCCGCGC CGACGGCCCG
ATGGAAGTCC GCGCAGTCAT CGAGCTGGGA ACCGGCGCAG TCCGGCTGCT GGATGTTGAC
CCCGGCGCCA CTTCCGCCAC CCCGCAGCCC GCCGCGCCCG GATCATCTGG CGCTGGCACT
GGCACTGGCG CCGCCATCTC GCCGAACGCC CCTGCCGCCT CGGATCCCCC GCACCTCGAC
CACCCAGCCG CCACCAGCCA CCGCGCGCTG AGCACCGGCG CAGTCCGGTC GATGGCTGCT
GATGCCGGTG CGGCCTCCGA CTCCGCGCAC CTCGACCGCC CCGCTGCCAC CGGCCACCGC
GCGCTGAGCA CCGGCGCAGC CCGGTCGATG GCTGCTGATG CCGATGCGGC CTCCGACTCC
GCGCACCTCG ACCGCCCCGC TGCCACCCTC ACCGCGCGCA TCACCGAAGC CCTCCCCGAG
GCCACCATCC ACCTCGTCGG CTCCCGCCGT ACCGCCACAC ACCTCCCGGC CGCCGACATC
GATCTCGTCG CGGCCGTCCC GGGCGCCCCC GACATCGGCG CGCTCGAGCG CCGCGTCGCC
AAGGCGCTCG GCGCGGGTCA TCGCGTGCGC CAGCTCGTCG CCGCGCGCGT GCCGGGTCTG
CGCATCGTCA CCCCTCGCCT CAGTGCCGAC CTCGTACTCG CCCCTACCGG CGACATTCCG
CCCGCCGATG CCGTCGCGCG GCGTGCCGAA CTCGGCGAGA CCATCGCCAG CTCGCTGTCT
GCGGTGTCCG ACGCCGAGGC GCTCATTGCC CTGCGGCCGG GACCGCATGT CCGCGTCGCC
AAAGCCTGGG CGCGTGCGCG CGGCCTCGAC GCCGCCCCGC TCGGCGGGCT CCCCGGGCTC
GCGTGGGCGC TGATGGCGGC GCGCTGCCGC GACCTGACCG ACTTCTTCGA GACCTGGGCC
GCCCACAGTT GGCAGGACCC GATTCCCACC CCCACCGCGC CGATCCGCGA CCTGACGCTG
CATCTCACCG CATCGATGCG CGACCTGATC ACCGAAGAGC TCTACAACGG CTGGGAGACG
GTCACCTCGA CACCCGACCC GCTCCCGACG CTCCTCGCGC CGCCCCCGAT GCACCGCCGC
CACCGCCGCT GGGCCGTGAT CACCCTCAGG GCAGACGGCA TCGAAGCACG CGACGTCCTC
GAAGGCCGCG TCCGAGGCCG CACCCGCAGC CTCCTGTCAG CCCTCGACGA AGCCGGCGTC
ACCGACACCC ACGCGTGGCC TCGCGCCTTC CACACCGCCG ACGCCAACGC CAACGCCAAC
ACCGAGCTCC GCACCGCCAT CGGCCTCGGC CGCACCCCGC CGACCCGCGA CGCGCTCGCC
GAGATCACCA AGCCATGGCT CCGCGGCCTG AGCGGCGTCA CGGTCGAGCT GGCTGGGAAC
GGCGACGTGC CGACGCTGAT CTAG
 
Protein sequence
MRTSQEIYHR VRWDARFDAS RFVLGVEMRG REPKRVPLPS FDPHGDIPWH RVLFFEADGR 
LVWDRASGLD ALDALDASGA GLAQRVRLLA APFFEPRTPH AFIDGRWQPG TNASSPAIGL
DLRILTWNTL WDRYDKDLIR TAERRPMLLA ALRAADVDVI ALQEAEPALV KMLLAEDWVR
REWTLGGDPR SSDVADSGVL VLSRLPVVEA GWHALGRYKA VAAVVVEGGA GPVVVANTHL
SSDHSADGAS LRTEQLGQLA DGLRAIDAVP VVLVGDFNDD TDAPASRLGM TDAWTQVHGV
ADDTATFDPS ANPLAAVSSL TGDAKRLDRV LLLGATASDV RLIGEVPNAD GLFVSDHYGI
TALVTATASA TPTSPIASAT ASPALDATPT ARTALAWIPP TPAWEPIQEI RRRLDPQVHR
WPPHVNLLFG FIPESEFDAA IPLLSKAAAT VAPFETELTE VRHFTHRTDS TLWLHPTGTA
WQTLHTALLE AFPTCRNRET YTPHLTIAKV PNPPRTPPKI APTTTPVTEL ALLSRRADGP
MEVRAVIELG TGAVRLLDVD PGATSATPQP AAPGSSGAGT GTGAAISPNA PAASDPPHLD
HPAATSHRAL STGAVRSMAA DAGAASDSAH LDRPAATGHR ALSTGAARSM AADADAASDS
AHLDRPAATL TARITEALPE ATIHLVGSRR TATHLPAADI DLVAAVPGAP DIGALERRVA
KALGAGHRVR QLVAARVPGL RIVTPRLSAD LVLAPTGDIP PADAVARRAE LGETIASSLS
AVSDAEALIA LRPGPHVRVA KAWARARGLD AAPLGGLPGL AWALMAARCR DLTDFFETWA
AHSWQDPIPT PTAPIRDLTL HLTASMRDLI TEELYNGWET VTSTPDPLPT LLAPPPMHRR
HRRWAVITLR ADGIEARDVL EGRVRGRTRS LLSALDEAGV TDTHAWPRAF HTADANANAN
TELRTAIGLG RTPPTRDALA EITKPWLRGL SGVTVELAGN GDVPTLI