Gene Caci_3026 details


Gene Information

Locus tag: Caci_3026
Symbol:
ID: 8334377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3343854
End bp: 3345512
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 66%
IMG OID: 644956172
Product: HNH endonuclease
Protein accession: YP_003113776
Protein GI: 256392212
COG category: [V] Defense mechanisms
COG ID: [COG1403] Restriction endonuclease
TIGRFAM ID:
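
The gene and protein lengths in the record above follow from the start and end coordinates. The sketch below is only an illustration of that arithmetic, not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information record.
START_BP = 3343854
END_BP = 3345512


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int, has_stop: bool = True) -> int:
    """Residue count for a CDS; the stop codon (if present) is not counted."""
    codons = gene_len // 3
    return codons - 1 if has_stop else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1659
print(protein_length(length_bp))  # 552
```

Both values agree with the reported Gene Length (1659 bp) and Protein Length (552 aa).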


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.409509
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0700786
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGTCA GCAAGCGCCT CCGCTACGAA ATCCTCCGGC GCGACAATCA CACCTGCCGG 
TATTGCGGCG CCACCGCACC GACCGTCCCG CTGCGCGTCG ACCACGTCGT TCCTGTCGCT
CTCGGCGGCA CCGACGACGC CACCAACCTC GTCGCCTCGT GCGAGCCCTG TAACAGCGGC
AAGACGTCCA CCGCCCCTGA CTCCCCACTG GTTGAGCAGG CACGCGAAGA CGCCATGCGC
TGGCAGATGG CGTGGACGGT TGCAGTCGCC GAAGCCGAGA CCGAGGGCAA GCAGCGCGCC
AAGGACATCG CCAAGGTCAA GAAGAACTAC GTCGCCGCCT ACAAGGGGCG GCACGGACAT
GCACCGATCC TTCCCGAGGG CTGGGAGGCG TCCGTCGGGC GGTGGCTCGA CCTCGGGCTG
CCGCTGACGC TCATCGACAA GGCCATCGCA TCCGCTGTCG GGCGAACCTA CGTTCCCGCC
AAGGACCGGT TCGCCTACTT TGCCGGCTGC TGTTGGAGCC TTCTCCGGGA GCTGAAGGAC
CGCACCGAGG CCATCGCCAT GCAGGCTTCG CCGACAACGC AGGATGAGCA AGGCGACGGA
CAATGCGAGT ACTGCGACGG CGGACAGGAT GATCGCAACA TCGTCGAGTA CGCCACGGAC
GTCTTCGCAG AAGCGTGGTC CCAGGACGAA GAACCCAACT CATACTGCCG CCGCATGCTG
GCCGCTTATG CAAGCGCGGC GAGCGGCGCC GGCTACGACA GGCTCTCCAT CGGATATGCA
GTTCACCAGG CCGCTCGCGA CGGGCACGCC GATATCGGCG CCTACCTCTC GACCCTTGAC
GATGTCCTTG AGCGAGCGTC CGAACCCATC ATCGACAGCC CTTTCGGATC CCGCGTCATC
GATGCAGACC TGCTGCCCAC AGACGAAGAT CGCGCCGCAC GCGCCGTAGC GGAAGCTGTA
GTCGCGGCCT GGCGCGCGTC GTGGCGGGAC GCCATGGAGC ATCCTCCCCC TGGACGCCGC
AGCACCGAAG CTTGCGCTGT TCGCGACTAC GCATTGGCCA CCTACCGCAA GACCGAGAAC
GCTCATGAGT TGCTGCGCGC TGCCGAGTTC GCAGGCGCGG AAGGCAACAG CAACCTCCCA
CAGGCAACTG CTCATGCGGA GGCGTACTAC GCCACCGAAC CGGCCGTCTC CGCATGGGGC
TGGGCGTGGT ACAAGGCGAC AGGCTTGGAC GCGCCAGGGT CGGTTCACGA AAGCGTGTGG
GCCGATTGCC GCACACTGCA CGCGAGTGGC GCCTGGGATC ACAAAATCAC CCTTGCCGCG
TCATTCGCGG GCGCACACGC AACGACACGT ATGCACTTCG GACTTGACGC CAATGAGGCC
GAGCTGATCG GCGTGGAGGC TACCACCCAG CGCATCGAGG ACTACTGGGC CCGTTCCTGG
AATGAATCCA GCCACTCGTG GCCGGGCGAA GGAGACCGCG CAGCGCTCCG AGCATGCCTC
TCGTCCATCG CTGACGGCAA GGCACACACG GTCGGGGACG TAACCGCTGC CGCTGTTGCC
GCGGGCGCCT ACCAGAGTGC CGACCTCTAC CCGAGCCTCA CGCGCTCGCA GTCCACGTTC
GTTGCCGCGG CCCACCTGCC GCACCTGGGA GGTGAATAA
 
Protein sequence
MAVSKRLRYE ILRRDNHTCR YCGATAPTVP LRVDHVVPVA LGGTDDATNL VASCEPCNSG 
KTSTAPDSPL VEQAREDAMR WQMAWTVAVA EAETEGKQRA KDIAKVKKNY VAAYKGRHGH
APILPEGWEA SVGRWLDLGL PLTLIDKAIA SAVGRTYVPA KDRFAYFAGC CWSLLRELKD
RTEAIAMQAS PTTQDEQGDG QCEYCDGGQD DRNIVEYATD VFAEAWSQDE EPNSYCRRML
AAYASAASGA GYDRLSIGYA VHQAARDGHA DIGAYLSTLD DVLERASEPI IDSPFGSRVI
DADLLPTDED RAARAVAEAV VAAWRASWRD AMEHPPPGRR STEACAVRDY ALATYRKTEN
AHELLRAAEF AGAEGNSNLP QATAHAEAYY ATEPAVSAWG WAWYKATGLD APGSVHESVW
ADCRTLHASG AWDHKITLAA SFAGAHATTR MHFGLDANEA ELIGVEATTQ RIEDYWARSW
NESSHSWPGE GDRAALRACL SSIADGKAHT VGDVTAAAVA AGAYQSADLY PSLTRSQSTF
VAAAHLPHLG GE