Gene Namu_3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3052 
Symbol 
ID8448665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3361605 
End bp3362741 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content66% 
IMG OID645042135 
ProductCRISPR-associated protein, Cse4 family 
Protein accessionYP_003202377 
Protein GI258653221 
COG category 
COG ID 
TIGRFAM ID[TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.000809112 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00221304 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGTGCA TCGACATCCA CATCCTGCAG ACCGTCCCGC CGAGCAACCT CAACCGTGAC 
GACACCGGCA GCCCGAAGAC GGCAATCTAC GGCGGCGTTC AGCGCGCCCG CGTGTCGAGC
CAGGCATGGA AACGGGCTAC CCGCAAGGCA TTCGATGGTC GAATCAAGCC GGCGGACCTC
GGGGTGCGCA CCAAACGGGT CGTCGAGTTG GTCAGTGAAG AGATCCTCCG CCAATCACCG
GGGGTCGGCG CCGAGGGCGC GGTCGAACTG GCCAAGAAGG TCCTGGTGGC TGCTGGCATC
ACGTTGAGCG CGCCGAAGCC GAAAAAGAAG GGGGAAGCGC CTGGTCTCGA CGAGTCCGGG
TACCTGCTGT TCCTGGCCCG GCATCAGGTC GAGCGACTCG CCGAACTCGC CATCGGCGCC
GCCGAGGAAA CGACGATCGA CAAGAAGCAG GCCAAGGCCG CCGCTGACTC CAGCCAGAGC
GTGGACGTCG CGCTGTTCGG TCGGATGGTC GCCGACGCCG CCGACCTGAA CGTCGACGCC
GCGGCGCAGG TTGCCCACGC ACTCTCCGTA CATGCCGTCC GCAACGAATT CGACTATTTC
ACCGCCGTTG ACGACCGTAA AGAAAATGAG GAGGAGACCG GGGCCGGCAT GATCGGAACG
GTGGAGTTCA ACTCTTCCAC GCTCTACCGC TACGCGACGG TGAACATCGA CGGGCTGCGG
GTCAACCTCG GTGACGATGC TGCCACGATC CGCGCGGCTC AGGAATTCGT TCGTGCCTTC
GTGACTTCAA TGCCCACCGG AAAGCAAAAC ACTTTCGCCA ATCGCACCCT GCCCGACGCC
GTCGTGGTGC AGGTCCGCGA CTCCCAGCCG ATCAACTTGG TCGGTGCCTT CGAGGAACCC
GTCGAGGTCC CGGCCGGCGG ATCCCGGCTG CGGGAAGCCG CGGACCGGCT CGTCGCTCAC
GCGCAGAGCG TCGACCACGC CTACGGCACC GCGCCCACCC GATCGATGAC AGTGCTGGCG
TCGCCCACCG TCGGGACCCT CGCGGCCCTG GGGGAGTCGA TCGCGCTCGA CGACATGATC
GCAGCGGTGG GGGAGGCGGT CGCCGACGCT TTGGTCGCCT CCGCGGTCCG GGCGTGA
 
Protein sequence
MKCIDIHILQ TVPPSNLNRD DTGSPKTAIY GGVQRARVSS QAWKRATRKA FDGRIKPADL 
GVRTKRVVEL VSEEILRQSP GVGAEGAVEL AKKVLVAAGI TLSAPKPKKK GEAPGLDESG
YLLFLARHQV ERLAELAIGA AEETTIDKKQ AKAAADSSQS VDVALFGRMV ADAADLNVDA
AAQVAHALSV HAVRNEFDYF TAVDDRKENE EETGAGMIGT VEFNSSTLYR YATVNIDGLR
VNLGDDAATI RAAQEFVRAF VTSMPTGKQN TFANRTLPDA VVVQVRDSQP INLVGAFEEP
VEVPAGGSRL REAADRLVAH AQSVDHAYGT APTRSMTVLA SPTVGTLAAL GESIALDDMI
AAVGEAVADA LVASAVRA