Gene Namu_3054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3054 
Symbol 
ID8448667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3363382 
End bp3365049 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content67% 
IMG OID645042137 
ProductCRISPR-associated protein, Cse1 family 
Protein accessionYP_003202379 
Protein GI258653223 
COG category 
COG ID 
TIGRFAM ID[TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.044999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000092711 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACGCGCA CCTTCAACGT GATTGACGAA CCGGTCCTGC CGGCTGTTTG GCTCGACGGG 
ACCTCCGCTG ACATATCGAT CCGCCAGGCG CTCATCGATG CGCATCGGAT CGCGGCAATC
GAAGGTGAGC CGGCGTCTAT GACGTTCGCG CTGCACCGCT TGCTGCTAGC GATCGTCTAC
CGGGCGCTGC CGGTGGAACG CCCGCGGCAG GAATGGCGAG AGCTCTGGGA CGCGCCGGAA
CTGCCCGCCG AGGATCTCAA TTCCTACCTC GACGATTGGT ACCAACGCTT CGACCTGCTC
GATCCGGCGC AGCCCTGGCT GCAGGTGGCC GGACTGCATA CGACGCGCAG TGAATTCAGT
GAGCTCGAAA AGCTGATTCC CGATATTCCC AACGGTGAGC AGTTCTTCAC GGTGCGAGCC
GGTCTGGCGG CTCGGTCCAT TTCGCTGGCG GAGGCTGCGC GATATCTCAT CCACGCGCAG
GCTTTCGACC CCTCGGGCAT CAAGTCGGGG GCCGTGGGGG ACCCGCGGGT GAAGGGGGGC
AAGGGTTATC CGATCGGCAC CGCCTGGGCC GGCAACCTGG GTGGCGTCCT GGTCCAGGGG
CGCACGCTCA AGGAGACTTT GCTGCTCAAC CTGACTCTCG GCTCTCCGAA CGATGACGAT
CGGCCGTGGT CAGGCGAGGA GGATCAGCCG GTCTGGGAAC GCGAGCCGCT GACCGCCGCC
GAGGAATTCC CCGGGGAGAC GACCGGAGAT ATCCCCGGTC GAGCGCCCCG GGGGCCGGCC
GATCTATTGA CCTGGCCGAG TCGGCGGATG CGGTTGCGGG TGGAGGCCGA CCGGGTCACC
GGAGTGCTCA TCGCCAACGG GGACGTCCTG TGGCCGCAGA ATCGGCATGC CTGGGAGCCC
ATGACGGCCT GGCGGCGAAG CGATCCGCAG TCCAAGAAGT ATAAGACCAC CGTCTACATG
CCGCGCCAGC ACGATGCCGA CCGCAGCTTC TGGCGCGGCG CCGGGGCGGT GCTGCCGCGA
GCCGACCGAA GTCACCACAC CGTCGACGGG GAAACCGGTC TGCCCGCCGC GTCTCTGCGC
TGGCTTCAGG GAGCGGTCGA CGACGTTCTT GGTCCGAACT TTGTCTTGCG CGCTCGGGCG
ATCAGCGTCA TCTATGGATC CAACAGCTCC GTGATCGACG CGGTCTACGA CGACACGATG
GTCGTTCCTG CGGCGGTGCT GGACCATCCG GATCTGCAGC AGGGCTTGAT CGACGCGATC
GCGCACACGG ATAAGGCGGT TCAGGCGTTG GGCGATCTCG GCCGCGACCT CGAACAGGCC
GCTGGTGGTG TCGGTGACGC GCTACGCGGG CGGGCACGGC AGTTGGGCTT CCATCGGTTG
GACGAGCCGT TCCGTCGCTG GATGGGGACG CTGGTCTCCG GTTCCAATCT CGATGCCGCC
GTCACGCAGT GGTTCGTCGT CGCTCGGCGC GATCTCGTGA GCATCGGTCT AGCCCTGATC
GACGCCGCCC CGCCCGAGGC CTGGACCGGT CGACCCGACC CCCGAAGACC CGACCGACGC
CTTGATGTCG TCTCCGCGGA GCGACGCTTC CATTGGAACC TGGCCGCCGC CCTGCCAACC
GCCCCGCCCG TCTCCGAGCC AACCGCAGAG TCAGGAGCAC GCCCATGA
 
Protein sequence
MTRTFNVIDE PVLPAVWLDG TSADISIRQA LIDAHRIAAI EGEPASMTFA LHRLLLAIVY 
RALPVERPRQ EWRELWDAPE LPAEDLNSYL DDWYQRFDLL DPAQPWLQVA GLHTTRSEFS
ELEKLIPDIP NGEQFFTVRA GLAARSISLA EAARYLIHAQ AFDPSGIKSG AVGDPRVKGG
KGYPIGTAWA GNLGGVLVQG RTLKETLLLN LTLGSPNDDD RPWSGEEDQP VWEREPLTAA
EEFPGETTGD IPGRAPRGPA DLLTWPSRRM RLRVEADRVT GVLIANGDVL WPQNRHAWEP
MTAWRRSDPQ SKKYKTTVYM PRQHDADRSF WRGAGAVLPR ADRSHHTVDG ETGLPAASLR
WLQGAVDDVL GPNFVLRARA ISVIYGSNSS VIDAVYDDTM VVPAAVLDHP DLQQGLIDAI
AHTDKAVQAL GDLGRDLEQA AGGVGDALRG RARQLGFHRL DEPFRRWMGT LVSGSNLDAA
VTQWFVVARR DLVSIGLALI DAAPPEAWTG RPDPRRPDRR LDVVSAERRF HWNLAAALPT
APPVSEPTAE SGARP