Gene Information |
Locus tag | Namu_3054 |
Symbol | |
ID | 8448667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3363382 |
End bp | 3365049 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645042137 |
Product | CRISPR-associated protein, Cse1 family |
Protein accession | YP_003202379 |
Protein GI | 258653223 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 |
|
|
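The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes the start and end coordinates are 1-based and inclusive, which the record itself does not state.

```python
# Values copied from the Gene Information table above.
start_bp = 3363382
end_bp = 3365049
gene_length_bp = 1668
protein_length_aa = 555

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the final codon is the stop
# codon and does not contribute an amino acid.
total_codons, remainder = divmod(span_bp, 3)
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codons match protein length:",
      remainder == 0 and coding_codons == protein_length_aa)
```

Under that assumption both checks agree with the table: 1668 bp is 556 codons, one of which is the stop codon, leaving 555 amino acids.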
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.044999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000092711 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACGCGCA CCTTCAACGT GATTGACGAA CCGGTCCTGC CGGCTGTTTG GCTCGACGGG ACCTCCGCTG ACATATCGAT CCGCCAGGCG CTCATCGATG CGCATCGGAT CGCGGCAATC GAAGGTGAGC CGGCGTCTAT GACGTTCGCG CTGCACCGCT TGCTGCTAGC GATCGTCTAC CGGGCGCTGC CGGTGGAACG CCCGCGGCAG GAATGGCGAG AGCTCTGGGA CGCGCCGGAA CTGCCCGCCG AGGATCTCAA TTCCTACCTC GACGATTGGT ACCAACGCTT CGACCTGCTC GATCCGGCGC AGCCCTGGCT GCAGGTGGCC GGACTGCATA CGACGCGCAG TGAATTCAGT GAGCTCGAAA AGCTGATTCC CGATATTCCC AACGGTGAGC AGTTCTTCAC GGTGCGAGCC GGTCTGGCGG CTCGGTCCAT TTCGCTGGCG GAGGCTGCGC GATATCTCAT CCACGCGCAG GCTTTCGACC CCTCGGGCAT CAAGTCGGGG GCCGTGGGGG ACCCGCGGGT GAAGGGGGGC AAGGGTTATC CGATCGGCAC CGCCTGGGCC GGCAACCTGG GTGGCGTCCT GGTCCAGGGG CGCACGCTCA AGGAGACTTT GCTGCTCAAC CTGACTCTCG GCTCTCCGAA CGATGACGAT CGGCCGTGGT CAGGCGAGGA GGATCAGCCG GTCTGGGAAC GCGAGCCGCT GACCGCCGCC GAGGAATTCC CCGGGGAGAC GACCGGAGAT ATCCCCGGTC GAGCGCCCCG GGGGCCGGCC GATCTATTGA CCTGGCCGAG TCGGCGGATG CGGTTGCGGG TGGAGGCCGA CCGGGTCACC GGAGTGCTCA TCGCCAACGG GGACGTCCTG TGGCCGCAGA ATCGGCATGC CTGGGAGCCC ATGACGGCCT GGCGGCGAAG CGATCCGCAG TCCAAGAAGT ATAAGACCAC CGTCTACATG CCGCGCCAGC ACGATGCCGA CCGCAGCTTC TGGCGCGGCG CCGGGGCGGT GCTGCCGCGA GCCGACCGAA GTCACCACAC CGTCGACGGG GAAACCGGTC TGCCCGCCGC GTCTCTGCGC TGGCTTCAGG GAGCGGTCGA CGACGTTCTT GGTCCGAACT TTGTCTTGCG CGCTCGGGCG ATCAGCGTCA TCTATGGATC CAACAGCTCC GTGATCGACG CGGTCTACGA CGACACGATG GTCGTTCCTG CGGCGGTGCT GGACCATCCG GATCTGCAGC AGGGCTTGAT CGACGCGATC GCGCACACGG ATAAGGCGGT TCAGGCGTTG GGCGATCTCG GCCGCGACCT CGAACAGGCC GCTGGTGGTG TCGGTGACGC GCTACGCGGG CGGGCACGGC AGTTGGGCTT CCATCGGTTG GACGAGCCGT TCCGTCGCTG GATGGGGACG CTGGTCTCCG GTTCCAATCT CGATGCCGCC GTCACGCAGT GGTTCGTCGT CGCTCGGCGC GATCTCGTGA GCATCGGTCT AGCCCTGATC GACGCCGCCC CGCCCGAGGC CTGGACCGGT CGACCCGACC CCCGAAGACC CGACCGACGC CTTGATGTCG TCTCCGCGGA GCGACGCTTC CATTGGAACC TGGCCGCCGC CCTGCCAACC GCCCCGCCCG TCTCCGAGCC AACCGCAGAG TCAGGAGCAC GCCCATGA
|
Protein sequence | MTRTFNVIDE PVLPAVWLDG TSADISIRQA LIDAHRIAAI EGEPASMTFA LHRLLLAIVY RALPVERPRQ EWRELWDAPE LPAEDLNSYL DDWYQRFDLL DPAQPWLQVA GLHTTRSEFS ELEKLIPDIP NGEQFFTVRA GLAARSISLA EAARYLIHAQ AFDPSGIKSG AVGDPRVKGG KGYPIGTAWA GNLGGVLVQG RTLKETLLLN LTLGSPNDDD RPWSGEEDQP VWEREPLTAA EEFPGETTGD IPGRAPRGPA DLLTWPSRRM RLRVEADRVT GVLIANGDVL WPQNRHAWEP MTAWRRSDPQ SKKYKTTVYM PRQHDADRSF WRGAGAVLPR ADRSHHTVDG ETGLPAASLR WLQGAVDDVL GPNFVLRARA ISVIYGSNSS VIDAVYDDTM VVPAAVLDHP DLQQGLIDAI AHTDKAVQAL GDLGRDLEQA AGGVGDALRG RARQLGFHRL DEPFRRWMGT LVSGSNLDAA VTQWFVVARR DLVSIGLALI DAAPPEAWTG RPDPRRPDRR LDVVSAERRF HWNLAAALPT APPVSEPTAE SGARP
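The relationship between the gene and protein sequences above can be illustrated with a short translation sketch. It is a minimal example, not the database's own method: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation already, even though the gene lies on the minus strand.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI "TCAG" order.
# Table 11 uses the same assignments; it differs in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACGCGCA CCTTCAACGT GATTGACGAA"
print(translate(clean(first_blocks)))  # MTRTFNVIDE
```

The printed residues match the first block of the protein sequence in the record. Passing the full cleaned gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.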