Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_1441 |
Symbol | |
ID | 9138136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | - |
Start bp | 1851967 |
End bp | 1853763 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003629474 |
Protein GI | 296121696 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0376991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAGCC CTGTCCCCGG CCCACCTTCT CTCGATCTCG AAGCGCTCTT CACAGGGGAA GGAGAAAATC AGTCACAGTC GGCTGATGAT CTTATTCCCG CTCGTATGCT CAACGAATTC ACCTACTGCC CCAGACTGGC TTACCTTGAA TGGGTGCAAG GTGAGTTTCG AGACAACATC GAGACCAAAG AAGGGACATT CGGACATCGA AATGTGGATA TCCCCACCAA AAAATCATTC GATGCTCCTG ATGAAAACCC AGACGAATCT TCTCACCATG TCAGTGAAGG TTCTGCAATC CAGGAGATCA CAGCCGACAG CTTGGCGGCT CGGGCTTTGA TGCTCTCTGC ACCCTCTGAA GGATTGCTGG CCAAACTTGA TCTGATTGAA CTGAAAGGTT CGAAAGCAGT CCCGATCGAC TACAAAAGAG GGAACGTTCC CGATGTTCCT CACCAGGCTT GGGAACCAGA ACGGGTTCAA CTTTGTGCTC AAGGCTTGAT TCTGAAGGCC AATGGCTACG AATGCGACTA TGGCGAATTG TATTACATCG AATCCCGACG CCGGATTCGT GTTCAATTTG ACGACACCCT GATTGCCCGC ACGCGCGAAC TGGTTCGCGA AATGCGGCAC ATGGCCTCCA CACGCCAGAT TCCCGCCCCG CTTGTTGATA GCCCCAAATG TCCGAAATGT TCTCTTGTCA GCATTTGCCT CCCCGATGAA ACCAATTGCC TTAGAAATAG CACCAGGGAA GATTCGGCTC CAGAGAGCTC GGAAAGTATC CGCAAACTTG TTCCCGCCCG TGACGATGCA CTGCCAATTT ATGTTCAGGA TCAAGGAACC TATATAGGCA AAGATGGCGA GCGTCTGAAA CTGACTCCCG CGAAATCCTC TCCACTGTTC ATTCCACTCA TTCAAGTTTC ACAAGTTTGC CTGATGGGGA ATGTGCAGGT CACAGCCGCT GCAATTCGAG AACTGGCGGA CCGCAATATC CCCATCAGTT ACTTTTCCTA CGGCGGATGG TTCACGGCAC TCACTTCGGG AATGTGCCAC AAAAACGTCG AGTTGCGCAT GGCCCAGTCG AAGGCGGCTT TTGATCCTCA GGCCGCCCTG TCGATAGCGC GTGGTTTCAT TTCTGCAAAG ATCAAAAACT CACGCACACT GTTAAGGCGA CACGCTGACG ACAAGCATAG AAGCGATCTC GACCGCCTTG CTGATTACAT TCAGAAAGTC GAGCAGGTCG ATAATTTGAA TTCTCTCATG GGCCTGGAGG GAATGGCTGC GAAGACCTAT TTTGCAGGAT TTTCCAGATT GCTTAGAGGT GGAGATGAGT TCAATCTCGA AGGGCGTAAT CGCCGCCCTC CGACCGATCC CGTCAATGCG CTTCTATCTT TTGTCTATTC GCTGTTAACC AAAGAGTTGA CGATCACGAC ACAAGCTGTC GGCTTCGATC CATTCCTCGG ATTTCTGCAC CAGCCTCGCT ATGGCAGACC TTCTTTAGCA CTTGATCTTG CCGAAGAGTT CCGTCCGCTC GTGGGAGACT CAACAGTGCT TACGCTCATT AACAACGAGG AAGTCAGCCC AAAAAGCTTT ATCCGTCGTG CAGGAAGCGT CGCTTTGACA GAAACAGGTC GCAAAGCCGT CATTGCCGCT TATGAGCGGC GGATGGAAAC CGAGATTACG CACCCCATCT TCGGCTACAA GATCAGCTAC CGCCGGCTTT TTGAAGTCCA GGCTCGCTTA CTTTCCCGAG TTCTACTTGG CGAACTCGAT AAATATCCTG GCTTCTGCAC TCGTTAA
|
Protein sequence | MISPVPGPPS LDLEALFTGE GENQSQSADD LIPARMLNEF TYCPRLAYLE WVQGEFRDNI ETKEGTFGHR NVDIPTKKSF DAPDENPDES SHHVSEGSAI QEITADSLAA RALMLSAPSE GLLAKLDLIE LKGSKAVPID YKRGNVPDVP HQAWEPERVQ LCAQGLILKA NGYECDYGEL YYIESRRRIR VQFDDTLIAR TRELVREMRH MASTRQIPAP LVDSPKCPKC SLVSICLPDE TNCLRNSTRE DSAPESSESI RKLVPARDDA LPIYVQDQGT YIGKDGERLK LTPAKSSPLF IPLIQVSQVC LMGNVQVTAA AIRELADRNI PISYFSYGGW FTALTSGMCH KNVELRMAQS KAAFDPQAAL SIARGFISAK IKNSRTLLRR HADDKHRSDL DRLADYIQKV EQVDNLNSLM GLEGMAAKTY FAGFSRLLRG GDEFNLEGRN RRPPTDPVNA LLSFVYSLLT KELTITTQAV GFDPFLGFLH QPRYGRPSLA LDLAEEFRPL VGDSTVLTLI NNEEVSPKSF IRRAGSVALT ETGRKAVIAA YERRMETEIT HPIFGYKISY RRLFEVQARL LSRVLLGELD KYPGFCTR
|
| |