Gene Plim_1441 details

Gene Information

Locus tag: Plim_1441
Symbol:
ID: 9138136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 1851967
End bp: 1853763
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: CRISPR-associated protein Cas1
Protein accession: YP_003629474
Protein GI: 296121696
COG category:
COG ID:
TIGRFAM ID:
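
The start, end, gene-length and protein-length fields above are linked by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon, both of which agree with the values listed here. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length = gene_length(1851967, 1853763)
print(length)                  # 1797
print(protein_length(length))  # 598
```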


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0376991
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAGCC CTGTCCCCGG CCCACCTTCT CTCGATCTCG AAGCGCTCTT CACAGGGGAA 
GGAGAAAATC AGTCACAGTC GGCTGATGAT CTTATTCCCG CTCGTATGCT CAACGAATTC
ACCTACTGCC CCAGACTGGC TTACCTTGAA TGGGTGCAAG GTGAGTTTCG AGACAACATC
GAGACCAAAG AAGGGACATT CGGACATCGA AATGTGGATA TCCCCACCAA AAAATCATTC
GATGCTCCTG ATGAAAACCC AGACGAATCT TCTCACCATG TCAGTGAAGG TTCTGCAATC
CAGGAGATCA CAGCCGACAG CTTGGCGGCT CGGGCTTTGA TGCTCTCTGC ACCCTCTGAA
GGATTGCTGG CCAAACTTGA TCTGATTGAA CTGAAAGGTT CGAAAGCAGT CCCGATCGAC
TACAAAAGAG GGAACGTTCC CGATGTTCCT CACCAGGCTT GGGAACCAGA ACGGGTTCAA
CTTTGTGCTC AAGGCTTGAT TCTGAAGGCC AATGGCTACG AATGCGACTA TGGCGAATTG
TATTACATCG AATCCCGACG CCGGATTCGT GTTCAATTTG ACGACACCCT GATTGCCCGC
ACGCGCGAAC TGGTTCGCGA AATGCGGCAC ATGGCCTCCA CACGCCAGAT TCCCGCCCCG
CTTGTTGATA GCCCCAAATG TCCGAAATGT TCTCTTGTCA GCATTTGCCT CCCCGATGAA
ACCAATTGCC TTAGAAATAG CACCAGGGAA GATTCGGCTC CAGAGAGCTC GGAAAGTATC
CGCAAACTTG TTCCCGCCCG TGACGATGCA CTGCCAATTT ATGTTCAGGA TCAAGGAACC
TATATAGGCA AAGATGGCGA GCGTCTGAAA CTGACTCCCG CGAAATCCTC TCCACTGTTC
ATTCCACTCA TTCAAGTTTC ACAAGTTTGC CTGATGGGGA ATGTGCAGGT CACAGCCGCT
GCAATTCGAG AACTGGCGGA CCGCAATATC CCCATCAGTT ACTTTTCCTA CGGCGGATGG
TTCACGGCAC TCACTTCGGG AATGTGCCAC AAAAACGTCG AGTTGCGCAT GGCCCAGTCG
AAGGCGGCTT TTGATCCTCA GGCCGCCCTG TCGATAGCGC GTGGTTTCAT TTCTGCAAAG
ATCAAAAACT CACGCACACT GTTAAGGCGA CACGCTGACG ACAAGCATAG AAGCGATCTC
GACCGCCTTG CTGATTACAT TCAGAAAGTC GAGCAGGTCG ATAATTTGAA TTCTCTCATG
GGCCTGGAGG GAATGGCTGC GAAGACCTAT TTTGCAGGAT TTTCCAGATT GCTTAGAGGT
GGAGATGAGT TCAATCTCGA AGGGCGTAAT CGCCGCCCTC CGACCGATCC CGTCAATGCG
CTTCTATCTT TTGTCTATTC GCTGTTAACC AAAGAGTTGA CGATCACGAC ACAAGCTGTC
GGCTTCGATC CATTCCTCGG ATTTCTGCAC CAGCCTCGCT ATGGCAGACC TTCTTTAGCA
CTTGATCTTG CCGAAGAGTT CCGTCCGCTC GTGGGAGACT CAACAGTGCT TACGCTCATT
AACAACGAGG AAGTCAGCCC AAAAAGCTTT ATCCGTCGTG CAGGAAGCGT CGCTTTGACA
GAAACAGGTC GCAAAGCCGT CATTGCCGCT TATGAGCGGC GGATGGAAAC CGAGATTACG
CACCCCATCT TCGGCTACAA GATCAGCTAC CGCCGGCTTT TTGAAGTCCA GGCTCGCTTA
CTTTCCCGAG TTCTACTTGG CGAACTCGAT AAATATCCTG GCTTCTGCAC TCGTTAA
 
Protein sequence
MISPVPGPPS LDLEALFTGE GENQSQSADD LIPARMLNEF TYCPRLAYLE WVQGEFRDNI 
ETKEGTFGHR NVDIPTKKSF DAPDENPDES SHHVSEGSAI QEITADSLAA RALMLSAPSE
GLLAKLDLIE LKGSKAVPID YKRGNVPDVP HQAWEPERVQ LCAQGLILKA NGYECDYGEL
YYIESRRRIR VQFDDTLIAR TRELVREMRH MASTRQIPAP LVDSPKCPKC SLVSICLPDE
TNCLRNSTRE DSAPESSESI RKLVPARDDA LPIYVQDQGT YIGKDGERLK LTPAKSSPLF
IPLIQVSQVC LMGNVQVTAA AIRELADRNI PISYFSYGGW FTALTSGMCH KNVELRMAQS
KAAFDPQAAL SIARGFISAK IKNSRTLLRR HADDKHRSDL DRLADYIQKV EQVDNLNSLM
GLEGMAAKTY FAGFSRLLRG GDEFNLEGRN RRPPTDPVNA LLSFVYSLLT KELTITTQAV
GFDPFLGFLH QPRYGRPSLA LDLAEEFRPL VGDSTVLTLI NNEEVSPKSF IRRAGSVALT
ETGRKAVIAA YERRMETEIT HPIFGYKISY RRLFEVQARL LSRVLLGELD KYPGFCTR
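
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal Python sketch of that step. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code (table 11 differs in which start codons are permitted). It also assumes the sequence is given in coding orientation, as it is on this page. The helper names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: first base varies slowest, third fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGATTAGCC CTGTCCCCGG CCCACCTTCT CTCGATCTCG AAGCGCTCTT CACAGGGGAA"
print(translate(first_line))  # MISPVPGPPSLDLEALFTGE
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the protein sequence and the GC content field listed on this page.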