Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0400 |
Symbol | |
ID | 5711316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 391798 |
End bp | 395037 |
Gene Length | 3240 bp |
Protein Length | 1079 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641266305 |
Product | CRISPR-associated protein |
Protein accession | YP_001531750 |
Protein GI | 159042956 |
COG category | [S] Function unknown |
COG ID | [COG3513] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01865] CRISPR-associated protein, Csn1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.519358 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTGG GCCTTGACAT AGGAACCAGC TCAATCGGCT GGTGGCTGTA CGAAACCGAC GGCGCCGGGT CGGATGCCCG CATAACCGGT GTCGTGGATG GGGGCGTGCG TATCTTCTCC GACGGGCGCG ATCCGAAATC CGGTGCATCG CTGGCCGTGG ACCGTCGGGC CGCCCGTGCC ATGCGCCGCC GCCGCGACCG GTATCTGCGT CGGCGCGCCA CCCTGATGAA AGTGCTTGCC GAGACAGGCC TGATGCCCGC GGACCCGGCC GAGGCGAAGG CGCTGGAAGC CCTTGATCCC TTCGCCCTGC GGGCCGCCGG GCTGGACGAA CCCCTGCCCT TGCCCCATCT CGGTCGCGCG CTGTTTCACC TCAACCAGCG GCGGGGCTTC AAGTCGAACC GCAAGACCGA TCGGGGCGAC AACGAAAGCG GCAAGATCAA GGACGCGACC GCGCGGCTCG ACATGGAGAT GATGGCCAAC GGCGCCCGCA CCTATGGCGA GTTCCTGCAC AAGCGCCGGC AGAAGGCGAC CGATCCGCGC CATGTGCCCA GCGTGCGCAC CCGCCTGTCG ATTGCCAATC GCGGTGGGCC GGATGGCAAG GAAGAAGCCG GGTATGATTT CTACCCGGAC CGGCGGCACC TGGAGGAGGA GTTTCACAAG CTCTGGGCCG CGCAGGGGGC CCATCACCCG GAACTGACCG AGACCCTGCG CGATCTGCTG TTCGAAAAGA TCTTCTTTCA GCGGCCCCTG AAAGAACCCG AGGTCGGGCT TTGCCTCTTC TCCGGCCATC ACGGGGTTCC GCCGAAAGAC CCCCGCTTGC CCAAGGCGCA CCCCCTGACA CAGCGCCGGG TGCTCTACGA GACAGTCAAC CAGCTACGTG TCACGGCGGA CGGGCGCGAG GCTCGTCCCC TGACGCGGGA AGAGCGCGAC CAGGTCATCC ACGCGCTCGA CAACAAGAAG CCGACGAAAT CGCTTTCGTC GATGGTCCTG AAACTGCCCG CGCTGGCGAA GGTTCTGAAA CTCCGCGATG GTGAGCGGTT CACCCTGGAA ACCGGCGTGC GAGACGCCAT CGCCTGCGAC CCGTTGCGCG CCAGCCCTGC CCATCCGGAC CGCTTCGGCC CGCGCTGGTC CATCCTTGAT GCCGATGCCC AATGGGAGGT GATCTCGCGC ATCCGCCGAG TGCAGAGCGA TGCCGAACAT GCCGCCTTGG TGGATTGGCT GACCGAGGCA CACGGGCTCG ACCGCGCGCA TGCTGAGGCG ACGGCCCATG CCCCCCTGCC CGATGGCTAC GGACGGCTGG GCCTGACCGC GACGACCCGT ATCCTCTACC AGCTTACGGC GGACGTGGTG ACCTACGCCG ATGCGGTCAA GGCCTGCGGC TGGCACCATT CCGACGGGCG GACCGGCGAA TGCTTCGACC GCCTGCCCTA TTACGGCGAA GTGCTCGAAC GCCACGTCAT TCCCGGCAGC TACCACCCCG ACGACGACGA CATCACGCGG TTCGGCCGGA TCACCAATCC GACGGTTCAT ATTGGCCTGA ACCAGCTTCG CCGCCTCGTG AACAGGATCA TTGAAACCCA TGGCAAGCCC CATCAAATCG TCGTGGAGCT TGCCCGCGAT CTCAAGAAGT CCGAAGAGCA GAAGCGCGCC GACATCAAGC GTATCCGCGA CACGACCGAG GCGGCGAAGA AGCGGAGCGA AAAGCTCGAA GAGCTGGAGA TCGAAGACAA TGGCCGCAAC CGGATGCTCC TGCGGCTGTG GGAGGATCTG AACCCCGACG ATGCGATGCG CCGGTTCTGC CCCTACACGG GCACGCGTAT CTCGGCCGCC ATGATTTTCG ACGGCAGCTG CGACGTGGAT CACATCCTGC CCTATTCGCG CACGCTGGAC GACAGTTTCC CCAACCGGAC GCTCTGCCTG CGCGAAGCCA ACCGGCAGAA GCGCAACCAG ACCCCATGGC AGGCCTGGGG CGATACCCCG CACTGGCACG CCATCGCCGC CAACCTGAAG AACCTGCCCG AGAACAAGCG CTGGCGTTTC GCCCCGGATG CTATGACGCG GTTCGAGGGC GAAAACGGGT TCCTCGATCG GGCGCTAAAG GACACGCAAT ACCTCGCCCG GATCTCTCGC AGCTACCTCG ACACGCTCTT CACCAAGGGT GGGCACGTCT GGGTGGTGCC TGGTCGGTTT ACTGAAATGC TCCGTAGACA CTGGGGCCTG AACTCTCTGC TTAGCGATGC CGGGCGCGGC GCGGTAAAGG CCAAGAACCG AACCGACCAC CGCCATCACG CCATCGATGC CGCCGTGATC GCCGCAACCG ACCCTGGCCT GCTCAACCGG ATCAGCCGCG CTGCGGGTCA GGGCGAGGCA GCCGGCCAGT CCGCGGAACT GATCGCCCGG GACACACCCC CGCCCTGGGA AGGCTTCCGC GACGACCTGC GCGTCCGGCT CGATCGGATC ATCGTCAGCC ACCGCGCGGA TCACGGGCGC ATCGACCACG CGGCCCGCAA ACAGGGCCGG GACAGCACCG CGGGGCAATT GCACCAGGAG ACAGCCTATT CCATCGTCGA TGATATCCAC GTTGCCAGCC GGACCGATCT TCTGAGCTTG AAACCCGCGC AGCTGCTGGA TGAGCCGGGC CGAAGCGGCC AGGTGCGCGA CCCGCAGTTG CGAAAGGCCT TGCGCGTGGC GACTGGCGGC AAGACCGGCA AGGATTTCGA GAACGCCCTG CGCTACTTCG CGTCCAAGCC CGGCCCATAT CAGGCCATCC GCCGGGTGCG CATCATCAAG CCGCTGCAAG CGCAGGCGCG CGTTCCCGTG CCCGCGCAGG ACCCGATCAA GGCCTACCAG GGCGGCAGCA ACCACCTGTT CGAGATCTGG CGACTTCCGG ACGGAGAGAT CGAAGCACAG GTCATCACGA GTTTCGAGGC CCATACCCTC GAAGGCGAGA AGCGTCCTCA CCCTGCGGCC AAGCGTCTTT TGCGTGTGCA CAAGGGGGAC ATGGTGGCTT TGGAACGGGA CGGTCGACGG GTCGTCGGAC ATGTCCAAAA AATGGACATT GCAAACGGCT TGTTTATTGT CCCGCACAAT GAGGCGAACG CCGATACACG CAACAACGAC AAATCGGACC CGTTCAAATG GATTCAAATC GGCGCGCGGC CCGCGATCGC ATCCGGAATT CGCCGCGTAT CCGTCGATGA AATCGGCCGC CTTCGCGACG GGGGAACCAG GCCCATATAG
|
Protein sequence | MRLGLDIGTS SIGWWLYETD GAGSDARITG VVDGGVRIFS DGRDPKSGAS LAVDRRAARA MRRRRDRYLR RRATLMKVLA ETGLMPADPA EAKALEALDP FALRAAGLDE PLPLPHLGRA LFHLNQRRGF KSNRKTDRGD NESGKIKDAT ARLDMEMMAN GARTYGEFLH KRRQKATDPR HVPSVRTRLS IANRGGPDGK EEAGYDFYPD RRHLEEEFHK LWAAQGAHHP ELTETLRDLL FEKIFFQRPL KEPEVGLCLF SGHHGVPPKD PRLPKAHPLT QRRVLYETVN QLRVTADGRE ARPLTREERD QVIHALDNKK PTKSLSSMVL KLPALAKVLK LRDGERFTLE TGVRDAIACD PLRASPAHPD RFGPRWSILD ADAQWEVISR IRRVQSDAEH AALVDWLTEA HGLDRAHAEA TAHAPLPDGY GRLGLTATTR ILYQLTADVV TYADAVKACG WHHSDGRTGE CFDRLPYYGE VLERHVIPGS YHPDDDDITR FGRITNPTVH IGLNQLRRLV NRIIETHGKP HQIVVELARD LKKSEEQKRA DIKRIRDTTE AAKKRSEKLE ELEIEDNGRN RMLLRLWEDL NPDDAMRRFC PYTGTRISAA MIFDGSCDVD HILPYSRTLD DSFPNRTLCL REANRQKRNQ TPWQAWGDTP HWHAIAANLK NLPENKRWRF APDAMTRFEG ENGFLDRALK DTQYLARISR SYLDTLFTKG GHVWVVPGRF TEMLRRHWGL NSLLSDAGRG AVKAKNRTDH RHHAIDAAVI AATDPGLLNR ISRAAGQGEA AGQSAELIAR DTPPPWEGFR DDLRVRLDRI IVSHRADHGR IDHAARKQGR DSTAGQLHQE TAYSIVDDIH VASRTDLLSL KPAQLLDEPG RSGQVRDPQL RKALRVATGG KTGKDFENAL RYFASKPGPY QAIRRVRIIK PLQAQARVPV PAQDPIKAYQ GGSNHLFEIW RLPDGEIEAQ VITSFEAHTL EGEKRPHPAA KRLLRVHKGD MVALERDGRR VVGHVQKMDI ANGLFIVPHN EANADTRNND KSDPFKWIQI GARPAIASGI RRVSVDEIGR LRDGGTRPI
|
| |