Gene Dshi_0400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0400 
Symbol 
ID5711316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp391798 
End bp395037 
Gene Length3240 bp 
Protein Length1079 aa 
Translation table11 
GC content65% 
IMG OID641266305 
ProductCRISPR-associated protein 
Protein accessionYP_001531750 
Protein GI159042956 
COG category[S] Function unknown 
COG ID[COG3513] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01865] CRISPR-associated protein, Csn1 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.519358 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTGG GCCTTGACAT AGGAACCAGC TCAATCGGCT GGTGGCTGTA CGAAACCGAC 
GGCGCCGGGT CGGATGCCCG CATAACCGGT GTCGTGGATG GGGGCGTGCG TATCTTCTCC
GACGGGCGCG ATCCGAAATC CGGTGCATCG CTGGCCGTGG ACCGTCGGGC CGCCCGTGCC
ATGCGCCGCC GCCGCGACCG GTATCTGCGT CGGCGCGCCA CCCTGATGAA AGTGCTTGCC
GAGACAGGCC TGATGCCCGC GGACCCGGCC GAGGCGAAGG CGCTGGAAGC CCTTGATCCC
TTCGCCCTGC GGGCCGCCGG GCTGGACGAA CCCCTGCCCT TGCCCCATCT CGGTCGCGCG
CTGTTTCACC TCAACCAGCG GCGGGGCTTC AAGTCGAACC GCAAGACCGA TCGGGGCGAC
AACGAAAGCG GCAAGATCAA GGACGCGACC GCGCGGCTCG ACATGGAGAT GATGGCCAAC
GGCGCCCGCA CCTATGGCGA GTTCCTGCAC AAGCGCCGGC AGAAGGCGAC CGATCCGCGC
CATGTGCCCA GCGTGCGCAC CCGCCTGTCG ATTGCCAATC GCGGTGGGCC GGATGGCAAG
GAAGAAGCCG GGTATGATTT CTACCCGGAC CGGCGGCACC TGGAGGAGGA GTTTCACAAG
CTCTGGGCCG CGCAGGGGGC CCATCACCCG GAACTGACCG AGACCCTGCG CGATCTGCTG
TTCGAAAAGA TCTTCTTTCA GCGGCCCCTG AAAGAACCCG AGGTCGGGCT TTGCCTCTTC
TCCGGCCATC ACGGGGTTCC GCCGAAAGAC CCCCGCTTGC CCAAGGCGCA CCCCCTGACA
CAGCGCCGGG TGCTCTACGA GACAGTCAAC CAGCTACGTG TCACGGCGGA CGGGCGCGAG
GCTCGTCCCC TGACGCGGGA AGAGCGCGAC CAGGTCATCC ACGCGCTCGA CAACAAGAAG
CCGACGAAAT CGCTTTCGTC GATGGTCCTG AAACTGCCCG CGCTGGCGAA GGTTCTGAAA
CTCCGCGATG GTGAGCGGTT CACCCTGGAA ACCGGCGTGC GAGACGCCAT CGCCTGCGAC
CCGTTGCGCG CCAGCCCTGC CCATCCGGAC CGCTTCGGCC CGCGCTGGTC CATCCTTGAT
GCCGATGCCC AATGGGAGGT GATCTCGCGC ATCCGCCGAG TGCAGAGCGA TGCCGAACAT
GCCGCCTTGG TGGATTGGCT GACCGAGGCA CACGGGCTCG ACCGCGCGCA TGCTGAGGCG
ACGGCCCATG CCCCCCTGCC CGATGGCTAC GGACGGCTGG GCCTGACCGC GACGACCCGT
ATCCTCTACC AGCTTACGGC GGACGTGGTG ACCTACGCCG ATGCGGTCAA GGCCTGCGGC
TGGCACCATT CCGACGGGCG GACCGGCGAA TGCTTCGACC GCCTGCCCTA TTACGGCGAA
GTGCTCGAAC GCCACGTCAT TCCCGGCAGC TACCACCCCG ACGACGACGA CATCACGCGG
TTCGGCCGGA TCACCAATCC GACGGTTCAT ATTGGCCTGA ACCAGCTTCG CCGCCTCGTG
AACAGGATCA TTGAAACCCA TGGCAAGCCC CATCAAATCG TCGTGGAGCT TGCCCGCGAT
CTCAAGAAGT CCGAAGAGCA GAAGCGCGCC GACATCAAGC GTATCCGCGA CACGACCGAG
GCGGCGAAGA AGCGGAGCGA AAAGCTCGAA GAGCTGGAGA TCGAAGACAA TGGCCGCAAC
CGGATGCTCC TGCGGCTGTG GGAGGATCTG AACCCCGACG ATGCGATGCG CCGGTTCTGC
CCCTACACGG GCACGCGTAT CTCGGCCGCC ATGATTTTCG ACGGCAGCTG CGACGTGGAT
CACATCCTGC CCTATTCGCG CACGCTGGAC GACAGTTTCC CCAACCGGAC GCTCTGCCTG
CGCGAAGCCA ACCGGCAGAA GCGCAACCAG ACCCCATGGC AGGCCTGGGG CGATACCCCG
CACTGGCACG CCATCGCCGC CAACCTGAAG AACCTGCCCG AGAACAAGCG CTGGCGTTTC
GCCCCGGATG CTATGACGCG GTTCGAGGGC GAAAACGGGT TCCTCGATCG GGCGCTAAAG
GACACGCAAT ACCTCGCCCG GATCTCTCGC AGCTACCTCG ACACGCTCTT CACCAAGGGT
GGGCACGTCT GGGTGGTGCC TGGTCGGTTT ACTGAAATGC TCCGTAGACA CTGGGGCCTG
AACTCTCTGC TTAGCGATGC CGGGCGCGGC GCGGTAAAGG CCAAGAACCG AACCGACCAC
CGCCATCACG CCATCGATGC CGCCGTGATC GCCGCAACCG ACCCTGGCCT GCTCAACCGG
ATCAGCCGCG CTGCGGGTCA GGGCGAGGCA GCCGGCCAGT CCGCGGAACT GATCGCCCGG
GACACACCCC CGCCCTGGGA AGGCTTCCGC GACGACCTGC GCGTCCGGCT CGATCGGATC
ATCGTCAGCC ACCGCGCGGA TCACGGGCGC ATCGACCACG CGGCCCGCAA ACAGGGCCGG
GACAGCACCG CGGGGCAATT GCACCAGGAG ACAGCCTATT CCATCGTCGA TGATATCCAC
GTTGCCAGCC GGACCGATCT TCTGAGCTTG AAACCCGCGC AGCTGCTGGA TGAGCCGGGC
CGAAGCGGCC AGGTGCGCGA CCCGCAGTTG CGAAAGGCCT TGCGCGTGGC GACTGGCGGC
AAGACCGGCA AGGATTTCGA GAACGCCCTG CGCTACTTCG CGTCCAAGCC CGGCCCATAT
CAGGCCATCC GCCGGGTGCG CATCATCAAG CCGCTGCAAG CGCAGGCGCG CGTTCCCGTG
CCCGCGCAGG ACCCGATCAA GGCCTACCAG GGCGGCAGCA ACCACCTGTT CGAGATCTGG
CGACTTCCGG ACGGAGAGAT CGAAGCACAG GTCATCACGA GTTTCGAGGC CCATACCCTC
GAAGGCGAGA AGCGTCCTCA CCCTGCGGCC AAGCGTCTTT TGCGTGTGCA CAAGGGGGAC
ATGGTGGCTT TGGAACGGGA CGGTCGACGG GTCGTCGGAC ATGTCCAAAA AATGGACATT
GCAAACGGCT TGTTTATTGT CCCGCACAAT GAGGCGAACG CCGATACACG CAACAACGAC
AAATCGGACC CGTTCAAATG GATTCAAATC GGCGCGCGGC CCGCGATCGC ATCCGGAATT
CGCCGCGTAT CCGTCGATGA AATCGGCCGC CTTCGCGACG GGGGAACCAG GCCCATATAG
 
Protein sequence
MRLGLDIGTS SIGWWLYETD GAGSDARITG VVDGGVRIFS DGRDPKSGAS LAVDRRAARA 
MRRRRDRYLR RRATLMKVLA ETGLMPADPA EAKALEALDP FALRAAGLDE PLPLPHLGRA
LFHLNQRRGF KSNRKTDRGD NESGKIKDAT ARLDMEMMAN GARTYGEFLH KRRQKATDPR
HVPSVRTRLS IANRGGPDGK EEAGYDFYPD RRHLEEEFHK LWAAQGAHHP ELTETLRDLL
FEKIFFQRPL KEPEVGLCLF SGHHGVPPKD PRLPKAHPLT QRRVLYETVN QLRVTADGRE
ARPLTREERD QVIHALDNKK PTKSLSSMVL KLPALAKVLK LRDGERFTLE TGVRDAIACD
PLRASPAHPD RFGPRWSILD ADAQWEVISR IRRVQSDAEH AALVDWLTEA HGLDRAHAEA
TAHAPLPDGY GRLGLTATTR ILYQLTADVV TYADAVKACG WHHSDGRTGE CFDRLPYYGE
VLERHVIPGS YHPDDDDITR FGRITNPTVH IGLNQLRRLV NRIIETHGKP HQIVVELARD
LKKSEEQKRA DIKRIRDTTE AAKKRSEKLE ELEIEDNGRN RMLLRLWEDL NPDDAMRRFC
PYTGTRISAA MIFDGSCDVD HILPYSRTLD DSFPNRTLCL REANRQKRNQ TPWQAWGDTP
HWHAIAANLK NLPENKRWRF APDAMTRFEG ENGFLDRALK DTQYLARISR SYLDTLFTKG
GHVWVVPGRF TEMLRRHWGL NSLLSDAGRG AVKAKNRTDH RHHAIDAAVI AATDPGLLNR
ISRAAGQGEA AGQSAELIAR DTPPPWEGFR DDLRVRLDRI IVSHRADHGR IDHAARKQGR
DSTAGQLHQE TAYSIVDDIH VASRTDLLSL KPAQLLDEPG RSGQVRDPQL RKALRVATGG
KTGKDFENAL RYFASKPGPY QAIRRVRIIK PLQAQARVPV PAQDPIKAYQ GGSNHLFEIW
RLPDGEIEAQ VITSFEAHTL EGEKRPHPAA KRLLRVHKGD MVALERDGRR VVGHVQKMDI
ANGLFIVPHN EANADTRNND KSDPFKWIQI GARPAIASGI RRVSVDEIGR LRDGGTRPI