Gene Daud_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1823 
Symbol 
ID6026698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1927051 
End bp1928874 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content53% 
IMG OID641594640 
ProductCRISPR-associated RAMP Crm2 family protein 
Protein accessionYP_001717951 
Protein GI169831969 
COG category[R] General function prediction only 
COG ID[COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) 
TIGRFAM ID[TIGR02577] CRISPR-associated protein, Crm2 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAGA TGCACCTTGT CGTTTTTCAC ATTGGACCGG TTCAGGATTT CATCGCTACC 
GCGCGGCGCA GTCGCGATCT TTGGTTTGGC TCGTGGTTGT TGAGTGAACT GGCAAAGGCC
GCCGCGCTGG AAATCGTGCA GCAGAATGGG AACGATATTT CCTGCCTTGT GTTTCCAGCC
CCTGATAAGT TAGAGCAGTT GAAGTCTTTC GACTTCAACG CGCCCAACAA AGTCGTAGCC
CGGGTAAGGT GCGATCCGGC GGAATTGCGC GAGTCCGTCC GGGAAGCCAT ACTCAGGCGC
CTTTGCGAAA TTTGTGATGA TGCCTACAAA AAGATTAAGG GCGGGTTTGA TCGTGAAATT
GCTAAGCGGC AGGTGGAAGA CTTTCCTGAG TTCTTCTGGG CTTCCTACCC CTTTAACGGT
AACTACAAAC AGGCGCGTGA CTTCGCCGAA GCCCTCCTGA ACGCCCGGAA GGTGACGCGC
AACTTCGCGC CCATTACGTG GGGAAGCCCC GCTCCCAAAT GTTCTCTCGA CGGGTGCCGG
GAGTCGGTCA TTACCGGGAG TGTATACAGT CGGATGAATG AAAGCCAGCT TTATGAAAAT
TACCGCGTGC AGCGTGGCGA GTATCTGTGC GGGGTATGTT TATTAAAACG TCACGGGAGG
CGCGGATCAG AGGAGTATTT TTTCAGTACT TCCCACGTGG CTGCTCTCCC TTTGCTGGAA
AGGCTCACCA ACCAGCACCA GCCTCTTGTG GATGCATACA TCGGAAAATT GAAGGAACTT
GGCATCACGG CGGATGCCCT GCAAACGGTG TGGCTGCATC ATCCCGTATT TGGTCCCCAT
GACGGCCAGT TGTTATTTGA AGAGCGCCTG CGCGAATTTT TCAAAGAAGA AGAAGAAGGG
AAACTGTTGC AAGCCAAAGA GGCGCTGCGC ACTTTCCTAA AACAAGCCTT TGACAGCAAA
AAGCCGCTTC CTTATTACAC GCTTCTCCTC GCCGACGGTG ATCACGTGGG TAAAGTTATT
GATACCCAGG AAACTCCCGA AAAGCACCAA GAACTTTCCC GTTCTCAGAG CCGCTTTGCC
CTAGAGGTCC GAGATATAGT ACAGCACCAC CAGGGTTCGC TTTTATATTC CGGTGGCGAT
GATGTGCTGG CTCTTGTGCC CCTGCACACG GTGCTGGCGT GCGCCCGTAG CCTGGCTGAG
ACGTTCCGCC AGCGGTTTGC TGGTTTTAAA GCAGAAGATG GCGGAAAGGA AATATCCCTT
ACCCTTTCGG TCGGCATTGC TGTTGTCCAC CATCTTGATC CCCTCTCTGA TGCCCTTGAG
TTGGCGCGGC GAGCGGAAAA AGCCGCTAAA TCCGTAGGTG GCAAAAACGC CTTGGCGGTA
ACGCTCAGCA AACGAGGTGG GGTGGAGCGG ACCGTTAAAG ATACGTGGGG CGTCCTGGAC
CGGCGGCTGG AACGGTTCAT TGACCTGCAT CGAGCCGAAG CCGTACCTGA TGGAGCCGCT
TACGAACTAC GCGACCTGGC CAGGCAGCTC GAGGCGTCAG ATGAAAATTT AAAGAGTACG
CTTCAAAAAG CAGCGTACGC AGAGGCTAAG CGTATTTTGC GCCGCAAGAA GGCCCGGCGT
GGTACTGAGC CAATAGCGGA AGATATTCTC AGTGAGCTAG AGGTTTTCCT TGACAAGGAT
AGGTTTTCTT TCGAAAAACT TAAGCAACTG GCTGACGAAC TCATCATTGC CCGAGAATTT
GCTGCCGCCA TGGATCTGAC TAATATACCC CTGCCTGTAC CCTCAACCGG CGAGGTGATA
AAAAATGACC GTCTGGATCA TTGA
 
Protein sequence
MAEMHLVVFH IGPVQDFIAT ARRSRDLWFG SWLLSELAKA AALEIVQQNG NDISCLVFPA 
PDKLEQLKSF DFNAPNKVVA RVRCDPAELR ESVREAILRR LCEICDDAYK KIKGGFDREI
AKRQVEDFPE FFWASYPFNG NYKQARDFAE ALLNARKVTR NFAPITWGSP APKCSLDGCR
ESVITGSVYS RMNESQLYEN YRVQRGEYLC GVCLLKRHGR RGSEEYFFST SHVAALPLLE
RLTNQHQPLV DAYIGKLKEL GITADALQTV WLHHPVFGPH DGQLLFEERL REFFKEEEEG
KLLQAKEALR TFLKQAFDSK KPLPYYTLLL ADGDHVGKVI DTQETPEKHQ ELSRSQSRFA
LEVRDIVQHH QGSLLYSGGD DVLALVPLHT VLACARSLAE TFRQRFAGFK AEDGGKEISL
TLSVGIAVVH HLDPLSDALE LARRAEKAAK SVGGKNALAV TLSKRGGVER TVKDTWGVLD
RRLERFIDLH RAEAVPDGAA YELRDLARQL EASDENLKST LQKAAYAEAK RILRRKKARR
GTEPIAEDIL SELEVFLDKD RFSFEKLKQL ADELIIAREF AAAMDLTNIP LPVPSTGEVI
KNDRLDH