Gene Daro_2549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2549 
Symbol 
ID3567525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2747259 
End bp2748728 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content57% 
IMG OID637681016 
Productpeptidase C14, caspase catalytic subunit p20 
Protein accessionYP_285752 
Protein GI71908165 
COG category[R] General function prediction only 
COG ID[COG4249] Uncharacterized protein containing caspase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value0.98894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.217885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGGC GGCTAACCGC CCTCGTCATC GGCAACGGCG CCTACGAGGA TGCCAGCGAA 
CTCGAAAACC CCGTCAACGA CGCCGAGGAT GTTGCCGCAA AGCTCGAGGC TTGTGGCTTC
TCGGTGATCA AAGAAATCGA CTGCACAGCC GCTGCAATGG ACCGAGCCCT CAAGCGATTC
AAGGGAGAAC TGCCAGACAA CGATGTCGGC CTTTTCTTCT TTGCCGGGCA TGGCATGCAG
ATCGAAGGCG AGAACTATCT GGCGGCGGTG GACACCGATA CCGCCGGGGA AGTTGAGGCC
AAATACTCTT CGCTGCCCTT GAACCGGGTC ATCGAAACCA TGGAAAAGGC GGCAACGTCG
ACCAGCATCA CCATCCTGGA TGCCTGTCGC GACAATCCGT TCGAACGGGC CTGGCATCGT
TCGGCGGCAA CCCGCGGCCT GGCCCCCGTG TATGCCCCCA AAGGGACCTT GATCGCCTAT
GCCACTTCGC CAGGCCAAAC CGCCAGCGAT GGGCACGGAC GCAATGGGGC GTATACCGCT
GCATTACTTC AACATATTGC CACTCCCGAC TGTTCGATCG AGAACATGTT CAAGCGGGTC
CGCAACACGC TCAGTGCCGC CACACATGGA AAACAGATTT CTTGGGAGCA TACCTCGCTA
TCCGGCGAGT TCTACTTCAA CCTGAGCCTT GGGGCTCGCA TTGACGACTA CTCCGACAGC
GCGCTCAGCG ACGGCCTGTT CGTGCCCGAC GAAGCCAAAG CATCTCATCG GATTATCAAG
GCTCTGAAAA GCCTGACTTG GCCAGTGCAG AATCCGGCTA TTGACGGGTT CTTGTCCGAT
ATCGCCAACA AGGCGTCGCT GGACTCTCTC TTCGTTCTCG GGCGGAACAT CTATCAGGCG
GCATGCGGCG GATCGAACAG TGCCATTGCC TACCTGAGCG ACTTCGCCGC CCGGACCCAG
GCGGCGAAAC CCGAGAAACG AAAAGCGCTA CTGGACGGCA TGCTGTTCGA GGTCTTCTTC
GACCCCAAGG CAAAACTCCG AAAAGACTTC AAGACCCGCA GGTTCGAGGA TCTATTCGCC
CTCCAGCAGC ATAAAAACCT CTCGTCCAGC TTTGACTTTA TCACCGAATG TCTGCTTCCC
GAGGCCGGCC GTTTCTACTC GACCCCTGGC AGAAAACACC CTGTGGTGGT CGATGTCGCG
ACGACTCCCG ATAGTGCTGC CAATACGTAT CGACTTAAGT CAATCCATTG CGGCGGTACT
AGCATCATGT GGTTGGAGGA TGAGGACTAC GCAGTCGAAC CGGGGGAAAT CCCGAATGCC
GAAAAGATGA CCATCGCCAA GTTTGAGGCG CGACTGGCCG AACAAATGGC GGTTCCTTCC
CATTTACTGA CCATCAATTA CCTTTCGTTC GACAAACAGG CTCATGAACG CATCCTGTTC
CCCTATGGCT GGACGGTTCG GAAACGATAA
 
Protein sequence
MSRRLTALVI GNGAYEDASE LENPVNDAED VAAKLEACGF SVIKEIDCTA AAMDRALKRF 
KGELPDNDVG LFFFAGHGMQ IEGENYLAAV DTDTAGEVEA KYSSLPLNRV IETMEKAATS
TSITILDACR DNPFERAWHR SAATRGLAPV YAPKGTLIAY ATSPGQTASD GHGRNGAYTA
ALLQHIATPD CSIENMFKRV RNTLSAATHG KQISWEHTSL SGEFYFNLSL GARIDDYSDS
ALSDGLFVPD EAKASHRIIK ALKSLTWPVQ NPAIDGFLSD IANKASLDSL FVLGRNIYQA
ACGGSNSAIA YLSDFAARTQ AAKPEKRKAL LDGMLFEVFF DPKAKLRKDF KTRRFEDLFA
LQQHKNLSSS FDFITECLLP EAGRFYSTPG RKHPVVVDVA TTPDSAANTY RLKSIHCGGT
SIMWLEDEDY AVEPGEIPNA EKMTIAKFEA RLAEQMAVPS HLLTINYLSF DKQAHERILF
PYGWTVRKR