Gene Daro_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2024 
Symbol 
ID3566971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2176721 
End bp2178142 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content55% 
IMG OID637680495 
Productpeptidase S1C, Do 
Protein accessionYP_285239 
Protein GI71907652 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.115837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGCT TTGTGGCTCT GTGTTCTCTT TGCTTCTTGT GCTCACTGCC TGTTGTTGCC 
CAGACGCGTG GCTTACCCGA TTTTTCCGAG TTGGCCGAAA AGCAGGGACC GGCAGTCGTC
AATATCAGCA CGACACAGAT CGTCCGCGGG CAGGCACAAA TGATGCCGTT CCCGTTTGAC
GAAAATGACC CTGCCTTCGA ATTCTTCAAG CGCTTCATTC CACGCAACCC AGGTGGCGCC
GTACCGCGTG ACTTCGAAAA CAAGTCACTC GGTTCCGGCT TCATTATCAG CGGTGATGGC
TATATCCTGA CGAACGCGCA CGTCGTTGAT GGGGCCGATG AAGTTGCGGT ACGCCTGACG
GACAAGCGTG AATTCAAGGC CAAGATCATC GGTGCAGACA AACGAACCGA TGTTGCGTTG
ATCAAGATTG AAGCTACCGG ACTTCCTGCT GCCAAGCTGG GTGATCCGGG CCAAATCAAG
GTAGGGGAAT GGGTGGTCGC CATTGGCTCG CCATTCGGGT TCGACAACTC GGTAACCGCT
GGTATTGTTT CCGCCAAGGG GCGCTCTCTG CCCCAGGAAA ACTACGTTCC TTTCATCCAG
ACCGATGTCG CGATTAACCC TGGTAATTCC GGCGGCCCAC TGTTCAATAT GCGTGGCGAA
GTGGTTGGTA TCAATTCGCA GATTTACAGC CGTAGTGGTG GCTACATGGG GGTTTCCTTC
GCCATTCCTA TCGATGTGGC GATGGACATC CAGAATCAGT TGCGCGCTTC CGGCAAGGTT
AGTCGTGGTC GCCTGGGGGT GGTGATTCAG GAGGTTAACA AGGAACTCGC CGATTCGCTG
GGCTTGACCA AACCGATTGG CGCTGTGGTG AACTCTGTGG AAAAGGGCGG GCCAGCTGAA
CGAGCCGGCA TAGAGGCTGG CGACGTAATC CTGAAATTTG ATGGCAAGAC GATCAACAAT
TCGGCTGACT TGCCACGCAT GGTGGGTGCA ACCAGGCCGG GTGGCCGCTC TGTAGTTCAG
GTTTGGCGCA AGGGGGCGAC ACGCGATATC GGTGTCACGA TAGGTGAGGT TCCTGATGAA
AAGCAGGCAA ATACAAAGGT GCCACGCGGG AAACTCTCTG AACAGACGGC CAATCGCCTC
GGTTTGGTCG TCAGCGAACT GACTCCTGAT CAAAAACGCG AGCTGAAAAT GAATTCCGGC
CTGCTAATCG AAGATGTTCG CGGGCAGGGG GCTCGTACCG ATCTGCGTGC TGGTGACATT
GTGATTGCAG TGATTGCGAA AGGTGCTACC ACTGAGGTCA AGACGGTGGA CCAGTTCAAT
AAGCTACTGG CTCAGTTTGA GAAAGGTAGT AACGTGACCC TGCTTGTTCG TCGTGGCGAA
ATGCAGACTT TCATCACCGT AAAAGGCTTG AACGGTGGTT GA
 
Protein sequence
MKRFVALCSL CFLCSLPVVA QTRGLPDFSE LAEKQGPAVV NISTTQIVRG QAQMMPFPFD 
ENDPAFEFFK RFIPRNPGGA VPRDFENKSL GSGFIISGDG YILTNAHVVD GADEVAVRLT
DKREFKAKII GADKRTDVAL IKIEATGLPA AKLGDPGQIK VGEWVVAIGS PFGFDNSVTA
GIVSAKGRSL PQENYVPFIQ TDVAINPGNS GGPLFNMRGE VVGINSQIYS RSGGYMGVSF
AIPIDVAMDI QNQLRASGKV SRGRLGVVIQ EVNKELADSL GLTKPIGAVV NSVEKGGPAE
RAGIEAGDVI LKFDGKTINN SADLPRMVGA TRPGGRSVVQ VWRKGATRDI GVTIGEVPDE
KQANTKVPRG KLSEQTANRL GLVVSELTPD QKRELKMNSG LLIEDVRGQG ARTDLRAGDI
VIAVIAKGAT TEVKTVDQFN KLLAQFEKGS NVTLLVRRGE MQTFITVKGL NGG