Gene Daro_3371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3371 
Symbol 
ID3567239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3630039 
End bp3631190 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content63% 
IMG OID637681843 
Productpeptidase S1 and S6, chymotrypsin/Hap:PDZ/DHR/GLGF 
Protein accessionYP_286570 
Protein GI71908983 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family
[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones70 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAGGT TGTGGCAGAT TTTTGCACAA ACGGTCACCG TTGCGCTCGC CATTCTGTTC 
GTCGTCTCCA CGCTGAAACC GGAATGGCTG CCGCAACGAC AGGGTGTCGT TGCACTCCAG
GAAGCGCCGA CCACCGGCGA CGAGATCAAG AGCACACCGG GCTCCTATCG CGATGCGGCA
CGCGCCGCCT TGCCCTCGGT AGTACACATC TACACGACAC AGGAAATCAA GCAGCAGCGC
CACCCGCTGT TCGATGACCC GATCTTCCGC CATTTCTTTG GCGACCGCCC GGAAGGCCAA
CCGCAGCGTA ACTCCGGCCT CGGTTCCGGC GTCATCGTCA GCCCCAACGG CTACATCCTG
ACCAACTACC ACGTCATCGA AGGCGCCGAC GACATCCAGG TCTCGCTCAA CGACACCAAG
ACCTACAAGG CGAAAATCGT CGGGAGCGAC CCGGAATCCG ATCTCGCCAT CCTGCAAATA
AAGGCCGACA AGCTGCCGGC GATCACCTTC GGCCAGATGG ACAACCTGCG CGTCGGCGAT
GTCGTGCTCG CCATCGGCAA CCCGTTCGGT GTCGGCCAGA CAGTCACCAT GGGCATCGTC
TCCGCCCTCG GTCGCTCGCA CCTCGGCATC AACACCTTCG AGAACTTCAT CCAGACCGAC
GCCGCGATCA ACCCCGGCAA CTCCGGTGGC GCGCTGGTCG ACATTCACGG CAACCTGGTC
GGCATCAACT CGGCCATCTA CTCGCGCACC GGGGGCTCGC TCGGCATCGG CTTCGCCATC
CCGGTGTCCA GTGCCCGCAG CATCATGGAG CAGATCATCC GCACCGGCAC CGTGACCCGT
GGCTGGATCG GCGTCGAGGC CCAGGAAATC ACGCAGGAAC TGGCCGAATC CTTTGGCCTG
CCGGACAACG AAGGCGCCCT GATTGCGGGT GTGGTGCGCA GCAGCCCGGC CGACACAGCC
GGCATCCGCC CCGGCGATGT GCTGCTCTCG GTCGATGGCA AGCCTGTCCA GGATCCGCAA
GTCATGCTCG ACCTGATTGC CGCACTGACT CCGGAGGAAC GCTCCGCTTT CCGCCTGCGT
CGCGGCAAGA ACATCGTCGA GGTTCAGGTC AGGATCGGCA AGCGTCCGGC CATGCGGGCC
GAACAGGAGT AA
 
Protein sequence
MQRLWQIFAQ TVTVALAILF VVSTLKPEWL PQRQGVVALQ EAPTTGDEIK STPGSYRDAA 
RAALPSVVHI YTTQEIKQQR HPLFDDPIFR HFFGDRPEGQ PQRNSGLGSG VIVSPNGYIL
TNYHVIEGAD DIQVSLNDTK TYKAKIVGSD PESDLAILQI KADKLPAITF GQMDNLRVGD
VVLAIGNPFG VGQTVTMGIV SALGRSHLGI NTFENFIQTD AAINPGNSGG ALVDIHGNLV
GINSAIYSRT GGSLGIGFAI PVSSARSIME QIIRTGTVTR GWIGVEAQEI TQELAESFGL
PDNEGALIAG VVRSSPADTA GIRPGDVLLS VDGKPVQDPQ VMLDLIAALT PEERSAFRLR
RGKNIVEVQV RIGKRPAMRA EQE