Gene Daro_1796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1796 
Symbol 
ID3568345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1932366 
End bp1934255 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content60% 
IMG OID637680265 
Productpeptidase U32 
Protein accessionYP_285013 
Protein GI71907426 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.191326 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGAA TTCAACACCC TCTCGAACTC CTCGCGCCGG CCAAGAATGC CGATTTCGGC 
ATTGAAGCCA TCAAGCACGG GGCCGATGCC GTCTATATCG GCGGCCCGAC TTTCGGCGCC
CGTTACGGCG CCGGCAACAG CGTCGCTGAA ATCCGACGCC TGTGTGATTT TGCCCACCGC
TACCGGGCAA AAGTCTTCGT CGCGCTGAAC ACCATCCTGC ACGACGACGA ACTGGAAGGT
GCCCGACAGC TGGCCTGGGA ACTCTATGAA GCTGGCACCG ATGCGCTGAT CATTCAGGAC
ATGGGGCTGC TCGAAACCGA CCTGCCGCCG ATCCAGTTGC ACGCCAGCAC GCAAACTGAC
AACCGCAACC CGGCCAAGGT GAAGTTCCTC GAAGATGCCG GTTTTTCGCA AGTGGTCCTG
GCCCGCGAAA TGTCGATCCA GGAAGTCCAT GCCGTCGCCA GCGAAACCAC CCTCGCCCTC
GAATACTTTG TGCATGGCGC GCTGTGCGTC GCCTTTAGCG GCCAGTGCTA CATCAGCCAG
GCCCACACCG GGCGCAGCGC CAATCGCGGC GAATGCTCGC AGGCCTGCCG CCTGCCCTAC
ACGCTGGTCG ACGACAAGGG CAAGACGATC ACCGAGAACC AGCATCTGCT GTCGATGAAG
GACAACAACC AGACCGACAA CATGCTGGAA CTGGCCCGTG CCGGGGTCAG TTCGTTCAAG
ATCGAGGGGC GCTTGAAAGA CCTCTCCTAC GTCAAGAACA TCACGGCGCA TTACCGCACG
CTGCTCGACG AGATCATCGC CAACAACCCG GAATTCAGTC GAGCTTCCAG CGGCCACAGC
ACTTTCTCCT TCACGCCGCA GCCGGACAAA ACCTTCAATC GTGGCTATAC CGATTACTTT
GCCGGTGGCC GGCAGGACGA CATTGGTGCC TTCGATTCGC CAGCCTTCGT TGGCGAGTTG
ATTGGTGAGG TAGCCGATAT CGGCGATGGC TGGTTCATGG TCAATACCGA CCAGGATTTC
CACAACGGCG ACGGCGTCTG CTTCTACGAC GCCAATGGCG ACGTACTCGG CATGCGCATC
AACCGGGCCG AAGGCAAGAA ACTGTTCCCG GCCGAAATGC CGGAAGAACT GACCGAAGGC
GCCACGCTGT TCCGCAACCG CGACCAGGAA TTCGAACGGG CGCTGGAAAA GGACTCGGCC
GACCGGCGCA TTTCGGTGAA GCCGGTTTTC TCTGAAACCG CCAATGGCTT CCGCCTGACG
CTGACAGACG AAGATGGCGT GACCGTTGGC GTTGATCTGC CGAAAAGCGA GAAAATCGGC
CGCGAAGTTG CCCAGAACGC CGATCAGGCA CTGGCCAAAC TCAAGGAAAA TCTCGGCAAG
TTTGGCAACA CCATGTTTGT CGCAGAGCCG GTCGAACTGC AATTGTCGCA GCCTTGGTTT
CTGCCGGTCA GTGCGATCAA CGCCCTGCGC CGCGAAGCGA CCGAAAAGCT CGAAGCCGCC
CGTCTCGCCA GCCACCCGCG CCCGCCGCGC GCCCAGCCGG CCGACCATCC GGTGCCCTAC
CCACAGGACG AGCTGACCTA TCTCGGCAAC GTCTTTAACG CCAAGGCACG CGCCTTCTAT
GAAAAACATG GCGTCAAGCT GATCGAGGAA GCCTACGAGG CCGGCAACGA AAAAGGCATG
GTCTCGCTGA TGATCACCCG CCACTGCCTG CGCTACAGCT TCAATCTGTG CCCGAAGGAA
GTTAAGCACC TGAAGCCGGA CCCGATGACG CTGGTCAATG GCAACGAAAA GCTGATCCTC
AAGTTCGACT GCAAGGCTTG CGAAATGCAC GTCATCGGCA AGATGAAAAA GGGCGTCAAA
CTGAACCTGG GAACCATTCG TCCGGCATAA
 
Protein sequence
MTRIQHPLEL LAPAKNADFG IEAIKHGADA VYIGGPTFGA RYGAGNSVAE IRRLCDFAHR 
YRAKVFVALN TILHDDELEG ARQLAWELYE AGTDALIIQD MGLLETDLPP IQLHASTQTD
NRNPAKVKFL EDAGFSQVVL AREMSIQEVH AVASETTLAL EYFVHGALCV AFSGQCYISQ
AHTGRSANRG ECSQACRLPY TLVDDKGKTI TENQHLLSMK DNNQTDNMLE LARAGVSSFK
IEGRLKDLSY VKNITAHYRT LLDEIIANNP EFSRASSGHS TFSFTPQPDK TFNRGYTDYF
AGGRQDDIGA FDSPAFVGEL IGEVADIGDG WFMVNTDQDF HNGDGVCFYD ANGDVLGMRI
NRAEGKKLFP AEMPEELTEG ATLFRNRDQE FERALEKDSA DRRISVKPVF SETANGFRLT
LTDEDGVTVG VDLPKSEKIG REVAQNADQA LAKLKENLGK FGNTMFVAEP VELQLSQPWF
LPVSAINALR REATEKLEAA RLASHPRPPR AQPADHPVPY PQDELTYLGN VFNAKARAFY
EKHGVKLIEE AYEAGNEKGM VSLMITRHCL RYSFNLCPKE VKHLKPDPMT LVNGNEKLIL
KFDCKACEMH VIGKMKKGVK LNLGTIRPA