Gene Daro_2009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2009 
Symbol 
ID3566956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2165074 
End bp2166036 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content54% 
IMG OID637680480 
Productpeptidase S49 
Protein accessionYP_285224 
Protein GI71907637 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones80 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAACC CCCAATCCCC CAATAACGAT TCCGCCTGGG AGCGCAAAAC CCTGGAGAAG 
CTGGTCTTCG CCGCACTTGA TGAGCAGCGA TCACGGCGGC GCTGGGGTAT TGCCTTCAAG
GCGCTTGGTT TTGTTTATCT CCTGGTTGTA CTAATTGCAG TCGTTGACTG GGGGGCTGGC
GCTGAGCATC AGGAGCGCCA CACTGCCATG GTCAATCTGA CAGGGGTTAT CGAGGCCAAG
GGAGAGGCCA ATGCCGAGAA TCTGGTGGCC GCTTTAAACA GCGCCTTTGA TGAAAAAAAT
GCGGTGGGCA TCATCTTGCG TATCAACAGC CCCGGAGGCA GTCCGGTTCA GGCTGGCATT
ATCAACGACG AGATTCGACG TCTCCGCGGA AAATACCCCG CCAAGCCGCT CTATGCCGTG
GTCGAGGATA TGTGTGCCTC TGGTGGTTAT TACGTTGCTG CAGCCGCGGA TAATATTTAC
GTTAATAAGG CGAGTATTGT TGGCTCCATC GGCGTGTTGA TGGATGGCTT CGGTTTTACG
GGCACCATGG ATAAAGCTGG TGTTGAGCGG CGCCTATTAA CTGCTGGGGA AAACAAGGGG
TTTCTTGATC CGTTTTCCCC GCAGGCGCCA CAACATAAGG CCCATGCCCA ACTGTTGCTC
AATGATATTC ACAAGCAATT CATTGATGTG GTGAAAGCTG GCCGTGGCAA GCGCCTAAAG
GAAACCCCGG AAATGTTCTC GGGCTTGATG TGGACGGGGG CTCAGAGTAT TCAGCTTGGC
CTCGCCGACG ACTTCGGTAG CGTCGACTCA GTGGCGCGTG ACATCATCAA GGCAGAAAAA
GTCCTTGATT ACTCGGTCAA GGACAATATT GCCGAACGCT TTGCCAAGCG CCTTGGGGCA
AGCACCTTCG CTGGTTTTTG GAAGGGTTTC TCGGAAAGCG CTCTTGGCGT GCGTTTGTAC
TGA
 
Protein sequence
MDNPQSPNND SAWERKTLEK LVFAALDEQR SRRRWGIAFK ALGFVYLLVV LIAVVDWGAG 
AEHQERHTAM VNLTGVIEAK GEANAENLVA ALNSAFDEKN AVGIILRINS PGGSPVQAGI
INDEIRRLRG KYPAKPLYAV VEDMCASGGY YVAAAADNIY VNKASIVGSI GVLMDGFGFT
GTMDKAGVER RLLTAGENKG FLDPFSPQAP QHKAHAQLLL NDIHKQFIDV VKAGRGKRLK
ETPEMFSGLM WTGAQSIQLG LADDFGSVDS VARDIIKAEK VLDYSVKDNI AERFAKRLGA
STFAGFWKGF SESALGVRLY