Gene Daro_3687 details


Gene Information

Locus tag: Daro_3687
Symbol: prfA
ID: 3566799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3962020
End bp: 3963102
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 62%
IMG OID: 637682160
Product: peptide chain release factor 1
Protein accession: YP_286886
Protein GI: 71909299
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0216] Protein chain release factor A
TIGRFAM ID: [TIGR00019] peptide chain release factor 1
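
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names span_length and expected_protein_length are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3962020
END_BP = 3963102
GENE_LENGTH_BP = 1083
PROTEIN_LENGTH_AA = 360


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks should agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 3962020 to 3963102 covers 1083 bp, which is 361 codons, or 360 amino acids once the terminal stop codon is excluded.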


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCGA GCATCCGCCA AAAACTGGAG TTGCTGGTTG ATCGTCTCGA TGAAATCGAC 
CGCATGCTGT CGGCACCGAG TACGGCCAGC GACATGGATC AGTTCCGCAA GCTGTCGCGT
GAGCGGGCCG AGGTTGAGCC GGTAGTCGTC CAGTTCAATG CCTTCCGCCA GGCCGAAAAT
GATCTGGCCG AAGCCGAGGC CATGCTGTCC GACCCGGACA TGCGCGAGTT TGCCGAGGAG
GAAATGGCTG CAGCCAAGGC GCGCCTGCCG GAACTCGAAC TCGAGCTGCA AAAACTGTTG
TTGCCGAAAG ACCCCAACGA CGAGCGCAGC GTGCTGCTCG AAATCCGTGC CGGCACAGGT
GGCGACGAGT CGGCGCTGTT CGCTGGCAGC CTCTTCCGGA TGTATTCACG CTTTGCCGAG
CGCCAGCGCT GGCAGGTCGA AGTGATGTCG GCCAGTGAAT CGGAACTCGG CGGCTATCGT
GAAATCATCT GCCGGATTGC CGGCAACGGC GCCTATTCTC GGCTCAAGTT CGAATCGGGT
GGCCATCGCG TCCAGCGCGT GCCGGAAACC GAGACGCAGG GCCGCATTCA TACCTCGGCG
TGTACGGTGG CTGTGATGCC GGAAGTCGAC GAGGTCGAGG ACGTCAATCT CAACCCGGCC
GACCTGCGCA TCGACACCTT CCGTGCCTCC GGTGCTGGTG GCCAGCACAT CAACAAGACT
GATTCGGCCG TGCGCATCAC CCACCTGCCG ACCGGCATCG TTGCCGAATG TCAGGATGGC
CGTTCGCAGC ACGCCAACAA GGCGTCGGCG CTGAAGGTGC TGGCGGCGCG GATCAAGGAT
GTCCAGGTGC GCGCCCAGCA GGCCCATATC TCCAGCACGC GGAAGAGCCT GATTGGTTCT
GGCGACCGCT CCGAGCGCAT TCGCACCTAC AATTTCCCGC AAGGCCGGAT CACCGACCAC
CGGATCAACC TGACGCTGTA CAAGATCGCT GCGATCATGG ATGGCGACAT GGATGAACTG
CTTGGCGCCC TGGCAGCCGA ACACCAAGCC GATCTGCTGG CCGAACTGGC AGAGCAGAAC
TGA
 
Protein sequence
MKSSIRQKLE LLVDRLDEID RMLSAPSTAS DMDQFRKLSR ERAEVEPVVV QFNAFRQAEN 
DLAEAEAMLS DPDMREFAEE EMAAAKARLP ELELELQKLL LPKDPNDERS VLLEIRAGTG
GDESALFAGS LFRMYSRFAE RQRWQVEVMS ASESELGGYR EIICRIAGNG AYSRLKFESG
GHRVQRVPET ETQGRIHTSA CTVAVMPEVD EVEDVNLNPA DLRIDTFRAS GAGGQHINKT
DSAVRITHLP TGIVAECQDG RSQHANKASA LKVLAARIKD VQVRAQQAHI SSTRKSLIGS
GDRSERIRTY NFPQGRITDH RINLTLYKIA AIMDGDMDEL LGALAAEHQA DLLAELAEQN