Gene Daro_0370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0370 
Symbol 
ID3569673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp402501 
End bp404165 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content63% 
IMG OID637678812 
Productphenylacetic acid degradation protein paaN2 
Protein accessionYP_283599 
Protein GI71906012 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02288] phenylacetic acid degradation protein paaN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value5.52975e-18 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000651019 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTCCCACC CCCTGCTCGA CAAGCACCGC GCCACGCTCG ATGGCGCCCT TAATGCCATC 
GCCACCCGTG CCTATTGGTC GGCCTATAAC GAGATGCCGA GTCCGAAGAC CTACGGTGAA
ACCGCGGCTG AAGATGGCAA GAAGGCTTTC GAAGCCCATC TTGGCAAGCA GTTCGATCTG
GGGCAGCCGG GGGCGACGGG TTGGGCCGGT GGCGAGCAAT CGCCCTACGG TATCGATCTG
AACGTTCAGT ATCCGGTCTG CAACTATGAA ACATTGATCG CCGCCGGCCA GCAGGCCATG
GGCGGCTGGC AGAAGATCGG GGCCGATGGC CGCACTGGCA TCTGTCTCGA AATCCTGGCT
CGCCTGAATC AGCAGAGCTT CGAGCTGGCC CATGCGGTCA TGATGACCAC CGGCCAGGGC
TGGATGATGG CTTTCCAGGC CGGCTCGCCG CACGCCCAGG ACCGTGGCCT CGAAGCGGTT
GCCTATGCCT ATCGCGAACA GAGCTTCGTG CCGGCCGAAA CGACCTGGGA CAAGCCGCAG
GGCAAGAACC CGCCGCTGGT CATGAAGAAG CATTTCGAAA TCGTCGGTCA CGGTGTCGGC
GTCGTTGTCG GCTGCGGCAC CTTCCCGACC TGGAATACCT ACCCCGGCCT GTTTGCCGCG
CTGTCCACCG GCAATGCCGT GATCGTCAAG CCGCATAGCA ATGCCATCCT GCCGGCCGCC
ATTACCGTGC GCACCATTCG CGCCGTGCTG GCCGAGAACG GCATTGACCC CAACCTGGTC
ACGCTGTGCG TGGCCGATCG TGCCGCCACG CAGAAGCTGG TCACCCACAA GGCCGTCAAG
TCCATCGACT TCACTGGCGG TAATGTCTTC GGCCAGTGGC TGATCGACAA CTGCCGCCAG
GCCCGCGTCT ATGCCGAGCT GGCCGGCGTC AACAACATCG TGATCGATTC GACCGATGCC
TACAAGCCGA TGCTGCGCAA CCTGGCTTTC ACGCTGTCGC TGTATTCCGG CCAGATGTGC
ACCACCTCGC AGGCCATCTT CGTGCCGGCC GCTGGCATTG AGACCGAAGA CGGCCACAAG
TCCTACGACG ACGTCTGTGC CGATCTGGCC CGTGCCGTGT CCGGCTTCCT GTCCAAGCCG
GAGGTCGCGC TGGCCGTGCT CGGTGCCATG CAATCGGCTG ATACCATCAA GCGTATCGAC
ATGGCTGACA GCGGCACGCT GGGCAAGGTG GTGCTGGCTT CCACCAAGCT GGACAACCCG
GAATTCCCGA AAGCTGCCGT CCGTACCCCG GTCCTGCTCG CCTGTGATGC GGCCGACGAG
CATGCCTATA TGGAAGAGCG TTTTGGCCCG ATCAGCTTCA TCGTCAAGGT GGCTGATACC
GCTGCCGCCA TCGCGCTGTC CGAGCGCATT GTGTCTACCC ACGGTGCGCT GACGGCCGGT
ATCTACTCGA CCAAGCCGGA AGTGATCGAC GCGATGACCG CCGCCACGAT GCGCGCCAAG
GTTGCCCTGT CGATCAACCT GACCAGTGGC GTGTTCGTCA ATCAGTCGGC CGCTTACTCC
GATTACCACG GTACCGGCGG CAACCCGGCT GCCAACGCGT CCTACGCCGA TGCCGCCTTT
GTCGCCAACC GCTTCGTAGT CGTCCAGCGC CGTTACCACA TCTAA
 
Protein sequence
MSHPLLDKHR ATLDGALNAI ATRAYWSAYN EMPSPKTYGE TAAEDGKKAF EAHLGKQFDL 
GQPGATGWAG GEQSPYGIDL NVQYPVCNYE TLIAAGQQAM GGWQKIGADG RTGICLEILA
RLNQQSFELA HAVMMTTGQG WMMAFQAGSP HAQDRGLEAV AYAYREQSFV PAETTWDKPQ
GKNPPLVMKK HFEIVGHGVG VVVGCGTFPT WNTYPGLFAA LSTGNAVIVK PHSNAILPAA
ITVRTIRAVL AENGIDPNLV TLCVADRAAT QKLVTHKAVK SIDFTGGNVF GQWLIDNCRQ
ARVYAELAGV NNIVIDSTDA YKPMLRNLAF TLSLYSGQMC TTSQAIFVPA AGIETEDGHK
SYDDVCADLA RAVSGFLSKP EVALAVLGAM QSADTIKRID MADSGTLGKV VLASTKLDNP
EFPKAAVRTP VLLACDAADE HAYMEERFGP ISFIVKVADT AAAIALSERI VSTHGALTAG
IYSTKPEVID AMTAATMRAK VALSINLTSG VFVNQSAAYS DYHGTGGNPA ANASYADAAF
VANRFVVVQR RYHI