Gene Daro_2531 details

Gene Information

Locus tag: Daro_2531
Symbol:
ID: 3567565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 2728302
End bp: 2729501
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 62%
IMG OID: 637680998
Product: Phage integrase, N-terminal SAM-like
Protein accession: YP_285734
Protein GI: 71908147
COG category:
COG ID:
TIGRFAM ID:
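
The coordinates and lengths in the Gene Information table are mutually consistent, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon excluded from the protein length. The function name `expected_lengths` is illustrative and not part of the record.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int, int]:
    """Derive gene length, protein length, and leftover bases from coordinates.

    Assumes 1-based inclusive coordinates and one trailing stop codon
    that is not counted as an amino acid.
    """
    gene_length = end_bp - start_bp + 1          # inclusive span in bp
    codons, remainder = divmod(gene_length, 3)   # whole codons, leftover bases
    protein_length = codons - 1                  # drop the stop codon
    return gene_length, protein_length, remainder


# Values copied from the Gene Information table.
gene_length, protein_length, remainder = expected_lengths(2728302, 2729501)
print(gene_length, protein_length, remainder)
```

With the values from the table this gives a 1200 bp gene, 400 codons with no leftover bases, and a 399 aa protein, matching the reported Gene Length and Protein Length.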


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.335158
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTACTT TTAACGCTTC CCATTCCCGC CGCCGGGCTC AAGGTTCCAT TGGCTCGGGC 
CGTGCTGCCC AACACCAACC AGCACGCCGT CAGGAGTGGG CTTCACTCAC GCCCACCGAG
ATTCTGAGCC GCTACCGGCC GGGCAAGGCC GATCCGCTAC AGGTGCTCGA TGTCCTGCTG
GAACTGTTCA ACACGCAGCA CACCGCGCTC GACAAGACGG TCTCGCACAA GACACGACAG
GAACGGGCCG ACTTCCTGCG CCGTTTCTTC CGAGACCTCA AGGTGAAGGC CAATTTCGCC
ACCGTACCCG ATCCCCGTAA TCTCGGTGAC CGGCACATCC GGGCCATCGT CGCCGTCTGG
CGCGAAGAGA GACTCGCTCC GGCGACAATC CAGACTTATC TGAGCTTCCT ACGTGGACTG
GCGCTGTGGC TGAGAAAACC TGGCTTCATC CGGTCGCCGG CCTACTACGG CCTCTCGCCC
AATGAATATC AGCGCGACGA AAACGCTCAG CGCGACAAGA GCTGGACGGC GGCAAGCATC
GATATCGACG CCGTGGTTGA ACAGGTCATT GCGTTCGACC GCTACGTCGG CGCTTCATTG
GGATTGATCC GGACGTTCGG CCTGCGCCGC AAGGAATCAG TGATGATCCG CCCGCATCTG
TGCGTGGTGC CTTTCGAAGC CACAGGCCTG CCACCCGGGG AAAGGCAGGC CGACAACTAC
GTGCGAATCA AGGAAGGCGC GAAGGGTGGA AGGCGGCGCT TTGTGCCACT GGATTCAGAG
CAACGCATCG CAGCTTTAGA ATTCGCCCAG GCAGTCGTTC CGGGAGAGGA GGCGCATCTG
GGCGATCCTC GCCACAGTCT TAAGCACAAC CTGCGGCGCT TCGACTATGT GATGGCGAAG
TTCGGCATCA CGGCGGACGG CCTGGGTGCC ACGGCGCACG GGCTGCGCCA TGAAGCGATG
ATCGACCACT ACACGACCAA GGCTGGCGGG ACGCCACCGG TCCGAGGCGG CGGTGATGTG
CCTCCCGAGG AGGACGCGGC GGCAAGACTC TCGGCCGCCC GGCTGGCCGG ACACAATCGC
GCTAGGGCGG CAGGCGCTTA TTTAGGTGGG CTACTGCCAC GACAAGCGGC GTTGAAGGAC
CAGGCAAAAG ACAGCCTGCC CTGCACATCC GATGCGGAAA AATCACCCGG AGGGCATTGA
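
The GC content field in the Gene Information section can be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length; the helper name `gc_percent` is hypothetical. It strips the spaces and line breaks used in the 10-base display blocks before counting.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base display blocks and line breaks) is ignored,
    and the sequence is treated case-insensitively.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input only; paste the full gene sequence to check the reported value.
print(gc_percent("ATGC GGCC"))
```

Passing the complete gene sequence as one string should give a value that can be compared against the rounded figure in the record.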
 
Protein sequence
MSTFNASHSR RRAQGSIGSG RAAQHQPARR QEWASLTPTE ILSRYRPGKA DPLQVLDVLL 
ELFNTQHTAL DKTVSHKTRQ ERADFLRRFF RDLKVKANFA TVPDPRNLGD RHIRAIVAVW
REERLAPATI QTYLSFLRGL ALWLRKPGFI RSPAYYGLSP NEYQRDENAQ RDKSWTAASI
DIDAVVEQVI AFDRYVGASL GLIRTFGLRR KESVMIRPHL CVVPFEATGL PPGERQADNY
VRIKEGAKGG RRRFVPLDSE QRIAALEFAQ AVVPGEEAHL GDPRHSLKHN LRRFDYVMAK
FGITADGLGA TAHGLRHEAM IDHYTTKAGG TPPVRGGGDV PPEEDAAARL SAARLAGHNR
ARAAGAYLGG LLPRQAALKD QAKDSLPCTS DAEKSPGGH
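
The protein sequence can be reproduced from the gene sequence with translation table 11 (the bacterial, archaeal and plant plastid code), whose amino acid assignments are the same as those of the standard code. The sketch below is an illustration, not the pipeline that produced this record: the names `CODON_TABLE` and `translate` are assumptions, and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()  # drop display spacing
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 60 bases of the gene sequence in this record.
first_codons = "ATGTCTACTT TTAACGCTTC CCATTCCCGC"
last_line = "CAGGCAAAAG ACAGCCTGCC CTGCACATCC GATGCGGAAA AATCACCCGG AGGGCATTGA"

print(translate(first_codons))  # MSTFNASHSR
print(translate(last_line))     # QAKDSLPCTSDAEKSPGGH
```

Both excerpts are in frame because they start at positions 1 and 1141 of the 1200 bp gene, and their translations match the start and end of the protein sequence listed above.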