Gene Daro_3434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3434 
Symbol 
ID3568330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3686041 
End bp3687066 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content62% 
IMG OID637681906 
Producttransposase IS116/IS110/IS902 
Protein accessionYP_286633 
Protein GI71909046 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTG TCACTGTAGG CATCGATCTC GCCAAGAATG TATTCGCTTT GCATGGTGTT 
GACCAGTATG GCAAAGCCAT TTTTATCAAG CCAAAAGTAG CGCGCGGCCA ATTGCTGGAA
ATGGTCGCCC AGCTACCGCC CTGCCTGATC GGCATGGAAG CCTGCTCCGG TGCTCACCAT
TGGGCGAGAG AGTTCAGCCG CGTCGGCCAC ACCGTGAAGT TGATGGCACC CAAGTTTGTG
GTGCCGTACC GGATGAGCGG CAAGCGCGGC AAGAACGACG CCGCCGATGC CGCGGCCATC
TGCGAGGCCG TCACGCGGCC GAACATGCGT TTCGTCCCGG TCAAGGATGT CGATCAGCAG
GCCATCCTCT GCCTGCACCG CACCCGGCAA GGTTTTGTTG AAGAACGCAC CGCGCTCTAC
AACCGCTTGC GCGGCCTGAT CAGCGAGTTC GGCATTGTGC TGCCACAGAA AGTCGAACGC
CTGCGCCGGG AAATCGGTGC CCACCTCGAA GCACTGCCCG GCTGGGCCAA CCGCTGTGTC
GGTGATCTGC TGGCTCACGC CGACCGACTG AATGAACACA TCGACGAGTA CGACAAAGCC
ATCGCCCTGG CCGCCAAACA AGACCAGCGG AGCCGGCAGC TCATGCAACT TCCTGGCATC
GGCCCGACCA CCGCCAGCGC CCTGGTTGCC AGCCTGGGCG GCGGCCACGA CTTCAAGAAT
GGTCGGCAGC TTGCCGCCTG GGTCGGGCTC GTTCCCGGCC AATACAGCAG TGGCGGCAAA
GCCCGGCTGG GCAGGATCAC CAAGGCCGGC GACGCCTACC TGCGCAGTCT GCTCGTCATG
GGCGGCCGAT CTGTTCTCGC CGGACTCGGT GACAAGCAAG ACCGCTTCAG TCGTTGGGCC
AGAAATCTAG TCGAGCGACG CGGGTACTGG AAAGCGGCGG TTGCCATCGC CGCCAAGAAC
TTGCGACTCG CCTGGGCGGT CATGCACTAC GGAGAGGAAT TTCGGCGTAT CGAAGATCTC
GCCTGA
 
Protein sequence
MNIVTVGIDL AKNVFALHGV DQYGKAIFIK PKVARGQLLE MVAQLPPCLI GMEACSGAHH 
WAREFSRVGH TVKLMAPKFV VPYRMSGKRG KNDAADAAAI CEAVTRPNMR FVPVKDVDQQ
AILCLHRTRQ GFVEERTALY NRLRGLISEF GIVLPQKVER LRREIGAHLE ALPGWANRCV
GDLLAHADRL NEHIDEYDKA IALAAKQDQR SRQLMQLPGI GPTTASALVA SLGGGHDFKN
GRQLAAWVGL VPGQYSSGGK ARLGRITKAG DAYLRSLLVM GGRSVLAGLG DKQDRFSRWA
RNLVERRGYW KAAVAIAAKN LRLAWAVMHY GEEFRRIEDL A