Gene Daro_2250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2250 
Symbol 
ID3566437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2433628 
End bp2434917 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content57% 
IMG OID637680717 
ProductRNA-directed DNA polymerase (Reverse transcriptase) 
Protein accessionYP_285457 
Protein GI71907870 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value0.461169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.551741 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTTAC ACGCGAAAGC GAAGTCGGAA TCTGGATATC GTTTCTACGC GCTGTACGAC 
AAGATTTATC GCACGGATAT TTTGGCGCAT GCCTATGCCC AGTGCCGCTC CAATAAGGGC
GCGCCGGGTG TGGATCGTCA GGATTTCGAG GATGTCGAGG CGTATGGTGT GCGGCGATGG
CTGGAGGAAC TGGCGCTTGC GCTCAAAGAG GAGAGCTACC GACCGGATCC AATTCGGAGA
GTGTTTATCC CGAAAGCCAA TGGCAAGTTA AGGCCTCTGG GCATTTCAAC GCTGCATGAT
CGAGTGTGTA TGACAGCAGC CATGCTGGTA CTCGAACCTA TCTTTGAAGC TGATCTTCCT
GATGAACAGT ATGCCTACCG GCCGGGCCGC AATGCCCAGC AGGCGGCAGA AGAAGTGAAG
AACCGGCTCT ACCTTGGACA AACGGACGTT GTCGATGCCG ACCTGTCGGA CTACTTCGGC
AGCATTCCAC ATTCTGAACT GATGAAGTCG CTGGCGCGAC GCATCGTGGA TCGGCGTGTG
CTACATCTTA TCAAGATGTG GCTGGAGTGC GCGGTTGAGG AAACCGATCA GCGAGGACGG
AAGAAACGGA CGACCGAGGC CAAAGATCAG GGGCGAGGTA TCCCGCAAGG CTCACCGATC
TCGCCGCTGC TATCGAATTT GTATATGCGG CGGTTTGTGC TGGCGTGGAA GAAACTGGGG
TTGGAGCGAA GCCTTGGCAG TCGCATCGTC ACCTATGCCG ACGACCTCGT GATCCTGTGC
AAGTGTGGCA AGGCGGAAGA AGCCTTGCAA TGGATGCGCA CGATCATGGG GAAACTGAAG
CTCACGGTGA ACGAGGAAAA GACACGAATC TGTCAGGTAC CGGCAGGGAC GTTCGACTTT
CTGGGTTACT CGTTTGGACG GCGATATGTG CCGCGCACAG GGAAGCCGCA GATCGCTCTG
TGGCCGTCGA AGAAAAGCAT TCGACGCATG GTGGAGAAAA TCCATGACAT GACTGAGCGG
CAAACGGGTT GGCAAGAGAC CACGGAGCTG GTGGGCAAGT TGAATCGGAC GCTACGCGGC
TGGGCGAACT ACTTCAGTGT AGGGAACGTC AGTCGCGCCT ATCGTGCGCT CGACAGTTAC
ACGGCAACGC GGTTGCGTCG GTGGTTGCGC TACAAGTACA AGCTCAGACA TTGCAAGGGC
GGGAGCTATC CACTCTCGCA CCTCTACGGG TACTTTGGTC TCGTACGTCT GGGCGCACGT
GGGCGCAGCG AGGCGTGGGT GAAGGCGTGA
 
Protein sequence
MALHAKAKSE SGYRFYALYD KIYRTDILAH AYAQCRSNKG APGVDRQDFE DVEAYGVRRW 
LEELALALKE ESYRPDPIRR VFIPKANGKL RPLGISTLHD RVCMTAAMLV LEPIFEADLP
DEQYAYRPGR NAQQAAEEVK NRLYLGQTDV VDADLSDYFG SIPHSELMKS LARRIVDRRV
LHLIKMWLEC AVEETDQRGR KKRTTEAKDQ GRGIPQGSPI SPLLSNLYMR RFVLAWKKLG
LERSLGSRIV TYADDLVILC KCGKAEEALQ WMRTIMGKLK LTVNEEKTRI CQVPAGTFDF
LGYSFGRRYV PRTGKPQIAL WPSKKSIRRM VEKIHDMTER QTGWQETTEL VGKLNRTLRG
WANYFSVGNV SRAYRALDSY TATRLRRWLR YKYKLRHCKG GSYPLSHLYG YFGLVRLGAR
GRSEAWVKA