Gene Dret_0849 details

Gene Information

Locus tag: Dret_0849
Symbol:
ID: 8418668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1006388
End bp: 1007788
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 61%
IMG OID: 645037418
Product: RNA-directed DNA polymerase (Reverse transcriptase)
Protein accession: YP_003197718
Protein GI: 258404976
COG category: [L] Replication, recombination and repair
COG ID: [COG3344] Retron-type reverse transcriptase
TIGRFAM ID:
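
The start, end, and length fields above agree with each other if Start bp and End bp are read as 1-based inclusive positions. That reading is an assumption here, not something the record states. A minimal Python sketch of the arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1006388, 1007788)
print(length)                     # 1401
print(protein_length_aa(length))  # 466
```

Both results match the Gene Length and Protein Length fields of this record.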


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.141781
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.131758
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACG GAGCCGCAGA AGGACGGGCT GGGACGCCCG TCGCCACACC CGAGGGTAGG 
GGACGGAATC CCCGAGAGTA CGGTAGTGGT GCGTCAAGCG TCACGGCAAC GAAGGAGTAC
TCCCATCCGG AGCAGCAGAG TTTGATGGAA GCGGTGGTCG GACGCGAGAA CATGCTTGCG
GCCTACAAGC GTGTACGCGC CAACAAAGGC GTCCCCGGAG TCGACGGCAT GAGCGTCAAC
GACGTATGGG GATATTGCAC GCTCAACTGG GCCCGAATCA AAGAGGAGTT GCTGGACGGA
CGGTACGAGC CGCAGCCGGT GCTCGGGGTG GAAATCCCTA AACCCGGCGG CGGGGTGCGC
CAACTGGGCA TCCCGACGGC GCTGGACCGC CTGATACAGC AGGCGCTGCA CCAGGTGCTC
TCCCCCATTT TCAACCCTCA CTTCTCCGAA TCCAGCTACG GCTTCCGGCC CGGTCGAAGT
GCGCATCAGG CCGTGCTCAA GGCACGGGAG CATGCTGCCG CCGGCAAACG GTGGGTCGTG
GACATGGACC TGGAGAAGTT CTTCGACCGC GTGAACCACG ACGTGCTCAT GGCGCGCGTG
GCCCGCAAGG TGAAGGACAA GCGGGTGCTC GCCCTCATCC GGCGTTACCT GCAAGCGGGG
CTGATGCAGG GGGGAATTGC ATCGAAACGA AAGGAGGGCA CGCCGCAAGG CGGCCCCCTC
TCGCCGCTCT TGTCCAACAT CCTTCTGGAT GACCTGGACA AGGAGCTTGA ACGCAGAGGC
CACGCGTTCT GCCGATACGC CGACGACTGC AATATCTACG TGCAGACAAA ACGGTCCGGC
GAACGCGCAA TGGCCTCGAT CACCCGGTTT CTGACAGAGC GGTTGAAGTT GAGGGTCAAC
GCGGATAAGA GCGCGGTTGA CCGGCCATGG AAAAGGAAAT TCCTTGGGTA CTCGATGACC
TGGCATACGC AGCCGCGGCT CAAGGTTGCG CCCAGTGTGG TCAAACGCCT GAAACAGGCG
GTACGGGAGG AATTTCGACG TGGGCGGGGA CGGTCGCTCA AGAAGACGAT AGACACCCTT
GCGCCGAAAC TGCGAGGCTG GATGAACTAC TTCAAGCTGG CGGAGGTAAA GGGAGTTTTT
GAAGAACTGG ACATGTGGAT TCGCCGCAGA TTGCGCAATA TCCTGTGGCG GCATTGGAAA
CGACCCTACG CCCGAGCAAG GAACCTGATT CGCCGGGGAC TGACTGAAGA GCGCGCCTGG
AAATCCGCCA TCAACGGCCG CGGGCCATGG TGGAACTCCG GCGCATCGCA TATGAACCAG
GCATTCCCCA AGAAATACTT TGATTCACTT GGACTCGTGT CACTGCAAGA TCAACTTCGC
AAAGCTCAAA GTGTCAGGTG A
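
The gene sequence is displayed in blocks of 10 bases, 60 bases per line. As a rough sketch, assuming the text is copied exactly as shown, the following Python strips the spacing, computes the GC fraction, and checks for one of the translation table 11 stop codons (TAA, TAG, TGA) at the end. The helper names are illustrative and not part of any database API.

```python
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS_TABLE_11


# Only the last displayed line of the gene sequence is used here.
last_line = "AAAGCTCAAA GTGTCAGGTG A"
tail = clean_sequence(last_line)
print(len(tail))             # 21
print(ends_with_stop(tail))  # True
```

Passing all 24 displayed lines to `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` on that string can be compared with the GC content field.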
 
Protein sequence
MTNGAAEGRA GTPVATPEGR GRNPREYGSG ASSVTATKEY SHPEQQSLME AVVGRENMLA 
AYKRVRANKG VPGVDGMSVN DVWGYCTLNW ARIKEELLDG RYEPQPVLGV EIPKPGGGVR
QLGIPTALDR LIQQALHQVL SPIFNPHFSE SSYGFRPGRS AHQAVLKARE HAAAGKRWVV
DMDLEKFFDR VNHDVLMARV ARKVKDKRVL ALIRRYLQAG LMQGGIASKR KEGTPQGGPL
SPLLSNILLD DLDKELERRG HAFCRYADDC NIYVQTKRSG ERAMASITRF LTERLKLRVN
ADKSAVDRPW KRKFLGYSMT WHTQPRLKVA PSVVKRLKQA VREEFRRGRG RSLKKTIDTL
APKLRGWMNY FKLAEVKGVF EELDMWIRRR LRNILWRHWK RPYARARNLI RRGLTEERAW
KSAINGRGPW WNSGASHMNQ AFPKKYFDSL GLVSLQDQLR KAQSVR