Gene Dret_0152 details

Gene Information

Locus tag: Dret_0152
Symbol:
ID: 8417956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 195505
End bp: 196905
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 60%
IMG OID: 645036717
Product: RNA-directed DNA polymerase (Reverse transcriptase)
Protein accession: YP_003197032
Protein GI: 258404290
COG category: [L] Replication, recombination and repair
COG ID: [COG3344] Retron-type reverse transcriptase
TIGRFAM ID:
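
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends with a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(195505, 196905)
print(length)                  # 1401
print(protein_length(length))  # 466
```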


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.765246
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.510076
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACG GAGCCGCAGA AGGACGGGCT GGGACGCCCG TCGCCACACC CGAGGGTAGG 
GGACGGAATC CCCGAGAGTA CGGTAGTGGT GCGTCAAGCG TCACGGCAAC GAAGGAGTAC
TCCCATCCGG AGCAGCAGAG TTTGATGGAA GCGGTGGTCG GACGCGAGAA CATGCTTGCG
GCCTACAAGC GTGTACGCGC CAACAAAGGC GTCCCCGGAG TCGACGGCAT GAGCGTCAAC
GACGTATGGG GATATTGCAC GCTCAACTGG GCCCGAATCA AAGAGGAGTT GCTGGACGGA
CGGTACGAGC CGCAGCCGGT GCTCGGGGTG GAAATCCCTA AACCCGGCGG CGGGGTGCGC
CAACTGGGCA TCCCGACGGC GCTGGACCGC CTGATACAGC AGGCGCTGCA CCAGGTGCTC
TCCCCCATTT TCAACCCTCA CTTCTCCGAA TCCAGCTACG GCTTCCGGCC CGGTCGAAGT
GCGCATCAGG CCGTGCTCAA GGCACGGGAG CATGCTGCCG CCGGCAAACG GTGGGTCGTG
GACATGGACC TGGAGAAGTT CTTCGACCGC GTGAACCACG ACGTGCTCAT GGCGCGCGTG
GCCCGCAAGG TGAAGGACAA GCGGGTGCTC GTCCTCATCC GGCGTTACCT GCAAGCGGGG
CTGATGCAGG GGGGAATTGC ATCGAAACGA AAGGAGGGCA CGCCGCAAGG CGGCCCCCTC
TCGCCGCTCT TGTCCAACAT CCTTCTGGAT GACCTGGACA AGGAGCTTGA ACGCAGAGGC
CACGCGTTCT GCCGATACGC CGACGACTGC AATATCTACG TGCAAACAAA ACGGTCCGGC
GAACGCGCAA TGGCCTCGAT CACCCGGTTT CTGACAGAGC GGTTGAAGTT GAGGGTCAAC
GCGGATAAGA GCGCGGTTGA CCGGCCATGG AAAAGGAAAT TCCTTGGGTA CTCGATGACC
TGGCATACGC AGCCGCGGCT CAAGGTTGCG CCCAGTGTGG TCAAACGCCT GAAACAGGCG
GTACGGGAGG AATTTCGACG TGGGCGGGGA CGGTCGCTCA AGAAGACGAT AGACACCCTT
GCGCCGAAAC TGCGAGGCTG GATGAACTAC TTCAAGCTGG CGGAGGTAAA GGGAGTTTTT
GAAGAACTGG ACATGTGGAT TCGCCGCAGA TTGCGCAATA TCCTGTGGCG GCATTGGAAA
CGACCCTACG CCCGAGCAAG GAACCTGATT CGCCGGGGAC TGACTGAAGA GCGCGCCTGG
AAATCCGCCA TCAACGGCCG CGGGCCATGG TGGAACTCCG GCGCATCGCA TATGAACCAG
GCATTCCCCA AGAAATACTT TGATTCACTT GGACTCGTGT CACTGCAAGA TCAACTTCGC
AAAGCTCAAA GTGTCAGGTG A
 
Protein sequence
MTNGAAEGRA GTPVATPEGR GRNPREYGSG ASSVTATKEY SHPEQQSLME AVVGRENMLA 
AYKRVRANKG VPGVDGMSVN DVWGYCTLNW ARIKEELLDG RYEPQPVLGV EIPKPGGGVR
QLGIPTALDR LIQQALHQVL SPIFNPHFSE SSYGFRPGRS AHQAVLKARE HAAAGKRWVV
DMDLEKFFDR VNHDVLMARV ARKVKDKRVL VLIRRYLQAG LMQGGIASKR KEGTPQGGPL
SPLLSNILLD DLDKELERRG HAFCRYADDC NIYVQTKRSG ERAMASITRF LTERLKLRVN
ADKSAVDRPW KRKFLGYSMT WHTQPRLKVA PSVVKRLKQA VREEFRRGRG RSLKKTIDTL
APKLRGWMNY FKLAEVKGVF EELDMWIRRR LRNILWRHWK RPYARARNLI RRGLTEERAW
KSAINGRGPW WNSGASHMNQ AFPKKYFDSL GLVSLQDQLR KAQSVR
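
The gene and protein sequences above can be related to each other, and to the reported GC content, with a short script. This is a minimal sketch, not the method the database uses: the helper names are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs mainly in the start codons it permits).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG) are not special-cased here; this
    gene starts with ATG, so that simplification does not matter for it.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown above.
first_line = clean(
    "ATGACGAACG GAGCCGCAGA AGGACGGGCT GGGACGCCCG TCGCCACACC CGAGGGTAGG"
)
print(translate(first_line))  # MTNGAAEGRAGTPVATPEGR
print(f"{gc_content(first_line):.0%}")
```

Running `translate` on the full cleaned gene sequence should reproduce the protein sequence listed above, and `gc_content` on the same input gives a value to compare against the GC content field.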