Gene Dret_2022 details


Gene Information

Locus tag: Dret_2022
Symbol:
ID: 8419867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2320615
End bp: 2321892
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 61%
IMG OID: 645038610
Product: tRNA (guanine-N1)-methyltransferase
Protein accession: YP_003198884
Protein GI: 258406142
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0336] tRNA-(guanine-N1)-methyltransferase
TIGRFAM ID: [TIGR00088] tRNA (guanine-N1)-methyltransferase
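
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record. The strand is not given in the record, but the length arithmetic does not depend on it.

```python
# Values copied from the record above.
start_bp = 2320615
end_bp = 2321892

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon, which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1278 bp
print(protein_length, "aa")   # 425 aa
```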


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0252382
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0841142
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCATTTTT CCCTGGTGAC CTTGTTCCCG GAATTCTTTG CGTCACCGCT ACAATGCGGG 
TTGATGGGCA AAGCGTGTCA AGAATCGCTG GTGTCCTTCG CCACAATCAA TCCGCGGGAG
TTCACTGAGG ACCGCCACCG CAGTGTGGAC GACCGGCCGT ATGGCGGCGG GCCGGGCATG
GTCATGATGT GTGATCCCTT GCGCCGGGCC CTGCACTCCA TCCCCCGCCG GGGACGGACG
CTGCTCTTGT CTCCGAAGGG ACGGCCCTTC GACCAGACCC TGGCGCGGGA GTTGGCCGAA
GAAGAGGCCC TGACGCTCAT CTGCGGCCGG TATGAAGGGA TTGACGCCCG AATTGAATCC
CTGGAAGCCA TTGAGCCCGT TTCCGTTGGC GATTACGTCC TCAACGGCGG CGAAACCGGG
GCCTTATGTA TCATCGAGGC CGTGGCCCGG TTGCTGCCGA GCTTTATGGG CAAAACCGAT
TCGGCCACCG AAGAGTCCTT TTCCACGGGG CTTTTGGAGT ATCCCCACTA TACCCGGCCG
GAAACCTATG AGGGGCTCCA GGTCCCGCAG GTCCTGTTGT CCGGGGACCA TGCCCGTATC
GCCCGGTGGC GGCGGGAAAA GGCCCTGGAG ACCACGCTGG CATATCGACC GGAACTGCTC
CGGAACGCTT CCCTGTCCGC TGCGGACAAG CACTGCCTGC AAGCGTTGCC CCGGCAGTGG
CGGGGACGGA ATTTGTATGT CGGGCTTCTG CACCACCCGG TGCTGACCAA ATCCGGGGAA
GTGGGCACAA CTTCTTTGAC AAATCTTGAC ATTCACGATA TTGGACGCGT TTCCCGTTCC
TACGGCCTTG GGGGCTATTA TCTGGCCACT CCCCTGGCTG ACCAGCGGGA ATTGGCGCAT
CGGCTGCTGG ACCATTGGCG GCAGGGCGCT GGATGCCAAA CCAATGCTGA CAGGGCCAGT
GCATTGGCCG ACGTCCATGT GGTCACGGAT TTGGAAGCCA TTCGGGACAA TATTTTGCAA
CGCACCGGTC AACCCCCGCT CGTGCTGGCC ACGAGCGCTC AGGGGCAAGA GACGGTGCGC
CTCGAGGAGG TGCGGACGGC GCTGGAGCAT CGTCCGGTTC TCCTCGTCTT CGGGACAGGG
TCCGGGCTGG CGGACGAGGC CCTGGCCCAG ACAGATGGAT TGCTGCCCCC GCTGCGTTTT
TTGAGCACGT ATAATCATCT TTCCGTACGT TCCGCAGCAG CCATCTATAT CGATCGGGTT
TTGCAGGACT GGTATTGA
 
Protein sequence
MHFSLVTLFP EFFASPLQCG LMGKACQESL VSFATINPRE FTEDRHRSVD DRPYGGGPGM 
VMMCDPLRRA LHSIPRRGRT LLLSPKGRPF DQTLARELAE EEALTLICGR YEGIDARIES
LEAIEPVSVG DYVLNGGETG ALCIIEAVAR LLPSFMGKTD SATEESFSTG LLEYPHYTRP
ETYEGLQVPQ VLLSGDHARI ARWRREKALE TTLAYRPELL RNASLSAADK HCLQALPRQW
RGRNLYVGLL HHPVLTKSGE VGTTSLTNLD IHDIGRVSRS YGLGGYYLAT PLADQRELAH
RLLDHWRQGA GCQTNADRAS ALADVHVVTD LEAIRDNILQ RTGQPPLVLA TSAQGQETVR
LEEVRTALEH RPVLLVFGTG SGLADEALAQ TDGLLPPLRF LSTYNHLSVR SAAAIYIDRV
LQDWY
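
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: it assumes translation table 11 assigns the same amino acids as the standard genetic code and differs only in accepting alternative start codons (such as GTG here), which are read as methionine in the first position. The helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon in the first position encodes methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGCATTTTT CCCTGGTGAC CTTGTTCCCG GAATTCTTTG CGTCACCGCT ACAATGCGGG"
prefix = translate_cds(first_line)
print(prefix)  # MHFSLVTLFPEFFASPLQCG
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` in the same way allows the protein sequence and GC content fields to be compared against the record.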