Gene Dret_2243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2243 
Symbol 
ID8420101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2547748 
End bp2548827 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content61% 
IMG OID645038844 
Productribosomal RNA large subunit methyltransferase N 
Protein accessionYP_003199105 
Protein GI258406363 
COG category[R] General function prediction only 
COG ID[COG0820] Predicted Fe-S-cluster redox enzyme 
TIGRFAM ID[TIGR00048] radical SAM enzyme, Cfr family 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.726084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACCGA AAACCATACA CGACATTTCC TTCGACGACC TCGCCGCCTG GCTCACTGAA 
CAGGGGCAGC CACGTTTCAG GGCCGAACAA ATCTGGCAAT GGCTGTGGAT CAAAGGCGCC
ACAAGCTTTG AGGATATGAC CAATGTCTCG AAATCCCTGC GAAGCGCCTT GTCGCAGGTC
TTCCCCATAG CGCTGCCGAC TGTAGCCGAA GTCCACACCA GCGCCGACGG GACCAGAAAA
TTCCTGCTCA ACCTCCACGA CGGCCATGTG CTGGAGACGG TGCTCATTCC CGGCGGGGAA
CACTTCACCC AATGTCTCTC GACACAGGTC GGCTGCAATC TGGGGTGCAC CTTTTGCAGC
ACCGGACAGA TGGGATTGAC TCGGAATCTG AGCGCGGCAG AGATCGCCGG ACAGGTCATC
GTGGCCCGCA ATCACCTCTG GCAGACCGGT ACGGGTATGC GGTTGCGCAA CCTCGTCTTT
ATGGGCATGG GCGAGCCCCT GCTGAACTGG GAAAACGTTG ACAACGCCCT GGATCGGCTC
ATCCATGCCT CGGCCATGAA TTTTTCTCCG CGCCGGGTCA CGGTTTCCAC CGTAGGCGTG
CCGGGAACCC TGGACGCCCT GGGCCACAGC CACAAGGCGT CCCTGGCCGT CTCCCTGCAC
GCCCCGAACC AGGAACTGCG CGAGAAGATC ATGCCCAGGG CCGCTCGAAT GCTTCCGCTA
CCGGATCTTC TCGCACGGCT TCGCAGCTAC CCCATGGCCC CGCGGCAGCG GGTCACCATC
GAATACGTTC TTTTGGGCGG GGTGAACGAC AGCCTCGACC AGGCCCGCCA ACTCGTGCGC
TGCCTCAATG GCATCCGTTG TAAGGTCAAC CTCATCGCGT TCAACCCCTG TCCGGGACTC
CCCTATTCGG CCCCCGAGAC CGAGCAGGTC CTTGCGTTCG AGACGTTGCT CCGCGACAAG
GGGTTGACGG CGACCTTGCG CAAAAGCAAA GGGCAGGATA TTTCCGCCGC GTGCGGCCAA
CTCAAGACCC GCCGCACTGC CGGCGCAAAC AATTGCGGCC CCACTGTCAC CAACACATAA
 
Protein sequence
MRPKTIHDIS FDDLAAWLTE QGQPRFRAEQ IWQWLWIKGA TSFEDMTNVS KSLRSALSQV 
FPIALPTVAE VHTSADGTRK FLLNLHDGHV LETVLIPGGE HFTQCLSTQV GCNLGCTFCS
TGQMGLTRNL SAAEIAGQVI VARNHLWQTG TGMRLRNLVF MGMGEPLLNW ENVDNALDRL
IHASAMNFSP RRVTVSTVGV PGTLDALGHS HKASLAVSLH APNQELREKI MPRAARMLPL
PDLLARLRSY PMAPRQRVTI EYVLLGGVND SLDQARQLVR CLNGIRCKVN LIAFNPCPGL
PYSAPETEQV LAFETLLRDK GLTATLRKSK GQDISAACGQ LKTRRTAGAN NCGPTVTNT