Gene Mpal_2096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2096 
Symbol 
ID7271573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2222114 
End bp2223382 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content62% 
IMG OID643570707 
ProducttRNA pseudouridine synthase D TruD 
Protein accessionYP_002467117 
Protein GI219852685 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAT CCGACTATCC GTTGGAGCAG GCGCTCGGGA TGTGCTGGTA TGCGAGCAGC 
ACCGAGGGGG TCGGGGGGCG ACTCCGGACC ACCGCTGAGG ATTTCGTAGT CGAGGAGGTA
CCGGACCTCC CGGGTGGGGA GGGGCCATTC CTGCTCTGCA CCCTGACCAA GCGGAACTGG
GAACTGCAGC GGGCGGTCAA GGAGATCGCC TCCAGCCTTG GTATCAGCCA CAAACGGATC
GGGTGGGCCG GGACCAAGGA TAAACATGCG GTCACCACCC AGACGATCTC GCTGTATGGG
GTCACCCCTG AGGAGATAGC TTCAGTCTCG CTCGGGGATC TCTCACTGAC CGTGATCGGT
TCGGCCAACG GGGGGCTCTC GCTCGGTTCA CTCATCGAGA ACCGGTTCAC GGTGGTGATC
CGGGACTGCA TCGCGGAGGA CCTCGATCAG AGAGTGCAGG AGGTGACACA GGTCTGCAGT
CAGGGGATCC CGAACTATTA CGGGGTCCAG CGGTTCGGGG TGATCCGTCC GATCACCCAT
ATCGCCGGCG AGTACCTGAT CCAGGGAGAT TACGCCGGTG CGGTCGGAGC CTATGTCGGA
AAGAGTTCGC CCGACGAGGA TCCGGTTGTT GCGGCGGCAC GGGACGAGTA CCTGGAGACC
CGGGATCCTC AGTTGGCCCT CCACCACCTG CCGGTTCACC TCCGGTACGA ACGGGCCATG
CTCCACCACC TCACCAACCA TCCGGATGAC TACCTGGGTG CACTGCAGGC CCTTCCCCCA
AAACTCCTCT CGATGCTGGT CAGTGCATTC CAGTCCTACC TCTTCAACAT GGTCCTCTCA
GCCCGCCTTG AGGCAGAGTT AACGCTGATG GATCCGATCC CGGGCGATCG GCTCCACTTC
ACCAATGGCA GGATCGATAT CGCCACTGAG GCGAACCTGG CCACGGCGAC GATGCATATC
CGGCGGGGCC GGTGCAGGAT CGGCCTCTTT GTGCCCGGCG CCGAACCGTT CACCCCGCAG
GGGAGGATGG ACCAGATGAT GGAGGATCTG TTAAAAGAGC ACCAGATCGA TCGGGTATCC
TTCAAACAGG CTGAAACTGT CGTTAAAACA AGGTTCAGCG GCGCATCCCG GCCGATCGCC
CTGACCGCCA CCGTCGACGC AGTGGTGGAG GAGAACGACC TGACCCTCAG GTTCCCCCTT
CCCCCCGGCC ACTATGCGAC GACGGTCTGC AGGGAGTACA TGAAGAGTGA CCCCCTGATG
ATGATCTGA
 
Protein sequence
MKRSDYPLEQ ALGMCWYASS TEGVGGRLRT TAEDFVVEEV PDLPGGEGPF LLCTLTKRNW 
ELQRAVKEIA SSLGISHKRI GWAGTKDKHA VTTQTISLYG VTPEEIASVS LGDLSLTVIG
SANGGLSLGS LIENRFTVVI RDCIAEDLDQ RVQEVTQVCS QGIPNYYGVQ RFGVIRPITH
IAGEYLIQGD YAGAVGAYVG KSSPDEDPVV AAARDEYLET RDPQLALHHL PVHLRYERAM
LHHLTNHPDD YLGALQALPP KLLSMLVSAF QSYLFNMVLS ARLEAELTLM DPIPGDRLHF
TNGRIDIATE ANLATATMHI RRGRCRIGLF VPGAEPFTPQ GRMDQMMEDL LKEHQIDRVS
FKQAETVVKT RFSGASRPIA LTATVDAVVE ENDLTLRFPL PPGHYATTVC REYMKSDPLM
MI