Gene Mthe_0022 details


Gene Information

Locus tag: Mthe_0022
Symbol:
ID: 4462260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 19073
End bp: 20974
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 58%
IMG OID: 639699029
Product: glutamyl-tRNA(Gln) amidotransferase subunit E
Protein accession: YP_842465
Protein GI: 116753347
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG2511] Archaeal Glu-tRNAGln amidotransferase subunit E (contains GAD domain)
TIGRFAM ID: [TIGR00134] glutamyl-tRNA(Gln) amidotransferase, subunit E
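
The coordinate and length fields above are related by simple arithmetic. The sketch below is an illustrative consistency check, not part of the record itself: it assumes 1-based inclusive coordinates and a single terminal stop codon, and the variable names are hypothetical.

```python
# Values copied from the Start bp / End bp fields of this record.
start_bp = 19073
end_bp = 20974

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in exactly one stop codon that is not
# counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the reported gene length of 1902 bp and protein length of 633 aa.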


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.737796
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTATC CCGATATGGA CTATCGTGCG CTTGGTCTTG TCTGCGGAAT AGAGATCCAT 
CAGCAGCTTG ATACCAGATG CAAGCTCTTC TGCAGCTGCC CGACTGTGCA CAGGGAGGTG
GAGGACTCGA ACTTCGAGTT CTTCAGGTAC CTCAGGCCTG CGAGAAGCGA GCTCGGGGAG
ATCGACAGGG CTGCTCTTGA GGAGACACTT GTATCGAGGA GGTTCGTATA CAAATCCTAC
AACACAACAT GTCTTGTGGA GGCAGATGAG GAGCCTCCAA GGGAGCTCAA CCGGGAGGCT
CTTGAGATCG CCCTCGTGAT CGCACGTCTT CTAAAGATGC GCATCGTCGA TGAGATCCAC
ACTATGCGCA AGACCGTGAT AGATGGCTCG AACACATCTG GATTTCAGCG CACCGCATTC
ATCGCGTCCA GCGGATCCAT AGATACATCA TGCGGGCCTG TCGGCATAGG AATACTCTGC
CTGGAGGAGG AGGCCGCCAG GATAGTCGAG GATCGCGGGG ATGAGGTGGT GTATTCTCTC
GACAGGCTCG GCATTCCCCT TGTCGAGATC GGGACCGCTC CTGACATAGT TTCTCCGCAG
CATGCCAGAG AGGTGGCTCA GCACATCGGC ATGATACTCA GATCGACAGG CCGGGTCAAG
CGCGGCCTCG GCACGATACG CCAGGACGTG AACGTATCGA TCGCCGGAGG CGCTAGGGTC
GAGATCAAGG GTGTTCAGGA GCTGAACCTG ATAGAGAGGA TCGTCGAGCT TGAGGTCATT
CGCCAGGTAA GGCTGCTCGA GATCAGAGAT GAGCTCCGGA GAAGAAACGC CCGTGTGTGC
GGAGATGTCG TCGATGCGAC CGGGCTGTTC TCGAACACGC GCTCCAAGGT CGTCGCAAAG
GCCCTCAAGA GCGGCGGCGC GGTTCTTGCC ACGAAGCTTG CTGGTTTCAA AGGGATTATT
GGAAAAGAGG TGCAGCCAGG GAGGCGCCTC GGCACAGAGC TATCTGACAG GGCGAAGCGT
GCGGGTGTCG GCGGCATATT CCACACCGAC GAGCTGCCAG CATATGGAAT AACAGAAGAG
GAGGTATCAT CACTCAGGAG CCTGCTGAGC TGCGAGGAGA GCGATGCTGT GGTGATGGTG
GCAGCGCCTC CCGAGAGGGC GAGAAAGGCG ATAGAAGCTG TTATTGAGCG CGCGCGCGAG
GCCATTGCGG GCGTGCCGGA GGAGACCAGG AGAGCGCTTC CCGATGGCAC ATCAGAGTAC
ATGCGTCCGC TTCCTGGATC CGCCAGGATG TACCCTGAGA CGGATGTACC GCCTGTCGTT
GTAAATAAAG AGATGGTCGA GCGGCTCAGG CTTCCAGAGC TCGTCGTGGA GAGGGCTGAG
AGGTACCAGA GGGAGTACGG GCTCAGCCCA GAGCAGGCCA GGATAATGGC AGCCTCATCC
AATTACCAGC TCTTCGAGGA GATCGTGCGG GTGTACAGAG TACAGCCCTC ACTCGTCGTG
CGCTCACTGG AGTCCACACC TGTCGAGCTT GCGAGGGATG GCGTGCCGGT TTACAGGCTC
AGCGAGAGGC ATTTCATGGG CTGCTTCGAC CTCCTCTCTC AGAAGAGAAT CGCAAAGGAA
GGCATACCCG CGCTCCTCAA GGCCATGGCT GAGAACCCCG ATGCAGATCC GGAGAGCGCG
GCGGAGGCAG CAGGTCTGAT GAGCCTGGGC GCGTCAGAGG TCGAGAGAAT CATCCATGAG
ATTGTTTCCG CAAAGGTTGA TCTCGTCAGG GAGAGAGGTG AGCGCGCGAT CGGGCCCCTC
ATGGGGCTCG CGATGGAGCA GCTGAGGGGC AAAGCAGACG GTGCGGCTGT GAGTGCTCTT
ATAAAGAAGG AGATAAATGC GATCCTCGGG AAGGCGGTCT GA
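
The GC content field in the gene information can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming the sequence is pasted as text with spaces and line breaks between 10-base blocks; `gc_content` is a hypothetical helper name, and only the first row of the gene sequence is used here for brevity.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters (block separators, line breaks)
    are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above; paste the full sequence to
# compare the result against the reported GC content.
first_row = "ATGGACTATC CCGATATGGA CTATCGTGCG CTTGGTCTTG TCTGCGGAAT AGAGATCCAT"
print(f"{gc_content(first_row):.1f}% GC over the first row")
```

A single row will generally differ from the whole-gene figure, so the comparison is only meaningful on the complete sequence.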
 
Protein sequence
MDYPDMDYRA LGLVCGIEIH QQLDTRCKLF CSCPTVHREV EDSNFEFFRY LRPARSELGE 
IDRAALEETL VSRRFVYKSY NTTCLVEADE EPPRELNREA LEIALVIARL LKMRIVDEIH
TMRKTVIDGS NTSGFQRTAF IASSGSIDTS CGPVGIGILC LEEEAARIVE DRGDEVVYSL
DRLGIPLVEI GTAPDIVSPQ HAREVAQHIG MILRSTGRVK RGLGTIRQDV NVSIAGGARV
EIKGVQELNL IERIVELEVI RQVRLLEIRD ELRRRNARVC GDVVDATGLF SNTRSKVVAK
ALKSGGAVLA TKLAGFKGII GKEVQPGRRL GTELSDRAKR AGVGGIFHTD ELPAYGITEE
EVSSLRSLLS CEESDAVVMV AAPPERARKA IEAVIERARE AIAGVPEETR RALPDGTSEY
MRPLPGSARM YPETDVPPVV VNKEMVERLR LPELVVERAE RYQREYGLSP EQARIMAASS
NYQLFEEIVR VYRVQPSLVV RSLESTPVEL ARDGVPVYRL SERHFMGCFD LLSQKRIAKE
GIPALLKAMA ENPDADPESA AEAAGLMSLG ASEVERIIHE IVSAKVDLVR ERGERAIGPL
MGLAMEQLRG KADGAAVSAL IKKEINAILG KAV
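
The protein sequence can be related back to the gene sequence by codon translation. The sketch below is illustrative only: it builds the codon-to-amino-acid mapping by hand rather than using a library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts, which this sketch does not model). The name `translate` is a hypothetical helper.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace between sequence blocks is removed before translation.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
prefix = "ATGGACTATC CCGATATGGA CTATCGTGCG"
print(translate(prefix))  # MDYPDMDYRA, the first block of the protein sequence
```

Because this gene begins with ATG, the alternative start codons permitted by table 11 do not affect the result here.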