Gene Msed_2131 details


Gene Information

Locus tag: Msed_2131
Symbol: gltX
ID: 5104424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 2050311
End bp: 2052011
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 51%
IMG OID: 640508020
Product: glutamyl-tRNA synthetase
Protein accession: YP_001192194
Protein GI: 146304878
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0008] Glutamyl- and glutaminyl-tRNA synthetases
TIGRFAM ID: [TIGR00463] glutamyl-tRNA synthetase, archaeal and eukaryotic family
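
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(2050311, 2052011)
print(length)                     # 1701
print(protein_length_aa(length))  # 566
```

Both results agree with the Gene Length and Protein Length fields of the record.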


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000159948
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000949008
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCATGG AGTTAGAGGA AATTGTATAC AAGTATGCCC TCTGGAACGC GGTGAAACAC 
AACGGACAGG CCCAGGTAGG CCCCGTTGTG AGTAAGGTTT TCGCCGAGAG GCCAGAGCTT
AAGGCGAACG CCAAGGAAGT GGTTAAACTC GCCGAGAAGA TGGTTGCAAA GGTTAACGCG
ATGTCCCTGG AACAACAGAC TGCGGAGTTG CAGAAGTACC CTGAGCTTCT GGAGGAGAGG
AAGAAGGAGG AGAAAAAGAC TCTCTCCCCG CTTCCAAACG TTAAGGGCAC TGTGGTCACG
AGGTTTGCCC CGAACCCCGA TGGTCCACTT CACCTGGGTA ACGCCAGGGC AGCGGTCCTG
TCCTTCGAGT ATGCCAAGAT GTATAAGGGC AAATTCATCC TAAGGTTTGA CGACACCGAT
CCCAAGGTGA AGAAACCCAT TAAGGAGGCT TACGACTGGA TCAGGGACGA TTTAAGATGG
CTCAACATCA CGTGGGATCT TGAGTTTAAG GCCTCGGAGA GAATGAGCGC CTACTACAAT
GTGGCTAAGG TGATGCTCGA GAAGGGGTTT GCCTACGTTG ATACCCTTAG TGACGCGGAG
TTCAAGGCGT GGAGGGACTC AAGGAACAAA ACCGTGTATA AGCCCAGGAC CAATCCACCG
GAGGTCAACC TCGAGCTCTG GGAGAAGATG CTGAACGGCG ATTTTGACGA GGGTAAAGCT
GTGGTGAGGA TAAAGACGAA TCCGGAGGAC CCTGATCCCT CAAAGATCGA CTGGGTAATG
CTCAGGATCA TTGATACCAA GAGAAACCCC CACCCCATCG CAGGGGATAA GTTCAGGGTA
TGGCCCACAT ACAATTTCGC CACGGCCGTG GATGATCACG AGTTCGGGAT AACCCATATC
CTCCGTGCAA AGGAGCACAC GACCAACACC GAGAAACAGA GATGGGTCTA CGATTACATG
GGATGGGAAA TGCCAACCGT CCTTGAATTT GGAAGACTGA AACTTGAGGG TTTTATGATG
AGCAAGTCCA AGATCAGGGG GATGCTGGAG ACCGGGTCAG AGAGAGACGA TCCTAGGTTA
CCCACTCTAG CAGGGTTAAG GAGGAGGGGG ATCATACCCG ACACCGTGAG AGAGATCATC
ATCCAGGTGG GATTGAAGGT GACAGACGCG ACCATTAGCT TCGATAATAT AGCCTCTGTT
AATAGGAAAC TCCTGGATCC CGTTGCCAAG AGGCTCATGT TTGTCAGGGA AGGGGTACTC
TTTAAACTGG AGATCCCCCA GGAGATGAAG GCCAAGGTTC CCCTAATACC TGCGAGACAG
GAGTTTAGGG AGATCTTCGT GAAACCTGGT GACGAGATTT ACCTGGATAA GGGCGATGTT
GAGGAGGGGA AGGTAGTGAG GCTCATGGAC CTCTGCAACG TTAAGATAGA GGGAGACAGA
TTGAGGTTCC TCAGCCAGGA TCTAGAGTCT GCCAAGAGAA TGGGAGCTAA CATAATCCAG
TGGGTGAAGA AAAGTGAAAG CAAGAGCGTG AACGTGATAA AGGCTGATCC AAACAAGGAT
GTCGAGGAGA TCAGGGGCTA CGGCGAGGGG TACTTTGAGA CCCTAAAGCC AGGGGATATT
GTCCAGCTGG TTCGCTACGG ATTCGCCAGA GTGGACAGCA TATCACGTGG TGAGATTACC
ATGATATTCG CACATGAGTA A
 
Protein sequence
MTMELEEIVY KYALWNAVKH NGQAQVGPVV SKVFAERPEL KANAKEVVKL AEKMVAKVNA 
MSLEQQTAEL QKYPELLEER KKEEKKTLSP LPNVKGTVVT RFAPNPDGPL HLGNARAAVL
SFEYAKMYKG KFILRFDDTD PKVKKPIKEA YDWIRDDLRW LNITWDLEFK ASERMSAYYN
VAKVMLEKGF AYVDTLSDAE FKAWRDSRNK TVYKPRTNPP EVNLELWEKM LNGDFDEGKA
VVRIKTNPED PDPSKIDWVM LRIIDTKRNP HPIAGDKFRV WPTYNFATAV DDHEFGITHI
LRAKEHTTNT EKQRWVYDYM GWEMPTVLEF GRLKLEGFMM SKSKIRGMLE TGSERDDPRL
PTLAGLRRRG IIPDTVREII IQVGLKVTDA TISFDNIASV NRKLLDPVAK RLMFVREGVL
FKLEIPQEMK AKVPLIPARQ EFREIFVKPG DEIYLDKGDV EEGKVVRLMD LCNVKIEGDR
LRFLSQDLES AKRMGANIIQ WVKKSESKSV NVIKADPNKD VEEIRGYGEG YFETLKPGDI
VQLVRYGFAR VDSISRGEIT MIFAHE
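
The sequences above are printed in blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how the gene sequence could be joined, its GC fraction computed, and its translation compared with the protein sequence. It assumes the standard codon assignments, which translation table 11 shares for all internal codons; it does not handle the alternative start codons that table 11 allows. The helper names are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
# Standard codon assignments, enumerated in TCAG order (shared by table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = clean_sequence(
    "ATGACCATGG AGTTAGAGGA AATTGTATAC AAGTATGCCC TCTGGAACGC GGTGAAACAC"
)
last_line = clean_sequence("ATGATATTCG CACATGAGTA A")

print(translate(first_line))  # MTMELEEIVYKYALWNAVKH
print(translate(last_line))   # MIFAHE
```

The two printed strings match the beginning and end of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_fraction` gives the value that the GC content field reports as a rounded percentage.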