Gene Msed_2123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2123 
SymbolpheT 
ID5104416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2042240 
End bp2043862 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content47% 
IMG OID640508012 
Productphenylalanyl-tRNA synthetase subunit beta 
Protein accessionYP_001192186 
Protein GI146304870 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0072] Phenylalanyl-tRNA synthetase beta subunit 
TIGRFAM ID[TIGR00471] phenylalanyl-tRNA synthetase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.276754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACCA TCAACCTGAA CAAGTGGATA CTGCAGGATA TGACGGGGTT GAACGAGCAG 
GAATTAGTGG ATTATCTGTT CAAGTTGAAA TCAGAAGTCT CACCAGTAAG TCAGGACGAG
TATTCCATAG AGGTAAACGC TGATAGGTTG GACATGCTGA GCCTCGGCGG GATTGTGAGG
GCGTTAAAGG GAATAACGGG CAAGGAACTA GGGGAGCCCA GTTACCCCGT TAAGGACACG
GATTACGTTC TTGAGGTCGA AAAGGTAGCC TCTAGACCAT ACGCTCTTGC GTGCGTCATT
TACAATGTGA AGCTTAGTCC AGATTTCTAC TTAAAGGAGC TAATCCAGTT TCAGGAAAAA
CTCCATGATA CCATAGGAAG GAGAAGGAAG AAGGTAGCCA TTGGAATACA CGATCTTGAG
AAGGTAGAGG GGAAAATAAT TAGATATGCC CCAGTTTCCC TCTCTACCAC TTTCATTCCA
CTAAACCAGG AAAGGGAAAT GAGCGTAAGG GATGTCCTGC AAGAGACACC GCAGGGGAAA
CAGTACGGAA ACATATCCGT CTGGGACAGC AACTCCCCTG CAATAATGGA TGAGAGGGGG
ATCCTGAGCG TACCTCCGGT TATTAACTCG GACAGGACGA AGATCACTGG TAATACCAAG
TCGCTCCTCA TTGACGTTAC GGGAACGAAC TTTGAGTCCG TTATGGAGAC AATGGACCTC
TTGGCCACGG CTCTAGCTGA ACTGGGAGGG ATAATTGGGA GAGTAAAGGT AAGGGGGATG
AGCGTCGATA GGTCACCTGT GTTGAGGCAT ACCTCAGTTC CATTTAGTTT GGATGACGTG
AATAAGAGAC TAGGAATTCA CGTATCTAGG GATGAGGCCA TCAACTTGAT CAGAATGATG
AGAATGGAGG TGGAGACTAA CAAGGATCTC GCGGTTATAG TTCCACCATA CAGGGTCGAT
ATAATGAACT ACACAGATGT GGCAGAGGAT ATCGCAATGG CTTACGGGTA CGATCGCTTT
ACGCTGGAGT CGGGAAGAAC CGCGAGTAGG GGGTCCCTTT CGGAAAAGTC TGAAATTTAC
AGAAAGTTAA GAACCCTACT TGTAGGGGCA GGGTTTCAAG AAGTATATAC GCTTGTCTTA
ACAAAGTCCA GTTACCAGAG GGGGGAGGCG GTAAACATAG CTAATCCGAT CTCCGTGGAA
TACGATTCCG TTCGCAACTC GTTGCTGTGG AATAGCCTTG TGTTCCTATC AAATAATCAG
CACTCTAGGT TTCCTGTGAG GATATTTGAG ATAGGTGACG TGGTTAACAG GGATGATAGT
AAGGACACCA AATACTCTAA CTCAACTAGG CTGTCCATGG CCATTATGGA CAGCAGGGTG
AGTTACGAGA TGCTTCAGGC CCCACTGCAC GAGGTATTGC TCAATCTCTT GGGAGTTGCT
CCTTCCTACA GGAGGTTCGA GAGCGACATC TTCATGAAAG GAAGATCAGC TGAAGTGGTT
GTCAAGGGCG AGACAATTGG CAGGTTGGGT GAGGCAAATC CAGAGTTATT AAGGAGTTTT
GGTCTATTAT ACCCGGTGTT ACTAGCAGAA CTTGATCTGG ATGCCTTGAG GAGGGTGATG
TGA
 
Protein sequence
MPTINLNKWI LQDMTGLNEQ ELVDYLFKLK SEVSPVSQDE YSIEVNADRL DMLSLGGIVR 
ALKGITGKEL GEPSYPVKDT DYVLEVEKVA SRPYALACVI YNVKLSPDFY LKELIQFQEK
LHDTIGRRRK KVAIGIHDLE KVEGKIIRYA PVSLSTTFIP LNQEREMSVR DVLQETPQGK
QYGNISVWDS NSPAIMDERG ILSVPPVINS DRTKITGNTK SLLIDVTGTN FESVMETMDL
LATALAELGG IIGRVKVRGM SVDRSPVLRH TSVPFSLDDV NKRLGIHVSR DEAINLIRMM
RMEVETNKDL AVIVPPYRVD IMNYTDVAED IAMAYGYDRF TLESGRTASR GSLSEKSEIY
RKLRTLLVGA GFQEVYTLVL TKSSYQRGEA VNIANPISVE YDSVRNSLLW NSLVFLSNNQ
HSRFPVRIFE IGDVVNRDDS KDTKYSNSTR LSMAIMDSRV SYEMLQAPLH EVLLNLLGVA
PSYRRFESDI FMKGRSAEVV VKGETIGRLG EANPELLRSF GLLYPVLLAE LDLDALRRVM