Gene Msed_2124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2124 
SymbolpheS 
ID5104417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2043862 
End bp2045259 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content48% 
IMG OID640508013 
Productphenylalanyl-tRNA synthetase subunit alpha 
Protein accessionYP_001192187 
Protein GI146304871 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0016] Phenylalanyl-tRNA synthetase alpha subunit 
TIGRFAM ID[TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.258233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.247036 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGCG AGAACGAGAT CAAAATCTTG GATTTTTTAA AGAGGAGAAA GGAATCAACC 
TCCCAGGAGA TTGCAGAGGG AACAGGTCTT CCCCTGAGCT CAGTGTTTAG TATTATCGCA
ACCCTAGAAT CTAAAGGTAT AGTGAAAGTC ATCTCAGAGG AGACCAGGAA AGTAGTCAGA
CTAACGGACG AGGGAAAACT TAGGACCGAA CAAGGGCTTC CAGAGGACCG TTTAGTTACT
CTCCTCAACG GAAGACCCTT AAAGATCCAG GAGTTGAGAA ATGCACTGGG CAAGGATTTT
GAAATAGGGT TCGGATGGGC CAGAAGAAAG GGATTAATAA CCTTGGAGAA CGACACGGTA
ATACCCAAGG TTTCGCAGTA CGTATCACCT GAGTACACGG CGTTAAAGGA TCTGCAGGCT
GGAAAGGAAC CTACTGGCGA GGTTCTAGAG ATCCTTCTAA GGAGGAAACT TGTTGAGGTG
AAGGAAGAGA AGATGCTAAG GGTTCAACTC CTGAGGGAAG TAGAGACAAG GCCAGCCGAA
CTTTACGTAA CTCACGAGAT GTTAACCACG GGTTCTTGGA GAGAATACGA GTTTAAACCC
TACAACGTGG AGGCTAATCC GCCTTTCTTT CCCATAGGGA AGACCCACTA CTTTAGGGAC
TTCATTGAGA AAGTGAAGGA CCTCATGGTG GGGCTTGGTT TCGTGGAAGT GTCTGGAGAC
TTCGTGGAGA CTGAATTCTT CAATTTCGAC ATGTTGTTCC AGCCACAGGA TCATCCGGCC
AGGGAAATTC ACGATTCCTT TGTGATTGAG GGAAAGGGTA ACTTACCTGG TTCTGACCTC
GTTAGGAAGG TTAAGGAGGT CCACGAGAAG TGGTGGAGAT ATTCTTGGAG CGAGGATAAC
GCAAGAAGGC TTGTTCTGAG GAGTCAGACC ACCGCTGTTA CTGCCAGGGT CTTAAGTGGT
GCACCAAAGA GAATAAGAGC CTTTACCATA GGTAAGGTGT TTAGGCCAGA CTCCATTGAC
GCTACTCATC TCATAGAGTT TCATCAGATG GATGGACTGG TCATAGAGGA GGACTTCACG
TTCAGGGACC TCCTTTCTAC TTTACGTGAT ATATTTCAGG GACTTGGGGT CAAGCAGGTA
AAGTTCAAGC CTGGGTATTT CCCATTCACC GAGCCCAGCG TAGAGGTTTA CGGTTTCATT
GAGGGCCTAG GTTGGGTGGA GATGGCTGGG GCGGGACTGC TCAGAAAGGA GGTTACGGAA
CCAGCAGGAG TTTTCTCGCC AGCAGGAGCA TGGGGGATAG GTATAGACAG ATTGGCCATG
CTCTTTCTAG GTGTCAAGGA TATAAGGGAT CTATACTCGC TCGATATAGA GTACCTGAGA
TCGAGGAGGG TGATCTAA
 
Protein sequence
MLSENEIKIL DFLKRRKEST SQEIAEGTGL PLSSVFSIIA TLESKGIVKV ISEETRKVVR 
LTDEGKLRTE QGLPEDRLVT LLNGRPLKIQ ELRNALGKDF EIGFGWARRK GLITLENDTV
IPKVSQYVSP EYTALKDLQA GKEPTGEVLE ILLRRKLVEV KEEKMLRVQL LREVETRPAE
LYVTHEMLTT GSWREYEFKP YNVEANPPFF PIGKTHYFRD FIEKVKDLMV GLGFVEVSGD
FVETEFFNFD MLFQPQDHPA REIHDSFVIE GKGNLPGSDL VRKVKEVHEK WWRYSWSEDN
ARRLVLRSQT TAVTARVLSG APKRIRAFTI GKVFRPDSID ATHLIEFHQM DGLVIEEDFT
FRDLLSTLRD IFQGLGVKQV KFKPGYFPFT EPSVEVYGFI EGLGWVEMAG AGLLRKEVTE
PAGVFSPAGA WGIGIDRLAM LFLGVKDIRD LYSLDIEYLR SRRVI