Gene Msed_2213 details

Gene Information

Locus tag: Msed_2213
Symbol:
ID: 5105433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 2123340
End bp: 2125055
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 45%
IMG OID: 640508106
Product: glycyl-tRNA synthetase
Protein accession: YP_001192275
Protein GI: 146304959
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0423] Glycyl-tRNA synthetase (class II)
TIGRFAM ID: [TIGR00389] glycyl-tRNA synthetase, dimeric type
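
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon, neither of which the record states explicitly.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
gene_length = span_length(2123340, 2125055)
print(gene_length)                           # compare with "Gene Length"
print(protein_length_from_cds(gene_length))  # compare with "Protein Length"
```

Under these assumptions, both results agree with the stated 1716 bp and 571 aa.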


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.472902
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000523773
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCAGAGT CTGACAAAGT GATCGAGTTA GCGAAGAGGA GAGGGATATT CTGGCCCTCC 
TATGAGATAT ATGGTGGAGT AGCTGGGTTG TATGATATAG GACCTGTTGG TGCAAGAATT
AAGAACAAAA TAATTAATAC TTGGAGGAAA ATATTTGTTG AAGAGAACAG CGAATTTGTT
GTAGAAATTG AAACTCCCAT GATAACTCCA TCTAAAGTGC TTGAGGCCAG CGGACATGTG
GAGAACTTCA CTGACCCCAT AGTGGAGTGT ACTAAGTGTC ACAAAATATA CAGGGCCGAC
CATTTGGTGG AGGAAATGTT AAAGATCAAT GTGGAGAGAC TTAAACCATC TGAGCTGACT
TCCCTCATAT CTGAGAAGGG ACTTAAGTGT CCATCATGCG GAGGCGATTT AGGAGAAGTT
AGAAGTTTCA ATCTTCTCTT TGCGACCAAC ATAGGTCCCT ACTCTGGTAC GACCGGATAT
CTAAGGCCAG AGACAGCTCA GGGCATGTTT ACCTCCTTTA AGAGGGTTTA TGAGGCTACG
AGGCAGAGGT TACCCCTTGG GATAGCCCAA GTGGGAAGGG TAGCTAGGAA CGAGATCTCC
CCGAGGCAAG GTTTAGTTAG AATGAGGGAG TTTACCATCA TGGAGGTGGA ATTTTTCATT
GACCCCGATG ACAGGAATGT TCCCTGGTTA GATAGATACT ACAATGAGGA GTTTAGAGTT
CTATTTGGGG ATGCTAAGGT AAAGGGTCTG AAACCGGCTA CGATGAAGGT AAAGGAAATG
ATTGAGGAGG GTCTGCTCGT AAATCCGTGG ATGGGCTTTT GGATGGCATC AGCGTCCAGG
TTTGTCCAGG CACTGGGTAT ATCTAAGGAT AGTTTCTATT TCGAAGAGAA ATTACCTGAG
GAAAGGGCTC ACTACTCATC GCAAACCTTT GATCAGATAG TCGAGATCAT GGGGGAGAAG
GTGGAAATAT CGGGGCATGC GTACAGGGGA AACTATGACC TGAGCAGGCA CTCAAAGTTC
AGTAACGAAG ATCTAACTGT TTTCAAGAAG TTCGATCAAC CTAGGACAGT GGTAAAGAAG
ACCGTCATAG TGAACAGGGA TAGGTTCAAG GATAATCCAG AACTTCAGAA GGAAGTCATG
ATGCTGGTGT CTGGAAAATC GCCAGAGCAA GTTGAGGAGT TGCTAAATAA ACAGGTCCAG
GTTGCTGGAA GACCGCTGTC TGAGTTTGTC CGGATTATGA ACAGGGAAGA GAAGGAACAC
GGAATTAAGT TCTACCCACA TGTGGTTGAA CCGTCATTTG GGGTAGAAAG ATGTCTCTAC
CTAAGCGTGC TTTCAGCTTA CAGAGAGAAG AAGGATCGAG TGGTATTGGC CTTACCTAAG
GATTTAGCTC CCTATCAAGT CGCAGTATTT CCGCTTTTAG AGAGGGATGA ACTCATAAAG
AAGGCTAGGG AGATATATAA TCTCCTTTCC GGGAAGTATG AGGTCCTATT TGATGACGCA
GGAAGCATAG GAAAGAGATA TGCAAGAGTG GATGAGATTG GTGTACCATA CGCAGTCACG
GTTGACCCAC AGACATTGTC TGATGATTCT GTGACCATTA GGGACAGGGA CTCTTGGAGC
CAAATTAGAA TCAAAACATC CGATCTGGAA TCTGTTATGG ACAAGTTATT TAGTGGGCAA
GATTTTAGTA TGTTAACAGG AGAGGCGAAA AGATGA
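
The sequence above is displayed in blocks of ten bases separated by spaces, so the blocks have to be joined before any analysis. The following is a minimal Python sketch with hypothetical helper names that strips the layout whitespace and computes a GC fraction. The sample string is only the first display line of the gene sequence, so its result is not expected to match the whole-gene GC content field.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence, copied from this record.
first_line = "ATGCCAGAGT CTGACAAAGT GATCGAGTTA GCGAAGAGGA GAGGGATATT CTGGCCCTCC"
seq = join_blocks(first_line)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing the full multi-line gene sequence to `join_blocks` works the same way, because `str.split()` with no arguments splits on any run of whitespace, including line breaks.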
 
Protein sequence
MPESDKVIEL AKRRGIFWPS YEIYGGVAGL YDIGPVGARI KNKIINTWRK IFVEENSEFV 
VEIETPMITP SKVLEASGHV ENFTDPIVEC TKCHKIYRAD HLVEEMLKIN VERLKPSELT
SLISEKGLKC PSCGGDLGEV RSFNLLFATN IGPYSGTTGY LRPETAQGMF TSFKRVYEAT
RQRLPLGIAQ VGRVARNEIS PRQGLVRMRE FTIMEVEFFI DPDDRNVPWL DRYYNEEFRV
LFGDAKVKGL KPATMKVKEM IEEGLLVNPW MGFWMASASR FVQALGISKD SFYFEEKLPE
ERAHYSSQTF DQIVEIMGEK VEISGHAYRG NYDLSRHSKF SNEDLTVFKK FDQPRTVVKK
TVIVNRDRFK DNPELQKEVM MLVSGKSPEQ VEELLNKQVQ VAGRPLSEFV RIMNREEKEH
GIKFYPHVVE PSFGVERCLY LSVLSAYREK KDRVVLALPK DLAPYQVAVF PLLERDELIK
KAREIYNLLS GKYEVLFDDA GSIGKRYARV DEIGVPYAVT VDPQTLSDDS VTIRDRDSWS
QIRIKTSDLE SVMDKLFSGQ DFSMLTGEAK R