Gene Msed_2077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2077 
SymbolcysS 
ID5105057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1995195 
End bp1996592 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content49% 
IMG OID640507967 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_001192141 
Protein GI146304825 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.786603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAAGTAT TTAACACGCT GGGAAAGAGG CTCCAGAGCT TCGAGCCCCA CGAGCCCAAC 
ACCGTGAAAA TGTACGTGTG CGGTCCCACC GTCTACGACG AGGTTCACAT TGGACATGGT
AGGACCTTTG TGGCCTTCGA TGCCATGAGC AGGTATCTAA GGGTAAAGGG TTACAACGTG
GTAAGGGTCC AGAACATCAC GGACATAGAC GATAAAATAA TCAATAAGGC AAGGGAACTG
GGGAAGAGTT GGAACGAAGT TTCAGAGTAC TACTCGAAGA GCTACCTGGA ACACATCGGT
GCCCTCAAGG TAAAGATAGA CATGCATCCT AAGGTCACTA CCCACATCAA GGAGATTATC
GACTTCGTTC AAAGGTTAAT TGACAGTGGG CATGCCTACG TTGCCAACGG GAGCGTCTAT
TTTGATGTGG ACACTTACCC GGGTTATGGG GAGCTATCCA ACGTGAAGAA GGAGGAGTGG
GATCAGGGGG AGGAGATAGT TAAGGAAAAA AGACACCCCT ACGACTTCGC GCTCTGGAAG
GCGTATAAGC CTGGGGAACC ATACTGGGAG TCTCCTTGGG GTAAGGGTAG ACCTGGATGG
CACATAGAGT GCTCCACCAT GTCCACGAGG TATCTAGGAA CAAAGATCGA TATTCACGGT
GGAGGAATGG ACCTGGTGTT TCCCCATCAC GAGAACGAGA GAGCCCAAAC CGAATCCCTC
ACCGGATCAA CTTGGGTAAA GTATTGGATG CATGTGGCCT TTCTCACAAT AAGGAAGGAG
AAGATGTCCA AGTCCAAGGG CAACATTGTC CCGCTTAAGG AGGCACTGAG CAAGTATGGG
CCATCTACGC TGAGGTACTG GTTTCTATCA TCCCAGTACA GGAACCCCAT AGAGTATAGC
GAAGAGATCC TAGAACAAAG CTCTAGGTCC CTCCAGAGGC TTAAGGATGC CATATCCGTG
CTGAGGAAAA TAATTCAGAA GGGACCAGCC CACTACGCGA AGGAGGAGGA CGTAAAGGTC
CAAGAGGAAA TAGTAAGGGC TATCTCAAGG TTCGACGAAC ACATGGAGAA CGATTTTGAT
ACGTCTAACG CATTGACATC AATTCACGAA ATAGCCTCAA TAGTTTTCTC AAAGCTCCAA
TACAGCGAAG ACGTGTTTGG GGCTTTAATA GCGCTGGACG GATTCAGGAA GTTCAATGAG
GTCTTCGCAG TTATGGATGA GGAATTTTCG GCGGAGCTAG ATAGGTTAAC CAAGGTGATC
GACGCAGTGA TAGAAGTCAG GAATTACCTG AGAAAGAAAC AGATGTACGA TCTATCGGAC
CAGATCAGGG ATATCCTCTC CAGGAGCGGA GTAAAAATAC TGGACTCCAA GGAAGGCTCT
ACTTGGAGAT TTCAGTGA
 
Protein sequence
MQVFNTLGKR LQSFEPHEPN TVKMYVCGPT VYDEVHIGHG RTFVAFDAMS RYLRVKGYNV 
VRVQNITDID DKIINKAREL GKSWNEVSEY YSKSYLEHIG ALKVKIDMHP KVTTHIKEII
DFVQRLIDSG HAYVANGSVY FDVDTYPGYG ELSNVKKEEW DQGEEIVKEK RHPYDFALWK
AYKPGEPYWE SPWGKGRPGW HIECSTMSTR YLGTKIDIHG GGMDLVFPHH ENERAQTESL
TGSTWVKYWM HVAFLTIRKE KMSKSKGNIV PLKEALSKYG PSTLRYWFLS SQYRNPIEYS
EEILEQSSRS LQRLKDAISV LRKIIQKGPA HYAKEEDVKV QEEIVRAISR FDEHMENDFD
TSNALTSIHE IASIVFSKLQ YSEDVFGALI ALDGFRKFNE VFAVMDEEFS AELDRLTKVI
DAVIEVRNYL RKKQMYDLSD QIRDILSRSG VKILDSKEGS TWRFQ