Gene Msed_2152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2152 
Symbol 
ID5104891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2066156 
End bp2067214 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content46% 
IMG OID640508043 
Producttyrosyl-tRNA synthetase 
Protein accessionYP_001192215 
Protein GI146304899 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0162] Tyrosyl-tRNA synthetase 
TIGRFAM ID[TIGR00234] tyrosyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000510287 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000433253 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGATAACTA GAAATACCGC GGAGGTTGTT ACTCCCGAGG AATTAAAAGA AGCCTTGGAA 
AGCGGTAGAA AACTTAAGGG ATATCTTGGT TTTGAGCCCA GCGGTCTATT CCATATAGGA
TGGTTAATCT GGGCTCAAAA GGTGAAGGAC CTATTGCAGG CCGGAGTGGA CATGAGCCTC
CTGGTCGCAA CGTGGCATGC CTGGATAAAT GACAAACTAG GGGGGAATAT GGAAATGATA
AAGCTAGCAG GCGATTATGC TATTACCGTA TTGGACAGTT TTGGAATTAG CAGGAGCAAG
GTTCACGTCA TAGATGCTGA GGATATGGTG AAGGACAAGG ATTACTGGTC ATTGGTAATA
AGGGTGGCAA AGAACACGAG CCTGGCTAGG ATGAAGAGGG CACTCACTAT CATGGGAAGG
AAGGCCGATG AAGCTGAGCT AGATTCCTCT AAGCTGATTT ATCCTGCGAT GCAGGTGAGT
GATATCTTCT ATATGGACTT AGATATAGCG CTGGGTGGAA CGGATCAAAG GAAAGCTCAC
ATGCTTGCAA GGGACGTAGC TGAGAAGCTT GGCAAGAAGA AGGTAATAGC AATTCACACG
CCACTCCTGG TTGGCTTACA GGGAGGGCAG AGGATGAACC CTGGAGTGGA CGAAGATGAC
GCCTTGGCTG ACATAAAGAT GAGTAAATCC AAGCCTGAGA CTGCCATATT CATCAACGAC
GAGCCTGAAG AAGTGGAGGG TAAATTGATG ACAGCATACT GTCCCAAGGG AGTTGTGGAG
AATAACCCGG TGTTACAAAT TAACAAGTAC ATCCTATTCC AGGTCGATGA TAGGGGACTT
AAGGTAGAGA GGGATGCTAA GTTTGGCGGG GATGTACAGT TCAACACCTA TGAAGAGCTG
GAGAAAGCCT ACGCTGAAGG GAAATTACAT CCCAAGGACC TTAAGGTTGC AACTGCAAGA
AAGCTTAACC AGATAATAGA TCCTTTAAGG AAGTCTATTA AATCTAGACC TGAATATGAT
AAACTAGCAA AAGAAATAGC AAGGAGTGTT AGCAGGTGA
 
Protein sequence
MITRNTAEVV TPEELKEALE SGRKLKGYLG FEPSGLFHIG WLIWAQKVKD LLQAGVDMSL 
LVATWHAWIN DKLGGNMEMI KLAGDYAITV LDSFGISRSK VHVIDAEDMV KDKDYWSLVI
RVAKNTSLAR MKRALTIMGR KADEAELDSS KLIYPAMQVS DIFYMDLDIA LGGTDQRKAH
MLARDVAEKL GKKKVIAIHT PLLVGLQGGQ RMNPGVDEDD ALADIKMSKS KPETAIFIND
EPEEVEGKLM TAYCPKGVVE NNPVLQINKY ILFQVDDRGL KVERDAKFGG DVQFNTYEEL
EKAYAEGKLH PKDLKVATAR KLNQIIDPLR KSIKSRPEYD KLAKEIARSV SR