Gene Msed_1921 details

Gene Information

Locus tag: Msed_1921
Symbol:
ID: 5103308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1868233
End bp: 1869675
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 49%
IMG OID: 640507809
Product: prolyl-tRNA synthetase
Protein accession: YP_001191985
Protein GI: 146304669
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00408] prolyl-tRNA synthetase, family I
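
The coordinates and lengths listed above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1868233
END_BP = 1869675


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the reported Gene Length (1443 bp) and Protein Length (480 aa).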


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATAT CAAGGGAGAA GTGGTCCTCT AATTTTAGCG AATGGTTTGA TTGGGTTATC 
TCTCAGGCTG AAATATACGA CTATGGAAGA TACCCCGTTA AGGGATCTGG CGTATGGATG
CCCTACGGCT TCAAGATAAG GCAGAACGTG ACCACCCTCA TTAGGAAATT ACTTGACGAA
ACAGGTCATG AGGAGGTGCT CTTTCCACTT CTTATTCCTG AGACGCTCCT CAAGAAGGAG
GCAGAACACA TTAAGGGCTT CGAGAAGGAA GTTTTCTGGG TCACTCACGG AGGGGAGGAC
GAACTCGAGG AAAGGTTTGC CCTCAGACCA ACGTCAGAGG TTGCGATTAC CCTGATGGAA
TCCCTCTGGA TAAAGGGTTA CTCGCAATTA CCAAGAAAGT TCTACCAGAT CGTTAGCGTG
TTTAGGTATG AGACCAAGGC CACAAGACCC ATGATCAGGG TTAGGGAGCT CTCCACGTTT
AAGGAAGCCC ATACTGTTCA CGAGACCTTT GAGGATGCGG CCAGGCAAGT GGATGAGGCA
GTAGAGATTT ACAGTAAGTT CTTTGATATT CTGGGAATTC CATACCTCAT CTCTAGGAGA
CCGGAATGGG ATAAGTTCGC TGGGGCAGAG TACACCATAG CTCTCGACAC CATAATGCCC
GATGGAAGAG CTCTACAGAT AGGGACGGCG CATCATCTGG GCCAGCACTT CACCAAGGCA
ATGGACTACA AGGTCCAGAG GGCCGATGGT TCTCACGTTC ATCCACATCA GACAAGTTAC
GGGATATCTG ACAGGGTAAT AGCAACTGTG ATCTCCATAA ACGGTGATGA TCACGGCCCC
ATACTACCAC CTGTGGTAGC TCCCATTGAG GGTGTCATCA TACCGATACC TGGAAAGAGT
GAAGAGGACA CCGAGAAAAT CAACAAGTAT GCCATGGAAG TGGAGTCCGT TCTCAAGAAC
AGCGGAATCC GCGTGGCCCT TGACGCCTCT GAGGATAAGA CTCCTGGAGA GAAGTATTAT
ATCTGGGAGT TAAAGGGCGT TCCAATCAGA ATAGAGATAG GACCTAGGGA GCTAAACTCT
GGCACTGCCT TCCTTAAGAG GAGGGATACG CTAGAGGGCA AAAGCGTGAA GAGGGAGGAA
CTGGTAAAGG AATTCAGGAA CCTTGAGGAT CAAATCTCCG CCGACCTTAG GAAGAGGGCA
TGGGAACAGT TCAAGGAGAG GGTTAAGAGG TTCCAGAGCT TGGATGAGGC TAAAAAGTTC
CTGGAGAACA GGGGAGGCAT AGCTGAGGTT CCATGGTGCG GACAGGACTC ATGCGGACTT
AAGATCGAGG AACAGGTCCA GGCTAGGGTT TTGGGTACTC CCTTGAAACC TGAACCTAGC
GGTAACTGCG TCGTATGTGG AAAACCTTCA ACCAACATCC TTCGAATAGC AAAAACTTAT
TAG
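
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any analysis. As a minimal sketch (the helper names `clean_sequence` and `gc_percent` are hypothetical, not part of any database tool), the following strips the layout whitespace and computes the GC percentage of a sequence; it is demonstrated on the first line only.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a short demonstration.
first_line = "ATGAAGATAT CAAGGGAGAA GTGGTCCTCT AATTTTAGCG AATGGTTTGA TTGGGTTATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applying the same two functions to the full listing would give a value to compare against the reported GC content.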
 
Protein sequence
MKISREKWSS NFSEWFDWVI SQAEIYDYGR YPVKGSGVWM PYGFKIRQNV TTLIRKLLDE 
TGHEEVLFPL LIPETLLKKE AEHIKGFEKE VFWVTHGGED ELEERFALRP TSEVAITLME
SLWIKGYSQL PRKFYQIVSV FRYETKATRP MIRVRELSTF KEAHTVHETF EDAARQVDEA
VEIYSKFFDI LGIPYLISRR PEWDKFAGAE YTIALDTIMP DGRALQIGTA HHLGQHFTKA
MDYKVQRADG SHVHPHQTSY GISDRVIATV ISINGDDHGP ILPPVVAPIE GVIIPIPGKS
EEDTEKINKY AMEVESVLKN SGIRVALDAS EDKTPGEKYY IWELKGVPIR IEIGPRELNS
GTAFLKRRDT LEGKSVKREE LVKEFRNLED QISADLRKRA WEQFKERVKR FQSLDEAKKF
LENRGGIAEV PWCGQDSCGL KIEEQVQARV LGTPLKPEPS GNCVVCGKPS TNILRIAKTY
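
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration rather than the database's own method: it uses the standard codon assignments, which translation table 11 shares for sense and stop codons (the two tables differ in which start codons are permitted), and stops at the first stop codon. `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence; it encodes the first 20 residues.
first_line = "ATGAAGATAT CAAGGGAGAA GTGGTCCTCT AATTTTAGCG AATGGTTTGA TTGGGTTATC"
print(translate(first_line))  # MKISREKWSSNFSEWFDWVI
```

The output matches the first two blocks of the protein sequence listed above.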