Gene Msed_1715 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1715 
SymbolargS 
ID5105078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1651403 
End bp1653262 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content47% 
IMG OID640507609 
Productarginyl-tRNA synthetase 
Protein accessionYP_001191794 
Protein GI146304478 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCGG ATGTTCTAGC GAAGATAAAA CAGGAGATAT CCTCTCAGGT GGGTCAGGAA 
CTTGGAGTAA GCCAGGAATC GGTTTATAAG GCGATCGAGT ATCCGCCACG CGAGGAAATG
GGAGATCTGG CCATTCCCTT TCCCGCGTTA GGAAAGAAAC CGCAAAATCT CGAGATCAAA
TCCAAGTTTA TTAGGTCGCT CTCACTCTCT GGTCCCTTCC TTAACCTGAG GTTGGATGAG
ACCCAACTCT TCATGGAAGT CTTTTCGTCG ATGGATCAGG ACTATGGAAT AGAGAAGGTA
GAGAAGCCCA AGAGGATAGT GATAGAGCAC ACAAGTGCCA ACCCGATTCA CCCACTTCAC
ATAGGACATC TTAGGAATTC CATAATTGGC GACGCCCTAG TCAGGCTCTC TAGGGCAAGG
GGCCATGAGG TAATCTCTAG GTTTTACGTT AATGACAGCG GTAGGCAGGT TGCAATCCTG
ATTTATGGTC TCTCCAAACT TGGTTATCCG GAACCCCCAG ACGGAGCTAA GAAGGATCTA
TGGGCAGGAA CCGTGTATGC CATGACCAAT ATCATCCTCG AAATCAGGGC CATAACTGAC
GAGCTCAAGA CTGCACAGGA TGCCGAGTAC AGAGAGAAGG TAACCAAGAG AGATGAACTT
GTGGGATTAG CTTCCGAGAT AAGGTCCAGA AACCAGGAAT ATTTCGATAG GTTGGCTGAA
TCCATAAGGG ATGATCCAGA TCCTGAGGGA AAGATTGCGG ATATCATTAG GAGGTATGAG
GCCGGAGATC CACAGATGAA GGAAATTGTT AGAAAATACG TGAACCACGT CTTGGAGGGG
TTCAGGGAGA GTTTAGGTAG ACTGAACATC GAGTTTGACG AGTTTGATTA CGAGAGTGAC
TTGCTATGGT CTGGCGAAGT GAAGTCCGTG TTAAGATCTG CATTGAGCTC CAGAGCTAGA
ATAGAATATA AGGGAACGGA GGCCCTGGAC CTAGACAAGT ATCTAGACGA CGAGGTCAGG
AAAGAGCTAA GGATTCCGGC AGGGTTGGAG ATACCTCCCT TAGTGTTGAC TAGATCCGAC
GGGACAACCC TTTACACGGT AAGAGATATT GCCTACACCA TAAGGAAGTT CCTGACGAGT
AAGGCAGAAC AGGTAATTAA CGTAATAGCT GAGCAACAGA CTGTACCACA GATACAGCTT
CGAGCTGCCC TCTACCTTCT TGGTTATCCT GAGATGGCGA AAAATCTGAT TCATTACTCA
TACAGCATGG TCACCCTTCC AGGAATGACT ATGAGCGGGA GGTTAGGACG TTACATCTCC
CTTGATGAAG TTTACGAAAA AGTAAAGCAA GCAGTGGAGG AGAAGACTAA GGACAGGGGT
AACCAGGTTA ACATAGCCGA GATAGTTAAC TCGGCCATAA GATATGCGCT ACTCTCTGTC
TCCGCCGACA AGCCCATTAC GTTCAATGTG GGGAAGGTCA CAAACTTTGA ACAAAATAGT
GGGCCATATC TCCAGTATAC CTACGTGAGG GCCTACAATA TCCTTTCAAA ATTTACGGGC
GAAATAAACC TAGATGTCGA TTATGGTGAC CTGGTAGGGG AGAAAAGAAG GTTACTTCTC
GCTATCGCCA AGTTCCCAGA AACCTTCAAG AACTCTGCGG ACAGCCTAGA GCCCGAGCTC
TTGGTTTCCT ATCTCAGATA CCTAGCTGAT ACTTTCAACG CATGGTACGA TAAGGAGAGA
GTTCTCCAGG AACAGGATGA GAAAAAGAGG ATGACCAGAT TAAACCTGGT GAAGGGAGTA
GAGGTCGTGA TGAGAAACGG TCTAAGGGTT CTAGGAATAA GTTCATTAAC TAAAATGTAA
 
Protein sequence
MPADVLAKIK QEISSQVGQE LGVSQESVYK AIEYPPREEM GDLAIPFPAL GKKPQNLEIK 
SKFIRSLSLS GPFLNLRLDE TQLFMEVFSS MDQDYGIEKV EKPKRIVIEH TSANPIHPLH
IGHLRNSIIG DALVRLSRAR GHEVISRFYV NDSGRQVAIL IYGLSKLGYP EPPDGAKKDL
WAGTVYAMTN IILEIRAITD ELKTAQDAEY REKVTKRDEL VGLASEIRSR NQEYFDRLAE
SIRDDPDPEG KIADIIRRYE AGDPQMKEIV RKYVNHVLEG FRESLGRLNI EFDEFDYESD
LLWSGEVKSV LRSALSSRAR IEYKGTEALD LDKYLDDEVR KELRIPAGLE IPPLVLTRSD
GTTLYTVRDI AYTIRKFLTS KAEQVINVIA EQQTVPQIQL RAALYLLGYP EMAKNLIHYS
YSMVTLPGMT MSGRLGRYIS LDEVYEKVKQ AVEEKTKDRG NQVNIAEIVN SAIRYALLSV
SADKPITFNV GKVTNFEQNS GPYLQYTYVR AYNILSKFTG EINLDVDYGD LVGEKRRLLL
AIAKFPETFK NSADSLEPEL LVSYLRYLAD TFNAWYDKER VLQEQDEKKR MTRLNLVKGV
EVVMRNGLRV LGISSLTKM