Gene Msed_2049 details

Gene Information

Locus tag: Msed_2049
Symbol:
ID: 5105271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1971082
End bp: 1972413
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 46%
IMG OID: 640507939
Product: replication factor C large subunit
Protein accession: YP_001192113
Protein GI: 146304797
COG category: [L] Replication, recombination and repair
COG ID: [COG2256] ATPase related to the helicase subunit of the Holliday junction resolvase
TIGRFAM ID:
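
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. The function names are hypothetical and not part of any database API. It assumes inclusive 1-based coordinates and a coding sequence that ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1971082, 1972413)
print(length)                     # 1332
print(protein_length_aa(length))  # 443
```

Both results agree with the Gene Length and Protein Length fields of this record.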


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTTC CCTGGGTGGT TAAGTACAGA CCAAAGACCC TAGATGACGT GGAGAATCAG 
GAGGACGTAA AGGACGAGTT GAGGTCTTGG ATAGATTCTT GGCTTAAGGG ATCTCCCTCC
TCCACGGCAG TAATGTTATA TGGTCCTCCT GGGACCGGGA AAACCTCCTT GGCTATAGCG
TTGGCCAATA CCTACAAACT TGAGCTCGTG GAGACCAATG CCAGCGATAC CAGAAACTTG
ACCTCACTTA GGGCAATAGT GGAGCGGGCT TCAATTAGTG GTTCTCTCTT TGGAATTAGG
GGAAAGCTAA TCTTTCTCGA TGAAGTGGAT GGAATTCAAC CAAAGCAAGA CTACGGAGCA
GTATCAGCAA TTCTAGAGAT AATTAAGAAC ACGAAGTATC CCATATTGAT GGCTGCTAAC
GATCCATGGA ATCCGAATCT ACGTGATCTT AGAAATGCGG TGAAGATGAT TGAGGTAAAA
AAACTTGGGA AGATCGCTAT GAGGAGATTA CTCAAAAAAA TCTGCTCTGG CGAGAAAATT
AAGTGCGAGG ATAACGCGTT GGATCAGATC ATAGAGGCCT CAGACGGCGA CTCTAGATAC
GCAATAAATT TCCTTCAATC CATTGCTGAG GGATATGGAG AGGTCACGGA AAAGCTGGTA
AGTGAGCTAG TAAGAAGAAA GGAGAGGGAG CTAGATCCCT TTGAGACTGT CAGGAGCGTG
TTTTGGGCAA GATATGGTTG GCAGGCCAAG CAGGCAGTGT CTAACTCCCA GGTCGAATAT
GATCTTCTAA TGAGATGGTT ATCCGAGAAC ATACCGATTC AGTATGAAAT GTTAAATGAT
ATATGGAGAG GTTACGACGC CCTAGCTAGG GCATCTATCT TCCTCACAAG GGCCAAGCTT
TCCAGCTGGG ATATGCTAAG TTACACCTTT GACCTTATGG GTCCAGGTGT TGCAATGGCC
GAAGTGGAGA AGAAGAGTCC CTCGTGGAAA GCGAAGTGGA AGAAGTACCA ATTCCCTACC
CTAGTACAGC AATTGTACAA ATCTAAGAGG ACTAGGGATA CTAGGGATCA GATAATCAAG
AAGATAGGAT TCCACCTACA TTCCTCTTCG ACTAAAATTT ACAACGACGT GTTCCCGTTC
TTCCTTATCA TGACATCAAA GGACTTGGAT GAGCTGGCGA AGAACCTAGA TCTTAGTCCA
GAGGAGATTG AGTTCATTCA GTCCTCACAG GTAAGGGATG TGGCCTTGAA GGAAACTGGA
TCTACTGCAC AGCCCTCTGA GAGAACTTCT AGGTCTAGAA CGACCTCTAA ATCCAGGTCT
AAGAAACCTT GA
 
Protein sequence
MTVPWVVKYR PKTLDDVENQ EDVKDELRSW IDSWLKGSPS STAVMLYGPP GTGKTSLAIA 
LANTYKLELV ETNASDTRNL TSLRAIVERA SISGSLFGIR GKLIFLDEVD GIQPKQDYGA
VSAILEIIKN TKYPILMAAN DPWNPNLRDL RNAVKMIEVK KLGKIAMRRL LKKICSGEKI
KCEDNALDQI IEASDGDSRY AINFLQSIAE GYGEVTEKLV SELVRRKERE LDPFETVRSV
FWARYGWQAK QAVSNSQVEY DLLMRWLSEN IPIQYEMLND IWRGYDALAR ASIFLTRAKL
SSWDMLSYTF DLMGPGVAMA EVEKKSPSWK AKWKKYQFPT LVQQLYKSKR TRDTRDQIIK
KIGFHLHSSS TKIYNDVFPF FLIMTSKDLD ELAKNLDLSP EEIEFIQSSQ VRDVALKETG
STAQPSERTS RSRTTSKSRS KKP
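
The gene and protein sequences above can be cross-checked against each other. The sketch below is one way to do that in plain Python, not the method the database itself uses. It strips the 10-character block spacing, computes GC percentage, and translates with the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ only in permitted start codons). All function names are hypothetical, and only the first and last rows of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same ones.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence from this record.
first_row = "ATGACCGTTC CCTGGGTGGT TAAGTACAGA CCAAAGACCC TAGATGACGT GGAGAATCAG"
last_row = "AAGAAACCTT GA"

print(translate(first_row))  # MTVPWVVKYRPKTLDDVENQ
print(translate(last_row))   # KKP
```

The two translations match the first 20 and last 3 residues of the protein sequence listed above. Translating the last row on its own works because the preceding 1320 bases are a whole number of codons. Passing the complete gene sequence to `gc_percent` would give a value to compare with the GC content field.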