Gene Msed_0150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0150 
Symbol 
ID5105003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp121185 
End bp122981 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content45% 
IMG OID640506053 
ProductATP-dependent DNA ligase 
Protein accessionYP_001190251 
Protein GI146302935 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase 
TIGRFAM ID[TIGR00574] DNA ligase I, ATP-dependent (dnl1) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.132861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.011821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTA AGCTTATCGC AGAGTATTTC GACAGACTTG AGAAAATATC CTCTAGAATA 
CAGCTAACAT CTCTTCTTTC GGACCTCTTT AAGAACACTG AAAGGGAAGT GATAGATAAG
GTAGTGTACC TTATCCAGGG AAGGCTCTGG CCAGACTTTA CAGGAATGCC TGAGATAGGA
ATGGGAGAGA AATTTCTGAT TAAGGCAATA GCAATGGCCT ATGGAAATAA GGAAGAGGAA
GTTGAAAAAC TATACAAGAA TATAGGTGAT CTAGGTGAAG TTGCCTATTC CTTGAGGAGT
AAGGTAAAGG GCGTGAGCAT TCTCTCATTC GTCGGAGGAA ATCAGGAGGC CGGAGAACTT
GACGTGATGG AGGTATATAA CGAGCTAGTA AAGATAGCCA CTAGCACTGG GGAGGGAAGC
AGGGACATTA AGATAAGGAT TTTCGCTGGT CTGATAAAGA AGGCAACCCC CATAGAGGCC
AAGTATCTCG TCAGATTTGT TGAGGGGAGG CTAAGGCTAG GGATAGGAGA TGCGACGGTA
CTGGATGCGT TGGCCATAAC CTTTGGTGGA TCTGCAGACT ATAGGCCAAT AGTCGAAAGG
GCATACAACT TAAGGGCTGA CCTTGGCGAT ATAGCCAGAG TAATAGCTAC TGAGGGAATA
GAGAAACTGA AGAATATTTC CCCAACCCCT GGGATTCCAA TCAGACCAAT GTTGGCTGAA
AGGTTGCCCG ATCCAGAGGA GATAATGGAG AAAATGAACG GAAAGGCGCT AGTGGACTAC
AAATATGATG GTGAGAGGGC TCAGATTCAT AGAAAGGGAG ATAAGGTCAC CATTTTCTCT
AGACGAATGG AAAATATAAC TGACCAGTAC ATTGATGTAA CAGAGTACGT GAAACAATTC
GTTAAGGGTG ACAACTTCAT TGTGGAGGGT GAGATTGTAC CTGTTGATCC AGAGAGCGGT
GAAATGAGAC CCTTCCAAGA GCTTATGCAT AGGAGAAGGA AAAATAACAT AGCGGAAGCC
ATAAAGGAAT ATCCGGTCAA CCTATTCCTC TTTGATCTAA TGTTCTTTGA GGGAGAGGAT
TACACCACGA AGCCTCTCCC AGAGAGGAGG GCCAAACTTG AGGAGATTCT TGCAAGTAAC
GACAAGGTTC ATATCGCGTC ACATATAATT GCTGATAGAG TGGATAAGCT AAGGGAGTAC
TTCTATCAAG CAATATCTGA GGGTGCAGAG GGAGTTATGG TTAAGTCGAT TGGACCAGAC
TCCATATACC AGGCAGGGTC CAGAGGGTGG CTATGGATAA AGTTAAAGAG GGATTACCAG
AGCGAAATGG CTGATACGGT AGATCTAGTG GTAGTTGGTG CCTTTTACGG TAAGGGTAAA
AGGGGAGGTA AGTTTAGCTC CTTGCTTATG GCAGCTTACA ACCCAGAGAA GGATGTGTTT
GAGACGGTTT GTAAGGTTGC CTCTGGTTTC AGTGACCAGG AACTAGATGA GATGCAGAAG
AAGATAAACG AACTCAAGAG GGAGCAGAAG CATCCAAGGG TTGTATCCGA CATGATCCCT
GATGTCTGGG TATCGCCAAC CCTAGTAGCT GAGGTGATAG GTGCCGAAAT CACGATTTCT
CCCTTGCATA CCTGTTGCAG AGGCGAAAAA GGAGGACTAT CCATACGTTT CCCAAGATTT
ATCAGATGGA GGGATGACAA AAGTCCAGAG GATGCTACCA CTAACCAGGA GATAATGGAG
ATGTACTCGA AACAGCTCAA GAAAATAGAG GAAAAACCTG TAGATGAAAA TATCTAG
 
Protein sequence
MKFKLIAEYF DRLEKISSRI QLTSLLSDLF KNTEREVIDK VVYLIQGRLW PDFTGMPEIG 
MGEKFLIKAI AMAYGNKEEE VEKLYKNIGD LGEVAYSLRS KVKGVSILSF VGGNQEAGEL
DVMEVYNELV KIATSTGEGS RDIKIRIFAG LIKKATPIEA KYLVRFVEGR LRLGIGDATV
LDALAITFGG SADYRPIVER AYNLRADLGD IARVIATEGI EKLKNISPTP GIPIRPMLAE
RLPDPEEIME KMNGKALVDY KYDGERAQIH RKGDKVTIFS RRMENITDQY IDVTEYVKQF
VKGDNFIVEG EIVPVDPESG EMRPFQELMH RRRKNNIAEA IKEYPVNLFL FDLMFFEGED
YTTKPLPERR AKLEEILASN DKVHIASHII ADRVDKLREY FYQAISEGAE GVMVKSIGPD
SIYQAGSRGW LWIKLKRDYQ SEMADTVDLV VVGAFYGKGK RGGKFSSLLM AAYNPEKDVF
ETVCKVASGF SDQELDEMQK KINELKREQK HPRVVSDMIP DVWVSPTLVA EVIGAEITIS
PLHTCCRGEK GGLSIRFPRF IRWRDDKSPE DATTNQEIME MYSKQLKKIE EKPVDENI