Gene Nmar_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1037 
Symbol 
ID5773689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp910852 
End bp912618 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content33% 
IMG OID641316679 
ProductDNA ligase I, ATP-dependent Dnl1 
Protein accessionYP_001582371 
Protein GI161528545 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase 
TIGRFAM ID[TIGR00574] DNA ligase I, ATP-dependent (dnl1) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTTT CTATCTTAGC TGATTCGTTT AACAAGATGG AATCAACTAG AAAAAGATTA 
GAACTAACAC AGTACTTGGT AGAATTATTT AAAAAAACTC CACAAGAAGT GATTTCAAAG
ATAGTCTATT TACTTCAAGG AAAACTAAGA CCAGACTTTG AAGGAGTGGA GTTGGGAGTT
GCAGAAAAAC TTGCAATAAG AGCAATCTCA AAATCTTCAG GAATACCAAT TAAAAAAATT
GAAGAAGAAT ACAGAAAAGG TGGAGACTTG GGGCATGCAG CCACTACAAT TCTAGAGCAA
AAAACGCAGA CAACATTTCT CGTAGAAGAC ATTACAGTTG AACGAGTCTA TGAGACATTA
TTCAAGATTG CAAAGTTAGA GGGCAATAGA TCACAAGACA TGAAGATGAA ATACATTTCA
AGCTTACTTA ATGATGCAAG TCCGTTAGAG GCAAGCTTTA TTCTAAAAAT ATTGTTAGGT
ACACTAAGAC TAGGAATTGC AGAAAATACT GTAATGGATG CATTAGCATT AGCATTTTCA
GGCAACAAAG AAAATAGAAA AATTTTGGAG CATGCATACA ATGTTTCTAG TGATTTGGGA
AAAGTTGCAG AAGTTTTAGC AACTGAAGGA TTAGCAGAAG TTGAAAAATT CAAAATAATT
TTGTTTAATC CAATCAGACC AATGCTTGCA GACAGAGTAA AGAGCGAACA AGAAGCAATT
GAAAAAATGG GGAATGAATT TGCAGCTGAA TACAAATTAG ATGGAGAAAG AGTACAACTA
CACATAGAAG GAGACAAAGT AGTTTTATTT TCAAGAAGTT TAGAAAATAT TTCAAGTTAT
TATCCAGATA TTATAGAAAA AATTCCAAAA ACAATTCAAG CAGAAAATAT TGTACTAGAG
GCAGAAGCAG TAGCAATCAA TGAAAACACA GGAGAGTTTT TGCCATTTCA AGAATTAATG
CATAGAAGAA GAAAATACAA AATAGAAAAA GCAGTTACAC AATATCCCAT AACGGTAAAT
CTCTTTGATA TCTTGTATTG TAATGGAAAG AGTTGTCTTG AATTAGACTA TAAAGAAAGA
AGAGAAAAAA TGGAAAAAGT GGTAAAAGAA GATGATTTTG TAAAGCACAT TCCCATGGCC
ATTGTCAAAA ATGAAAATGA TATTGAAGAC TTTTTTGAAA ACAGCATCAA TGCAGGAAGT
GAAGGACTAA TGCTAAAGAC GCTTGTTAGT CCATATCAAG CAGGTTCAAG AGGAAGTCAC
TGGTTAAAAC TGAAAAGAGA ATATCAAAAT GAACTTGGAG ATAGTTTAGA TCTTGTTGTG
ATAGGAGGAT TCTTTGGGAA AGGAAGACGG ACAGGAAACT ATGGAACTTT ATTGTTAGCA
ACATACGAAG AAGATGAAGA TACATTCACC AGCATTTGTA AAGTTGGAAC AGGTTTTTCA
GATGAAGATT TAGATCAATT ATATCAAATT CTAAATCCCA AAGTAACAAT CAAGAAAAAT
CCGCGTATTA ATAGTGAAAT GGAAGCAGAT GTTTGGTTTG AACCAGAATT AGTAATAGAG
GTGGTTGCAT CAGAGATTAC ACTTAGTCCA ATTCACAAAG CAGCTAGAGA CAAAATTAGA
AAGGGAGCAG GACTTGCATT GAGATTTCCA AAATTTACAG GAAAGATGAG AGTTGAAAAA
ATGGCAGAAG ATGCATCTAC TAATGAAGAA GTGATCACAT TATACCAAGG TCAGAAAAAA
GTGGCACATG ACAAAAGTCT CATGTAA
 
Protein sequence
MEFSILADSF NKMESTRKRL ELTQYLVELF KKTPQEVISK IVYLLQGKLR PDFEGVELGV 
AEKLAIRAIS KSSGIPIKKI EEEYRKGGDL GHAATTILEQ KTQTTFLVED ITVERVYETL
FKIAKLEGNR SQDMKMKYIS SLLNDASPLE ASFILKILLG TLRLGIAENT VMDALALAFS
GNKENRKILE HAYNVSSDLG KVAEVLATEG LAEVEKFKII LFNPIRPMLA DRVKSEQEAI
EKMGNEFAAE YKLDGERVQL HIEGDKVVLF SRSLENISSY YPDIIEKIPK TIQAENIVLE
AEAVAINENT GEFLPFQELM HRRRKYKIEK AVTQYPITVN LFDILYCNGK SCLELDYKER
REKMEKVVKE DDFVKHIPMA IVKNENDIED FFENSINAGS EGLMLKTLVS PYQAGSRGSH
WLKLKREYQN ELGDSLDLVV IGGFFGKGRR TGNYGTLLLA TYEEDEDTFT SICKVGTGFS
DEDLDQLYQI LNPKVTIKKN PRINSEMEAD VWFEPELVIE VVASEITLSP IHKAARDKIR
KGAGLALRFP KFTGKMRVEK MAEDASTNEE VITLYQGQKK VAHDKSLM