Gene Nmul_A1144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1144 
Symbol 
ID3784200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1315184 
End bp1317034 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content57% 
IMG OID637811229 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_411839 
Protein GI82702273 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.513301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCAA TCCAACCCCT GCCCGATCTT CTCATCAGCC AGATTGCGGC GGGAGAAGTC 
GTTGAACGTC CCGCTTCGGC GCTCAAGGAA CTACTGGAAA ACAGCCTCGA TGCGGGAGCA
ACCGAAGTAA CGGTGCAGCT TTTCCAGGGC GGTGTCAAGC TTGTGCGGGT GGCAGACAAT
GGCGCGGGAA TTTCCGGAGA AGACCTGCCC CTTGCACTTG CCCGCCACGC CACCAGCAAG
ATAAGGACTC TGGAGGATTT GCAGAATGTG CCCAGTTTGG GTTTTCGCGG CGAAGCGCTT
GCAAGCATCG CTTCCGTTTC CCGGCTGGCT CTGGTAAGCC GCAGCGCGGG AGATAAGCAC
GCCTGGCGGA TAGAGGCGGA AGCAGGACGG TTCACCCCGC CTGAGCCTGC TGCGTTCGCC
CAGGGAACAG CCGTGGAAAT GCGCGATCTC TACTTCAACA CTCCCGCCCG GCGGAAATTT
CTGAAAACAG AAGCTACCGA ATTTGCGCAT TGCGAGAGCG TGTTCAAGCG TATTGCCCTC
TCACGGCCCA GCGTCGGTTT CACCTTGCAG CACAATGGCA CTGTACGCAG CCATTTGCGG
ACAGCGGATG CAAGACTGCG CATTGCTGCA GTATTAGGAG ATGAATTCAG TCAGGCCTCG
GTATTCGTCG ATGATCAGGC AGCGGATCTA CGGTTATGGG GCATGGCCGC ACTCCCTGCC
TACAGCCGGT CTTCTGGCGA TGCCCAATAT TTCTTCGTCA ATGGGCGTTT CGTACGTGAC
AAACTCGTTG CCCACGCCCT GCGCGAAGCC TATCGCGACA TCCTCCATCT GGATCGGCAT
CCGGCGTTTG TACTATTCCT GGAAATAAAT GGGGGCGGTG TGGACGTTAA CGTGCACCCA
AGCAAAACGG AGGTGAGATT CCGCGATCCG CGTGCGCTCC ATCAATTCAT CTTTCATACG
GTTGACAAGG CTCTGGCCAT GCCGCACCTG ACGGGTGGAG CAGCAGTACC TGTAGCCGGG
AAACTTCCTG AATTCACCCG GAAGGCAGCC GGAGAAGTAT CCCTTGCACC TCCCGCTTAT
TCCCGGCAGG ATGCGATTCC TTTTGCCTCG GCGGCGCCTC AGATAAAAGC GGCTCAACCG
CAGGCTTTCT ACCAGGTGCT TTTCGGTTCC GATTCCAATG CCGGCACTCG TCAGACTGGA
TATCAGGCTG GGATCAATGG GCAATCTGCC TTCATGAAGC CGGAAGGGAC TGTTCGGGAA
CCGGAAATCC CGCCATTGGG ATTCGCGCTG GCGCAGCTTC TCGGCGTCTA CATCCTGTCG
CAGAATGAAA GAGGCCTGCT TATCGTGGAT ATGCACGCCG CGCACGAACG CATCCTGTAC
GAGAAACTCA AGTCGGCGCT CGACAACCAC ACGCTCTCCA TGCAGCCCCT GCTGATTCCG
GCCGCATTCC GGGCAGATAG CCAGGATATC GCCACGGCGG AAGAAAACAG TGCGGTTTTG
CATGATATCG GATTCGAGAT CTCCCCCTTC TCACCGGTGA TGCTCGCGGT ACGAGCCGTA
CCGGCCGCGC TCAAGGATGC CGATGTGGTG ACCCTCGCGC GTGACGTATT GAATGAAATC
CGGGAATTTG GAGGGAGTCA GGTACTTGTC AGCAGGCGAA ACGAACTCCT CGCCACCATG
GCCTGCCATG GAGCGATTCG GGCTAACCGT AGCCTGAGCA TTCCGGAAAT GAATGCACTG
CTGCGTGAAA TGGAAATCAC TGAACGTTCC GGCCAGTGTA ATCACGGCAG GCCCACATGG
TTTGAGATCA GCCGCACTGA CCTGGATAAG ATGTTCATGC GTGGCAGATA A
 
Protein sequence
MNAIQPLPDL LISQIAAGEV VERPASALKE LLENSLDAGA TEVTVQLFQG GVKLVRVADN 
GAGISGEDLP LALARHATSK IRTLEDLQNV PSLGFRGEAL ASIASVSRLA LVSRSAGDKH
AWRIEAEAGR FTPPEPAAFA QGTAVEMRDL YFNTPARRKF LKTEATEFAH CESVFKRIAL
SRPSVGFTLQ HNGTVRSHLR TADARLRIAA VLGDEFSQAS VFVDDQAADL RLWGMAALPA
YSRSSGDAQY FFVNGRFVRD KLVAHALREA YRDILHLDRH PAFVLFLEIN GGGVDVNVHP
SKTEVRFRDP RALHQFIFHT VDKALAMPHL TGGAAVPVAG KLPEFTRKAA GEVSLAPPAY
SRQDAIPFAS AAPQIKAAQP QAFYQVLFGS DSNAGTRQTG YQAGINGQSA FMKPEGTVRE
PEIPPLGFAL AQLLGVYILS QNERGLLIVD MHAAHERILY EKLKSALDNH TLSMQPLLIP
AAFRADSQDI ATAEENSAVL HDIGFEISPF SPVMLAVRAV PAALKDADVV TLARDVLNEI
REFGGSQVLV SRRNELLATM ACHGAIRANR SLSIPEMNAL LREMEITERS GQCNHGRPTW
FEISRTDLDK MFMRGR