Gene Noc_0919 details


Gene Information

Locus tag: Noc_0919
Symbol:
ID: 3707309
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1010149
End bp: 1012740
Gene Length: 2592 bp
Protein Length: 863 aa
Translation table: 11
GC content: 53%
IMG OID: 637737427
Product: DNA mismatch repair protein MutS
Protein accession: YP_342961
Protein GI: 77164436
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
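
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1010149
end_bp = 1012740
gene_length_bp = 2592
protein_length_aa = 863

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
computed_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
computed_protein_length = computed_length // 3 - 1

print("gene length matches:", computed_length == gene_length_bp)
print("length is a multiple of 3:", computed_length % 3 == 0)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions, all three checks agree with the values listed in the record.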


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid coverage fields:
Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGCAA ATGACCCTCA GAAACCCCAT ACGCCTATGA TGCAGCAGTA TCTGCGGATA 
AAGGCAGAGT ATCCCAATAC GCTCTTGCTC TATCGCATGG GGGACTTCTA CGAATTGTTT
TATGACGATG CTCAACGCGC CTCGGAACTA TTGGATATCG CGCTTACCAG CCGTGGCCGA
TCGGCCGGCG AGCCCATTCC CATGGCAGGA ATTCCCTACC ATGCGTTGGA TTCCTATTTG
GCAAGGTTAG TGCGCCAGGG GGAGTCGGTA GCCATCTGCG AGCAGATAGG CAATCCGGCG
GCAAGCAAGG GTCCCGTGGA ACGCCAAGTG GTGCGAATCA TTACCCCCGG AACGGTGACC
GAGGAAGCCC TCTTGGAAGC CCGCCGGGAT AATCTGCTAG CCGCACTTCA GAAAGAGGGG
GATGTTTTTG GATTTGCTGT GCTTGATCTT TGCAGTGGGC GCTTCAATAT TCTAGAAGTA
GCTAGTGAAT CGGCGGCTAC TAGCGAATTA GCCCGCATCC GGCCAGCGGA ACTTTTGGTA
AGCGAAGACC TAGCGCTTAT CCTGGTCGAT TCCAAAACTG AGGCGGTGGT GCGACCCTTG
CCTCCCTGGT ATTTTGATAG GGAAAGCGCC CAGCGCCAGC TATGTCGGCA GTTTGGGACT
CAAGACCTAG CCGGTTTTGG CTGCGAGGAA ATGAAAACCG CAATTGCCGC CGCCGGGTGC
CTGCTGCATT ATGTCCAGGA TACCCAACGC ACCCAATTTC CCCACATTCA CGCACTCCAA
GTCGAGCGAC AAGAAACCAG CATTATTCTG GACCCCAGCA CTCGGCGCAA TCTAGAATTA
GAAGAAAGCC TGAGCGGCGA TTCGGGCCGT AATACCTTAA TCGCGGTACT GGACCATACG
GCAACCGCCA TGGGCAGCCG CCTGCTACGG CGCTATCTCC ACCGTCCTCT GCGGGATCAA
ACCCTGCTCA AACAACGCCA ACAGGCGCTT GCTACTCTCC TAGAAGGAGG ACTGAGCGAT
GTTTTACAAA CATTACTCCG GGGAATAGGC GATATTGAAC GCATTCTCTC CCGCGTGGCC
TTGCGTTCCG CCCGTCCGCG AGATCTCGTC CAATTTCGGC AAGCCTTGGG TCTATTACCC
AAGATCCAAG AGAGCTTGTT GCAGTTAAAC AGAGACAGCC TTTTACTTCA GTCGCTACAA
GAAGATTTGG GTCCCTTTCC CAATCTCCAT GAACTGTTAC AACGGGCCAT TTGTGAAAAT
CCGCCGGTGC TCATTAGAGA TGGCGGGGTA ATTGCCCTCG GCTTTGACTC GGAACTAGAT
GAATTGCGGC ATTTAAGCGG CAATGCCGGG CAATTTTTAG TGAAATTGGA GCAGCGGGAG
CGGGAGCGCA CCAAAATCCC AACTCTCAAG GTAGGCTACA ACAAAGTTCA TGGCTACTAT
CTTGAGATCA CACGTGCTCA GGCCCATCAA GCGCCTCCTG ACTATATTCG TCGTCAAACC
TTAAAGGGGG CGGAACGCTA TATTACCCCG GAATTGAAAG GCTTTGAAGA CCAGGTGTTA
AGCGCCCGAG AACGGGCGCT GGCGCGGGAA AAAGCCCTTT ACGAGGAGCT ATTGGAACAA
TTTATGGAAC CCCTCCCCGC TTTGCGGGCC TGCGCCAACG CCTTGGCGGA GCTGGATGTG
CTTCATAACC TGGCCGAGCG GGCTAAAACC TTGGAATATG TGGCACCCCT ATTGAGCGAT
CAGCCAGGGA TATTTATCGA AAGGGGCCGC CACCCGGTGG TGGAACAAAC CCTAGAGGAT
CCTTTTGTGC CTAATGATCT GACTCTCCAT GAAGCACGGC GGATGCTGAT CATTACCGGC
CCCAATATGG GAGGAAAGTC TACGTACATG CGCCAGACAG CCTTAATTGT CTTGCTCGCC
CATATTGGCA GTTTTGTGCC AGCCCGCCGG GCTGTCATTG GCCCTATTGA CCGAATTTTT
ACCCGTATCG GCGCAGCCGA TGATCTTGCC GGGGGACGCT CCACTTTTAT GGTCGAAATG
ACCGAGACCG CCAACATTCT ACATAATGCG ACTGAGCACA GCTTAGTCTT GCTGGATGAG
GTTGGCCGGG GCACCAGTAC TTTCGACGGT CTATCTCTCG CTTGGGCAGT GGTCTCCCAC
CTGGCAAACA AAGTGCGTTC CCTAACGCTA TTTGCGACCC ATTACTTTGA GCTCACCACT
CTTCCCGAGT GTCTTCCCGG CGTGGTTAAT CTTCACCTTA CAGCAACCGA GCATAAGGAG
CATATCGTTT TTCTCCATGC GGTGAAAGAA GGTCCTGCCA GCCAAAGTTA TGGCCTTCAG
GTAGCCGCAT TGGCGGGTGT GCCCCAGGAG ATCATTGCCC AGGCGCGGCA ACAGCTTATG
GAATTAGAAA ACAATACTTG GCAAAAATCG ATCAATGGAG GCGGCCCTCA ACTAGACTTG
CTTGCGCCCC CTGCGGATCA TCCTGCCGTT CAAATACTAC AAGATTTAGA CCCCGATGAA
CTTACTCCCC GGCAAGCTTT AGAGAAACTC TACGAACTCA AACAGCTATT AGACCTTGCT
GTTACACACT AA
 
Protein sequence
MPANDPQKPH TPMMQQYLRI KAEYPNTLLL YRMGDFYELF YDDAQRASEL LDIALTSRGR 
SAGEPIPMAG IPYHALDSYL ARLVRQGESV AICEQIGNPA ASKGPVERQV VRIITPGTVT
EEALLEARRD NLLAALQKEG DVFGFAVLDL CSGRFNILEV ASESAATSEL ARIRPAELLV
SEDLALILVD SKTEAVVRPL PPWYFDRESA QRQLCRQFGT QDLAGFGCEE MKTAIAAAGC
LLHYVQDTQR TQFPHIHALQ VERQETSIIL DPSTRRNLEL EESLSGDSGR NTLIAVLDHT
ATAMGSRLLR RYLHRPLRDQ TLLKQRQQAL ATLLEGGLSD VLQTLLRGIG DIERILSRVA
LRSARPRDLV QFRQALGLLP KIQESLLQLN RDSLLLQSLQ EDLGPFPNLH ELLQRAICEN
PPVLIRDGGV IALGFDSELD ELRHLSGNAG QFLVKLEQRE RERTKIPTLK VGYNKVHGYY
LEITRAQAHQ APPDYIRRQT LKGAERYITP ELKGFEDQVL SARERALARE KALYEELLEQ
FMEPLPALRA CANALAELDV LHNLAERAKT LEYVAPLLSD QPGIFIERGR HPVVEQTLED
PFVPNDLTLH EARRMLIITG PNMGGKSTYM RQTALIVLLA HIGSFVPARR AVIGPIDRIF
TRIGAADDLA GGRSTFMVEM TETANILHNA TEHSLVLLDE VGRGTSTFDG LSLAWAVVSH
LANKVRSLTL FATHYFELTT LPECLPGVVN LHLTATEHKE HIVFLHAVKE GPASQSYGLQ
VAALAGVPQE IIAQARQQLM ELENNTWQKS INGGGPQLDL LAPPADHPAV QILQDLDPDE
LTPRQALEKL YELKQLLDLA VTH
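
The gene and protein sequences in this record are related through translation table 11. The following is a minimal sketch of that translation, not the database's own method. It assumes the standard codon-to-amino-acid mapping that table 11 shares with the standard genetic code, and it ignores the alternative start codons table 11 permits, since this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
# Codon table built in the conventional T, C, A, G base order.
# Assumption: table 11 uses the same codon-to-amino-acid mapping as the
# standard code; it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence in this record.
first_blocks = "ATGCCAGCAA ATGACCCTCA GAAACCCCAT"
last_blocks = "GTTACACACT AA"

print(translate(first_blocks))  # MPANDPQKPH
print(translate(last_blocks))   # VTH
```

The two outputs match the first block (MPANDPQKPH) and the final residues (VTH) of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way would be expected to yield the whole protein.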