Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0336 |
Symbol | |
ID | 3706507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 365383 |
End bp | 367134 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637736848 |
Product | DNA mismatch repair protein |
Protein accession | YP_342392 |
Protein GI | 77163867 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00211885 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGCTC CTTCCATCCC CCGTATTCAG ATTCTACCTC CGGCACTAGC TAACCAGATT GCTGCCGGGG AGGTGGTGGA ACGCCCCGCT TCAGTACTCA AAGAACTTGT GGAAAACGCT CTAGATGCAG GCGCCCAACG CATCGAAATT GAAACGGAGG CGGGGGGCAT TGGGCTTATC CGAGTCCGTG ATGACGGTTG CGGCATCCAT CATAATGATT TGCCTTTAGC CTTAAGCTCC CACGCTACCA GTAAAGTTCG CCACGGAGAA GAATTACTCA ATATTACTAC TTTGGGCTTT CGCGGCGAAG CCTTGGCAAG CATCGACGCC GTTTCCCGTC TTAGCCTCAG TTCCCGGATG GCAGATAATG AACATGGCTG GTGCATTCGG GAAAATACGC CGGTACAACC CATCGCCCAT CCCCTAGGCA CCACGGTTGA GGTCCGTGAT CTGTTTTACA ATACTCCAGC TCGGCGGCGC TTCCTCCGAG GGGAAAAAAC CGAGTTTATA CGTTTACGCA CGATACAAAC ACAACTAGCT CTCAGCCATT TCGAAATAAG CTTTCGAATA AGTTATAACC GGCGTCCTTT TCTCACCCTG CCTGCTTGTA CCTGCCCCCC TGAGCAGCTG AAACGGATTA CCGAACTCTG CGGACGGAAT TTTGCTGAGC ATAGTATGTA CTTCAAGCGG GAAATAGAAG GGCTATGCCT ATGGGGCTGG TTAGGACATC CTGAATTCGC CCGCAGTCAA ACCGATCTCC AATATTGTTA TGTTAACCAC CGCATGGTTC GAGACAAGCT ATTGAGCCAT GCAGCTCGCC AAGCTTATGG CAACCGCCTG TCCCAAGGGC GCCACCCCGC CTATTTACTG TATCTGGAAT TACCCACTCA TCAAGTAGAT GTCAATGCCC ACCCGGCTAA GCATGAAGTC CGGTTTCGGG AATCCCGGCA GGTTCATGGT TTTATCGTTC GCACGTTAGC AGAGATACTA GAACAAACCG AACCCGAAGG AGAGCATCGG CTAGCCTCTG GAGAATTTCG ATCACACCCC CATGAGGTGC TCGGTAAAGA ACAGGCTGGC GACACTTATC TCGTAGCAGA GGTCCCCGGA AGCTATGGTC CCCGGAAGCA TGGGAAACAT AACCCCCTAT CCAAAGGGAG AAACGATGCT CCATCACGGT TCGGTCAGGT TCAGGCATTC GTGCTCGGGC GCTATCTACT GACAGAAAAC AGCCAAGGGT TGATGCTAGT AGATTTGCCC ATAGCCCGCG CCCATCTAGC TCAAGCACGA CTGCGCACCG CCTACGCTGC CGGCCATATC ATCCGGCAAC CTTTACTTCT TCCTCTCACT TTTCAGGTTT CCCTGCAGCA GGCAGAGTGG ACAGAACGAC ATGTCCAGGA GCTGCGGAAA CTGGGTCTCG GGCTGCACCG GTTAGGACCC CAAACCGTCG TTTTGCGGGA GATACCTGCT GCCATCCGAG AGCTCGATCT CGAGGGTTTA CTACTGGCTT TACTCGCCCA ATTAACCCGC CAGCAGCACA TAATGCCCGC TGAAATCCCG CTGGGAGAGC TCATCGTCGC TCTTACGGCG CAATACCCTG CCTCAACCAC ATCCCGCCCC TCCCTCCAGG AAATGAATGC TTTCCTGCAA GAGTTGGAAA ATCTTTATCA AATCGAAACC GGCCTTAAAG CCCCCCTCCC CTGGCGGGAA TTACCCGAGC ATGAAATAGC ACAATGGTTC CTCCCAAGCT AG
|
Protein sequence | MAAPSIPRIQ ILPPALANQI AAGEVVERPA SVLKELVENA LDAGAQRIEI ETEAGGIGLI RVRDDGCGIH HNDLPLALSS HATSKVRHGE ELLNITTLGF RGEALASIDA VSRLSLSSRM ADNEHGWCIR ENTPVQPIAH PLGTTVEVRD LFYNTPARRR FLRGEKTEFI RLRTIQTQLA LSHFEISFRI SYNRRPFLTL PACTCPPEQL KRITELCGRN FAEHSMYFKR EIEGLCLWGW LGHPEFARSQ TDLQYCYVNH RMVRDKLLSH AARQAYGNRL SQGRHPAYLL YLELPTHQVD VNAHPAKHEV RFRESRQVHG FIVRTLAEIL EQTEPEGEHR LASGEFRSHP HEVLGKEQAG DTYLVAEVPG SYGPRKHGKH NPLSKGRNDA PSRFGQVQAF VLGRYLLTEN SQGLMLVDLP IARAHLAQAR LRTAYAAGHI IRQPLLLPLT FQVSLQQAEW TERHVQELRK LGLGLHRLGP QTVVLREIPA AIRELDLEGL LLALLAQLTR QQHIMPAEIP LGELIVALTA QYPASTTSRP SLQEMNAFLQ ELENLYQIET GLKAPLPWRE LPEHEIAQWF LPS
|
| |