Gene Noc_0336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0336 
Symbol 
ID3706507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp365383 
End bp367134 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content53% 
IMG OID637736848 
ProductDNA mismatch repair protein 
Protein accessionYP_342392 
Protein GI77163867 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00211885 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCTC CTTCCATCCC CCGTATTCAG ATTCTACCTC CGGCACTAGC TAACCAGATT 
GCTGCCGGGG AGGTGGTGGA ACGCCCCGCT TCAGTACTCA AAGAACTTGT GGAAAACGCT
CTAGATGCAG GCGCCCAACG CATCGAAATT GAAACGGAGG CGGGGGGCAT TGGGCTTATC
CGAGTCCGTG ATGACGGTTG CGGCATCCAT CATAATGATT TGCCTTTAGC CTTAAGCTCC
CACGCTACCA GTAAAGTTCG CCACGGAGAA GAATTACTCA ATATTACTAC TTTGGGCTTT
CGCGGCGAAG CCTTGGCAAG CATCGACGCC GTTTCCCGTC TTAGCCTCAG TTCCCGGATG
GCAGATAATG AACATGGCTG GTGCATTCGG GAAAATACGC CGGTACAACC CATCGCCCAT
CCCCTAGGCA CCACGGTTGA GGTCCGTGAT CTGTTTTACA ATACTCCAGC TCGGCGGCGC
TTCCTCCGAG GGGAAAAAAC CGAGTTTATA CGTTTACGCA CGATACAAAC ACAACTAGCT
CTCAGCCATT TCGAAATAAG CTTTCGAATA AGTTATAACC GGCGTCCTTT TCTCACCCTG
CCTGCTTGTA CCTGCCCCCC TGAGCAGCTG AAACGGATTA CCGAACTCTG CGGACGGAAT
TTTGCTGAGC ATAGTATGTA CTTCAAGCGG GAAATAGAAG GGCTATGCCT ATGGGGCTGG
TTAGGACATC CTGAATTCGC CCGCAGTCAA ACCGATCTCC AATATTGTTA TGTTAACCAC
CGCATGGTTC GAGACAAGCT ATTGAGCCAT GCAGCTCGCC AAGCTTATGG CAACCGCCTG
TCCCAAGGGC GCCACCCCGC CTATTTACTG TATCTGGAAT TACCCACTCA TCAAGTAGAT
GTCAATGCCC ACCCGGCTAA GCATGAAGTC CGGTTTCGGG AATCCCGGCA GGTTCATGGT
TTTATCGTTC GCACGTTAGC AGAGATACTA GAACAAACCG AACCCGAAGG AGAGCATCGG
CTAGCCTCTG GAGAATTTCG ATCACACCCC CATGAGGTGC TCGGTAAAGA ACAGGCTGGC
GACACTTATC TCGTAGCAGA GGTCCCCGGA AGCTATGGTC CCCGGAAGCA TGGGAAACAT
AACCCCCTAT CCAAAGGGAG AAACGATGCT CCATCACGGT TCGGTCAGGT TCAGGCATTC
GTGCTCGGGC GCTATCTACT GACAGAAAAC AGCCAAGGGT TGATGCTAGT AGATTTGCCC
ATAGCCCGCG CCCATCTAGC TCAAGCACGA CTGCGCACCG CCTACGCTGC CGGCCATATC
ATCCGGCAAC CTTTACTTCT TCCTCTCACT TTTCAGGTTT CCCTGCAGCA GGCAGAGTGG
ACAGAACGAC ATGTCCAGGA GCTGCGGAAA CTGGGTCTCG GGCTGCACCG GTTAGGACCC
CAAACCGTCG TTTTGCGGGA GATACCTGCT GCCATCCGAG AGCTCGATCT CGAGGGTTTA
CTACTGGCTT TACTCGCCCA ATTAACCCGC CAGCAGCACA TAATGCCCGC TGAAATCCCG
CTGGGAGAGC TCATCGTCGC TCTTACGGCG CAATACCCTG CCTCAACCAC ATCCCGCCCC
TCCCTCCAGG AAATGAATGC TTTCCTGCAA GAGTTGGAAA ATCTTTATCA AATCGAAACC
GGCCTTAAAG CCCCCCTCCC CTGGCGGGAA TTACCCGAGC ATGAAATAGC ACAATGGTTC
CTCCCAAGCT AG
 
Protein sequence
MAAPSIPRIQ ILPPALANQI AAGEVVERPA SVLKELVENA LDAGAQRIEI ETEAGGIGLI 
RVRDDGCGIH HNDLPLALSS HATSKVRHGE ELLNITTLGF RGEALASIDA VSRLSLSSRM
ADNEHGWCIR ENTPVQPIAH PLGTTVEVRD LFYNTPARRR FLRGEKTEFI RLRTIQTQLA
LSHFEISFRI SYNRRPFLTL PACTCPPEQL KRITELCGRN FAEHSMYFKR EIEGLCLWGW
LGHPEFARSQ TDLQYCYVNH RMVRDKLLSH AARQAYGNRL SQGRHPAYLL YLELPTHQVD
VNAHPAKHEV RFRESRQVHG FIVRTLAEIL EQTEPEGEHR LASGEFRSHP HEVLGKEQAG
DTYLVAEVPG SYGPRKHGKH NPLSKGRNDA PSRFGQVQAF VLGRYLLTEN SQGLMLVDLP
IARAHLAQAR LRTAYAAGHI IRQPLLLPLT FQVSLQQAEW TERHVQELRK LGLGLHRLGP
QTVVLREIPA AIRELDLEGL LLALLAQLTR QQHIMPAEIP LGELIVALTA QYPASTTSRP
SLQEMNAFLQ ELENLYQIET GLKAPLPWRE LPEHEIAQWF LPS