Gene Noc_1585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1585 
Symbol 
ID3705747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1765038 
End bp1766681 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content42% 
IMG OID637738065 
ProductDNA mismatch repair protein MutS-like 
Protein accessionYP_343594 
Protein GI77165069 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGTT CCTTGAAAGA ATACCTAAGA GAAATGATAC ATGGAATATA CCCGCCTATT 
TTGCAAGGAA GTGAGGCTGC GCCATCGCCC CATTCAACGC AACCGTCCCG TGTTGGAGAA
GGGGTGATTG ATGAATCCAC CTTCCAGGTG ATAGAAGCGG ATAGGCTTTT CGATGCGATA
AATACCGCCC ATACAGTGAT AGGCCAGGCC GTGCTTTATC GCTCTTTGGC CCAGCCGTTA
GCTGACATAA AAATCATCAA GGCCAAGCAA GAAGCATTAC AGGAGCTGGC GTCGAACCCT
AGCCTTCGGG AAAAAATAGA ATCATTGACA AAAAAAGCTT CTAAACGGGA AAAGTCATTT
TACCGTTTAC TTTTTAGTAA ATTTACCGGT TTTTTTGGCA GTTCGAGAGG GGATACTGAA
ATTGAGGGAT ATGGCTATGC TACTTATGAA AGGGGAACAA CTTTTATGCT CGAACTGGTT
AAGGATGCAA GGACTTTGCC TGCGCCAGAG AGTAATTATC TCAGAATTCT AATTGATGAT
CTCAAGGGGT TCGGCGCTAC TAAAATTCAT TCTTTGATGA AAGGACCCGT TTATTTAACA
GAGAGTGGAA TTAGGACGAG AGAAGAAAAA AAATGGTTTA TTCCTGCTGT AAAATTCAGG
CCAACTTTGT TTAAACCACT TTTTATACTG GCAGTATTGC TGGGAATTGT TGCGCTCTTT
ATGTATGGGC CTATGGTGCT GGGTATCTCT TTTTCTTCCT CGCCCATACT GATACTTTTT
CTCCTGCCAG CCCTCATATT TTATATGCCT ATGGTGGGTA CATTTGACCG TGACAGTTGC
ATCTATCCTT TGCAGAAACG CTACCAAGAA TCGGAAGACG TACATACTGC GCTGGAAGCT
TTGGGAAAGT TGGATGAATT GCTTGCCTTT CATCATTATG GGAAATCGTT TGGTAGCCCT
ACAGTACTGC CACGAGTTAT TGCGGCAAAA AATCATACCC TGATACTCAG GGAGGCGAAA
AATCCTATCC TGGGTAAGGA TAACCCTAAT TATGTTCCCA ATGATATTGA CCTGGATGGC
CAAAAGCTCA CCTTTATTAG CGGTCCCAAT AGCGGCGGCA AAACGGCCTT TTGCAAAACA
ATCGCTCAAA TTCAATTGCT CTCCCAAGTA GGCTGTTATG TGCCCGCGGA AGATGCTGAA
ATTTCTGTTG CTGATCGTGT TTTTTACCAA GTCCCTGAAA TTAGCTCCTT GGAAGATGTA
GAAGGGCGGT TTGGAAAAGA ACTTAAGAGA ACCAAGGATA TGTTTTTAAT GACGAGCCCA
GAGAGCTTGA TAATTTTAGA TGAATTATCG GAAGGGACGA CTCACGCAGA AAAATTGGAG
ACCTCTTTCC ATGTACTCAA CGGGTTTTAT CGAATAGGAA ATAATACGCT TTTAGTGACC
CATAACCATG AGCTGGCGGA ACGATTTAAA GAAAATAAAA TAGGTCAGTA TTTTCAGGTT
CAGTTTATAG GAGAAGGACC CACCTACAAA ATTATTGAAG GGATATCAAA AGTAAGCCAT
GCGGATAGAG TCGCCAGAAA AATAGGATTT GGGAAGGAAG ATATAGAAAG GTATTTAAAG
GAAAAGGGGT TTGTTAGCGG GTAG
 
Protein sequence
MSSSLKEYLR EMIHGIYPPI LQGSEAAPSP HSTQPSRVGE GVIDESTFQV IEADRLFDAI 
NTAHTVIGQA VLYRSLAQPL ADIKIIKAKQ EALQELASNP SLREKIESLT KKASKREKSF
YRLLFSKFTG FFGSSRGDTE IEGYGYATYE RGTTFMLELV KDARTLPAPE SNYLRILIDD
LKGFGATKIH SLMKGPVYLT ESGIRTREEK KWFIPAVKFR PTLFKPLFIL AVLLGIVALF
MYGPMVLGIS FSSSPILILF LLPALIFYMP MVGTFDRDSC IYPLQKRYQE SEDVHTALEA
LGKLDELLAF HHYGKSFGSP TVLPRVIAAK NHTLILREAK NPILGKDNPN YVPNDIDLDG
QKLTFISGPN SGGKTAFCKT IAQIQLLSQV GCYVPAEDAE ISVADRVFYQ VPEISSLEDV
EGRFGKELKR TKDMFLMTSP ESLIILDELS EGTTHAEKLE TSFHVLNGFY RIGNNTLLVT
HNHELAERFK ENKIGQYFQV QFIGEGPTYK IIEGISKVSH ADRVARKIGF GKEDIERYLK
EKGFVSG