Gene PsycPRwf_2017 details


Gene Information

Locus tag: PsycPRwf_2017
Symbol:
ID: 5206593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Psychrobacter sp. PRwf-1
Kingdom: Bacteria
Replicon accession: NC_009524
Strand:
Start bp: 2511886
End bp: 2514930
Gene Length: 3045 bp
Protein Length: 1014 aa
Translation table: 11
GC content: 49%
IMG OID: 640600249
Product: DNA mismatch repair protein MutS
Protein accession: YP_001280907
Protein GI: 148653814
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
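The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2511886
end_bp = 2514930
gene_length_bp = 3045
protein_length_aa = 1014


def span_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # 3045 1014
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.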


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGACA CCGTGCACAC CACCTCACTG CTCAATATAG CTGGCGTTGA ATACAATCTT 
GATGACCATA CCCCAATGAT GGTGCAATAT TTAACCCTCA AAGCGCAGTA TCCGAATGCG
CTCTTGCTGT ACCGTATGGG CGATTTTTAT GAGCTTTTTT TTACCGATGC TCAGCGAGCG
GCTGATATTT TAGACATTAC TTTAACCCGC CGCGGCAACG ATAAAGCAGG TAACAATATC
GCTATGGCCG GCGTGCCCTT TCATGCTGCC GAAAGCTATA TGGCGCGTCT TATTGCAGCG
GGCGAAACCG TGGTTATCTG TGAGCAAGTC GAGGATGCCC CTGAGCTGAA TGAAGAGCAT
GACACGAATA AGCCCAATGC GGCCAAAGGT ATCATGCGCC GTGAAGTGGT CAAAACATTA
ACCGCTGGCA CTCTCACCGA TGATGCACTT ATTGCGCCCA ATCACACCCC AAGCGTTGTG
GCACTTGATT TCAATTTAAA GAAAAATTGG CAACACCTAA AGAGTAGCTC AAATCAAGCG
AGCCTAAACC AGATAGGTAT CAGCCAGCTA GACATCAACG CCGGTACCAT TCGTACGCAA
ACCCTGCCAC TTTCTGAGGT TATTAAGCCG GCTGAGGCCA ATATTGCTTA TGTGTCTGAG
TCACAGGTGG ATTATGAGGT ACTGTTAAAG CAGCGTCTGC TCACCGTACT CAACAGATTT
GCCCCAAGTG AAGTGATTAT TCCAGAGACG TTAAGCGATG ACTGGCGCCA CTGGCTACAA
GACAAGCTAT ATTGCCCAGT TATTAGTGTG GCGGCCAGCG ACTTTCACCC CCAGCATGCC
GGCGAAACAG TATGTCGACA GTTTAATGTA CAAACCTTAG AGGGACTGGG CATCGCCGAA
AGTTTGGTGG CACAAACCAC CAGTGCCGCC TTAATCCATT ATGCTCAGCA AACCCAGCAG
CGCCATACGC CGCAGCTCAC TGAACTGATT GTTGAGCGTG ATGAAGACTA TTTAATCATT
GATGACATTA GCAGCCGCAA CCTTGAGTTA TTTACCCCCG TTAGTCCCAC GGGCACCAGT
CTATTAAGCG TGGTCAATCA GTGCCAAAGC GCCATGGGCA GACGCTTATT AATCAACCAG
CTCAAGCGCC CGTTACGTCT GGCCACTCAT GTTGAGTCGT TAGGCTTGCG TCTTGATGCC
TGCCAGTGGC TGATACAGAC ACCCACCAGC GCTGAGATAA CTCAGCTGCA GCAAAGCTTA
CACAGTATCG CTGATATTGA GCGTATCAGT AGCCGCATTG CATTACACAG CGCCAAACCC
CGAGACCTGC GACGCTTAGC AGACAGTATC AGACACAGTG AGCAGCTGGC CCACACGTTA
CAGAGCTTAG GGCTACAGGC GGACAGTGAG GGTCTACTGC CAAGCCTGCT ACAAAACTTA
CCACTGCAAA AAAATACCTT ATTAAAAATC GCTGAACACA TCGAGTCGGC TATTATAGAG
GAGCCGCCGG CACATATACG TGACGGTGGC ATGATTGCAG AGGGGTATGA CAGTGAGTTT
GATCGCCTGG TTCATCTGCA TGACAATATC CAGCAAACCC TTGATGACAT GGCTGATGAA
GCCCGTCAAA ATTATCAGCT CCCCAGCCTA AAAGTAGGCT TTAACAAAGT CAGCGGTTTT
TACTTTGAAT TACCCAAAGC GCAAGCGTCT ACAGCGCCGG ATGTTTTTAT TCGCCGCCAA
ACCCTTAAAA ACAGCGAGCG TTTTATTACT GAGCCGTTGA AGAATCTGGA GGTAGAATAT
CTAGACGCCC AAAGCTTAGC ACTCAGCTGT GAAAAGGCGT TATATCAAGG GTTACTGGTG
CGCTTAAGTG AACAACTTCA ATATTTACAA CAGCTTAGCG CAGCCATCGC TCAGATTGAC
GTGCTGCTTA ACTGGGCTGT ACTTGCCAAA GATAATGACT GGGTACGCCC CACGCTTGAC
CCAAGTGGCA GTTATTTAGA CATTCGCCAA GGCCGGCACT TAGTCGTAGA AGCCATGAGT
CAGCCCAAAG CAACCCATGG CAATAGCCAA CCCACCTCTG GGCCGCAGCA TTTTGTGGCC
AATGATTGTC AACTGGGCAC GGAATCATTT AATGAGCGGC TACTACTCAT CACTGGGCCA
AATATGGGTG GTAAATCTAC CTATATGCGC CAAACTGCCC TTATTGTGCT GCTTGCCTGC
TGTGGCAGCT TCGTGCCGGC CGCCTCAGCT ACCCTCGGTG ACATTGATCG CATTTTTACT
CGCATTGGAT CAGCCGATGA CTTAGCTGGT GGCAAATCTA CCTTTATGGT GGAAATGATT
GAGACCGCAC AAATCTTAAA TCTGGCCAGC CACTGCTCTT TGGTATTAAT GGATGAGGTC
GGCCGCGGTA CGTCCACCAC GGATGGTCTA GCGATTGCTC ATGCCTGTGC CGTCCAGCTG
TGTGAAATGG GTAGTCTCAC ATTATTTGCC ACCCACTACT TTGAATTAAC CGAGCTTAGT
CAACAACGCA ATCTTAGCGG TAAGCTAAGA AACGTTCATG TCGCCGCCAG TCACATCGAT
GGCCAGCTGT TATTACTGCA CAAAATTGAG CCAGGTGCTG CCAGCTCAAG CTTTGGGCTG
CATGTGGCAA AGATGGCCGG TATTCCTGAC AAAGTTTTAA TCGCCGCCGA ACGCTACTTA
AGCTTACAAA AATCCCTAAA CGCAGCCACC CAAGAGCGTC ATCACCCCTC AATAACGCCT
AGGGATAATC ATGCTAATTT TGATAAAGCT TATGCTAATG ACGACGATAG CCGCTACGCA
GCAGATTTAA ATAGCTCAGC TCTTGATTCT GTGACACAAG ATAGCCTGTC AACAGCGCCT
GACTTGAGCC CTCATTTATC AAGTGAATCT AAAGTGCTTA GCCCTTTAAA TACTAAGGCC
AAAGCGATTT TAGATACGCT AAATGACATC TATCCTGATG AGTTGACACC CAGACAAGCC
TTGGACTTAA TTTATGAATT AAAAAAACAA GCACGCTTGG GCTAA
 
Protein sequence
MSDTVHTTSL LNIAGVEYNL DDHTPMMVQY LTLKAQYPNA LLLYRMGDFY ELFFTDAQRA 
ADILDITLTR RGNDKAGNNI AMAGVPFHAA ESYMARLIAA GETVVICEQV EDAPELNEEH
DTNKPNAAKG IMRREVVKTL TAGTLTDDAL IAPNHTPSVV ALDFNLKKNW QHLKSSSNQA
SLNQIGISQL DINAGTIRTQ TLPLSEVIKP AEANIAYVSE SQVDYEVLLK QRLLTVLNRF
APSEVIIPET LSDDWRHWLQ DKLYCPVISV AASDFHPQHA GETVCRQFNV QTLEGLGIAE
SLVAQTTSAA LIHYAQQTQQ RHTPQLTELI VERDEDYLII DDISSRNLEL FTPVSPTGTS
LLSVVNQCQS AMGRRLLINQ LKRPLRLATH VESLGLRLDA CQWLIQTPTS AEITQLQQSL
HSIADIERIS SRIALHSAKP RDLRRLADSI RHSEQLAHTL QSLGLQADSE GLLPSLLQNL
PLQKNTLLKI AEHIESAIIE EPPAHIRDGG MIAEGYDSEF DRLVHLHDNI QQTLDDMADE
ARQNYQLPSL KVGFNKVSGF YFELPKAQAS TAPDVFIRRQ TLKNSERFIT EPLKNLEVEY
LDAQSLALSC EKALYQGLLV RLSEQLQYLQ QLSAAIAQID VLLNWAVLAK DNDWVRPTLD
PSGSYLDIRQ GRHLVVEAMS QPKATHGNSQ PTSGPQHFVA NDCQLGTESF NERLLLITGP
NMGGKSTYMR QTALIVLLAC CGSFVPAASA TLGDIDRIFT RIGSADDLAG GKSTFMVEMI
ETAQILNLAS HCSLVLMDEV GRGTSTTDGL AIAHACAVQL CEMGSLTLFA THYFELTELS
QQRNLSGKLR NVHVAASHID GQLLLLHKIE PGAASSSFGL HVAKMAGIPD KVLIAAERYL
SLQKSLNAAT QERHHPSITP RDNHANFDKA YANDDDSRYA ADLNSSALDS VTQDSLSTAP
DLSPHLSSES KVLSPLNTKA KAILDTLNDI YPDELTPRQA LDLIYELKKQ ARLG
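
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard genetic code; the alternative start codons that table 11 permits are not handled, which does not matter for this gene because it begins with ATG. The `translate` helper is a hypothetical name, not a library function.

```python
# Codon assignments in NCBI order (first, second, third base each cycle through T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so the space-separated 10-base blocks used
    in this record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGTCTGACA CCGTGCACAC CACCTCACTG"))  # MSDTVHTTSL
print(translate("GCACGCTTGG GCTAA"))  # ARLG
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed in this record.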