Gene Sala_2791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2791 
Symbol 
ID4080372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2953178 
End bp2955772 
Gene Length2595 bp 
Protein Length864 aa 
Translation table11 
GC content71% 
IMG OID638011175 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_617829 
Protein GI103488268 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.494407 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGT ACTGGGCCCT GAAGGAGAAG GCGGGCGACT GCCTGCTCTT CTATCGGATG 
GGCGATTTTT TCGAGCTGTT CTTCGACGAT GCCAAGGCGG CCGCGGCGAC GCTCGACATC
GCGCTGACCT CGCGCGGCGA ACATGACGGA CAGCCGGTGC CGATGTGCGG GGTGCCGGTT
CACGCCGCTG AATCCTATCT CGCGCGGCTG ATCCGCGCGG GGCACCGCGT CGCGATCGCC
GAGCAGGTCG AAACGCCCGC CGAAGCCAAG GCGCGCGGCG GGTCGAAGGC GCTCGTCGCG
CGCGACATTG TGCGCTTCGT CACCGCGGGC ACGCTGACCG AGGAAAGCCT GCTCGAAGGG
CGCAGCGCCA ATCGCCTCGC CGCGCTGGCA GAGGTCGGCG GCGAGCGCGA GGTGGCGATC
GCCGCCGCCG ACATCTCGAC CGGGCGGTTC GAGGTCGTCA CGGTGCGCCC CGATGCCGTT
GACGCCGAAC TCGCGCGGCT GGCGCCGTCC GAACTGCTGC TCAGCGAAAG CGCCGAAGAC
CTGCCGATCT CGTCGGCGCG GCAGGTCGTG CGTCGTCCGG CGTCGGACTT TGCCAGCACC
GCCGCACAGA AACGGCTCGA AGCATTTTTC GGCGTTCGGA CGCTCGACGG CTTCGGCCAG
TTTTCACGCA GCGAAATCGC CGCGATGGGC GCGCTGCTTG CCTATCTCGA CCATGTCGGG
ACCGGCGGCC CGACTTTCCT CCAGCCGCCG GTGCGCCACC TTGCGTCGGA CCGCATGGCG
ATCGACGCCG CGACGCGCGA AAGCCTCGAA CTCGTCCGCA CGATGGCCGG AACGCGCGAG
GGCAGCCTGC TCGGCACGAT CGACCGTACC GTCACCGCGG CGGGCGCGCG CCTGCTCGCC
GACGATCTTG CGAGCCCGCT CACCGACCGC GCGGCGATCC TCGACCGGCT CGATCTCGTC
GATGCGCTCG CGCAAGATGC GCTGTGGCGC GGCGATTTGC GCGCGGCGCT CCGTGCGCTG
CCCGACGCCG GGCGCGCGCT GGGGCGCCTC GTCGCGCGGC GCGGCGGCCC GCGCGATCTG
GCGCAGTTGC GCGACGCGCT CGGCGGCGCG CGAAGCTTGC GCGAACGGCT CGCGCGGCGC
GCCGACCTGC CGCCGCTGCT CGCGCGCCTG CTCCCCGGCC TCGACGGCCA TGGCGCGCTC
GTCGACGAGC TGACGCGCGC GCTCGTCGAA ACGCCGCCGG TCGATGCCGC GCAGGGCGGC
TATATCGCCG AAGGTTACGA TCATGCGCTC GATGCGCTGC GCGAAACCGC GCGCGACGGG
CGCAAGGCAA TTGCGGCGCT GGAAGCCGAA TATCGCGACC GCACCGGCAT CGCCTCGCTC
AAGATCCGTC ACAATGGCGT GCTCGGCTAT CATGTCGAGG TGCCCGCGAA GCACGCCGAC
GCGCTGATGG CGGAGGGGAG CGGCTTCACC CACCGCCAGA CGCTCGCGGG CGTCGTGCGC
TTCAACTCGG CCGACCTGCA CGACGCGGCG ATGCGCGTGA CGCAGGCGGG GGTCCATGCG
GTCGCCGCCG AGGCCGCGCA TCTCGAAGCG CTGACCGCAG CGGCGGTGGC GCGGCGCGAG
GCGATCGCCG CAAGCTGCGA CGTGCTCGCG CGCCTCGACG TCGCCGCCGC GCTCGCCGAC
CATGCGATGA GTCACAATTG GTGCCGTCCC GACCTTGCCG ACGTGCCCTG TCTCGAGGTC
GTCGGCGGTC GGCATCCGGT GGTCGAGGCG GCGCTGGCCA AGGCGGGCGA ACGCTTTGTG
CCCAATGATG TCTCGCTGTC GGAAACCGAC CGGCTTTGGC TCGTCACCGG GCCGAACATG
GGCGGCAAAT CGACCTTCCT GCGCCAGAAT GCGGTGATTA TCGTGCTCGC GCAGGCGGGC
GGTTTCGTTC CGGCGGCATC GGCGCGGCTC GGACTCGTCG ACCGGCTGTT CAGCCGCGTC
GGCGCGAGCG ACAATCTCGC GCGCGGACGC TCGACCTTCA TGGTCGAGAT GGTCGAGACG
GCGGCGATCC TCGCGCAGGC GACCCCCGAC AGCTTCGTCA TCCTCGACGA GGTCGGGCGC
GGCACCTCGA CCTATGACGG GCTCGCGCTC GCCTGGTCGG TGGTCGAGGC GGTGCATGAA
GTGAACAAGT GTCGATGCCT GTTCGCGACC CATTATCACG AGCTGACACG CCTCGCCGAA
ACGCTGGACG CGCTCTCGCT CCACCATGTC CGCGCGCGCG AATGGCAGGG CGATCTCGTC
CTGCTCCACG AGGTTGCCGC GGGGCCAGCC GATCGCAGCT ACGGCCTCGC CGTCGCACGC
CTCGCAGGCG TTCCCCCCGC GGTCGTCAAG CGCGCCGAAA CGGTGCTGGC AAAGCTAGAG
GCGGGGCGCG AGAAAACCGG CGGCCTCGCC GCGGGGCTCG ACGATCTGCC GCTCTTCGCC
GCGACGCTCG CCGAGGCCCC GGTGGCCGCA AGGGACGCAC TCCGCGACGC GCTCGCCGCC
ATCGACCCCG ACGCGCTCAC CCCGCGCGAC GCGCTCGATG TGCTCTATCG CCTCAAGGAC
ATTGCGCGCT CCTAG
 
Protein sequence
MAQYWALKEK AGDCLLFYRM GDFFELFFDD AKAAAATLDI ALTSRGEHDG QPVPMCGVPV 
HAAESYLARL IRAGHRVAIA EQVETPAEAK ARGGSKALVA RDIVRFVTAG TLTEESLLEG
RSANRLAALA EVGGEREVAI AAADISTGRF EVVTVRPDAV DAELARLAPS ELLLSESAED
LPISSARQVV RRPASDFAST AAQKRLEAFF GVRTLDGFGQ FSRSEIAAMG ALLAYLDHVG
TGGPTFLQPP VRHLASDRMA IDAATRESLE LVRTMAGTRE GSLLGTIDRT VTAAGARLLA
DDLASPLTDR AAILDRLDLV DALAQDALWR GDLRAALRAL PDAGRALGRL VARRGGPRDL
AQLRDALGGA RSLRERLARR ADLPPLLARL LPGLDGHGAL VDELTRALVE TPPVDAAQGG
YIAEGYDHAL DALRETARDG RKAIAALEAE YRDRTGIASL KIRHNGVLGY HVEVPAKHAD
ALMAEGSGFT HRQTLAGVVR FNSADLHDAA MRVTQAGVHA VAAEAAHLEA LTAAAVARRE
AIAASCDVLA RLDVAAALAD HAMSHNWCRP DLADVPCLEV VGGRHPVVEA ALAKAGERFV
PNDVSLSETD RLWLVTGPNM GGKSTFLRQN AVIIVLAQAG GFVPAASARL GLVDRLFSRV
GASDNLARGR STFMVEMVET AAILAQATPD SFVILDEVGR GTSTYDGLAL AWSVVEAVHE
VNKCRCLFAT HYHELTRLAE TLDALSLHHV RAREWQGDLV LLHEVAAGPA DRSYGLAVAR
LAGVPPAVVK RAETVLAKLE AGREKTGGLA AGLDDLPLFA ATLAEAPVAA RDALRDALAA
IDPDALTPRD ALDVLYRLKD IARS