Gene Information |
Locus tag | Csal_0621 |
Symbol | |
ID | 4029090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 695721 |
End bp | 698300 |
Gene Length | 2580 bp |
Protein Length | 859 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637965789 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_572682 |
Protein GI | 92112754 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
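The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(695721, 698300)
print(length)                  # 2580
print(protein_length(length))  # 859
```

Both values match the Gene Length and Protein Length fields in the record.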
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.947929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGG CCTCTGCCCA GCATACCCCG ATGATGGCCC AGTACCTGAA GATCAAGCGC GAGCATCCCG AGGTGCTGCT CTTCTATCGC ATGGGCGATT TCTACGAGCT GTTCTACGAC GACGCCAAGC GCGCCGCCTC GCTGCTCGAC ATCACCCTCA CCCAGCGCGG CCAGTCGGCG GGACAGCCGA TTCCCATGGC CGGTGTGCCC TATCATAGCG CGGAGAGCTA TCTGGCGCGG CTGGTCAAGT CGGGTGAGTC GGTGGCCATC TGCGAGCAGA TCGGCGACCC GGCCACCGCC AAGGGCCCGG TGGAGCGCAA GGTCGTGCGC ATCGTCACGC CCGGCACCCT CCATGACGAG GCCCTGCTCG ACGCGCGCCG CGACAACCTC GTCCTCGCCG TGCATCCGCA GGGAGACCGC TGGGGCCTTG CCTGGCTGGA ACTGTCCAGC GGCCACTTCA GCGTGCTCGA AGTCGATGGC GAGAGCGATC TGCTCTCCGA GATTCAACGT CTCGATCCCG CCGAGCTGCT TGCCGCGGAA AGCCTGTCGT TACCGCCGGC ACTCGCCGAG CGCCCCGGCT TCCGTCGCCA GAGCGATTGG CTGTTCGATC TGGAAAGCGC CACTCGCCTG TTATGCGACC AGTTCGGCGT GGCCGACCTG CGCGGCTTCG GCTGCGCTCA CCTCACCACG GCCCTGACCG CCGCCGGCGT GCTCATCGAC TACGCCCGCG ACACCCAGCG CTCGCGGCTG CCGCACGTCA CCGGCATCGC CGTGGAGACA CGCGACGAAG CGGTGGTGAT CGATGCCGCC AGCCGCCGCA ACCTCGAGAT CGACACCAAC CTGGGGGGCG GCTTCGACAA TACGCTGGCC AGCGTGCTCG ACACCACCGC CACCGCCATG GGATCGCGTC AGCTCAAGCG CTGGCTCAAC CGACCGCTGC GCGACATCGC CCAGATCCAG TCGCGCCAGG CCGCCGTGCA GTGCCTCATC GACGCCGACC GCCATGCCAC CCTGCGCGAC GCGCTCAAGG CCATCGGCGA CATCGAACGC ATCCTCGCGC GAGTGGCGCT CTACAGCGCC CGGCCGCGCG ACCTGGCCCG TCTGCGCGAT GCCCTCAACG CCCTGCCCGC CCTGGAACAC GATCTGGCCG AACTCGACGA GGGCACGGCC ATCGACGCGC TCAAGCACCA CATTCGTCCC TATCCCGAAT TGGCCGAGAC GCTGAGCCGG GCGCTGGTCG ACAACCCGCC CGTGGTGATC CGCGATGGCG GCGTGATCGG CACGGGCTTC GACGCAGAGC TCGACGAATA CCGCGGCCTG GCCGAGCACG CCGGGGACTA TCTGGTCGAG CTCGAGACTC GGGAACGCGA GCGCACCGGC CTGGTCGGCC TCAAGGTGGG GTACAACCGC GTCCATGGCT ACTACATCGA GATCCCCCGC GCCCAGGCCC GCGAAGCACC GGCCGAGTAC ATCCGTCGCC AGACGCTGAA AAACGCCGAA CGCTTCATCA TCCCCGAGCT CAAGGAATTC GAGGACAAGG CGCTGTCGGC CAAGTCGCGC GCGCTGGCGC GCGAAAAGCT GCTCTACGAC GGGCTGCTGG AAACGCTCAA CGTCGACCTC CAGGCGCTTC AGGGTACCGC GCGCGCGCTG GCCACCCTCG ACGTGCTGGC GTGTTTCGCC GAACGCGCGC TGGCGCTGGA TTTCGTGCGT CCACGCCTGA GCGATCAGCC CGGGCTGCGC ATTCGCGGAG GCCGCCATCC CGTGGTCGAA CACGTCAGCG ACCACCCGTT CGTGCCCAAC 
GACCTGATGC TCGATGAAAC GCGGCGCCTG CTGGTGATCA CCGGCCCCAA CATGGGCGGC AAGTCCACCT ACATGCGCCA GGCGGCCTTG ATCGCGCTGC TCGCGCACAC CGGCAGCTGC GTGCCCGCCG ACGAGGCAGA AATCGGCCCC GTGGACCGGA TCTTCACGCG CATCGGTTCC AGCGACGACC TCGCCGGCGG CCGTTCGACC TTCATGGTCG AGATGACCGA GACGGCCACC ATCCTGCACA ATGCCACCGA ACACAGCCTG GTGCTGATGG ACGAGATCGG GCGCGGCACC AGCACCTTCG ACGGCCTCTC ATTGGCCTGG GCCAGTGCCG AGCACCTGGT CGAACGGCGC GCCTTCACCC TCTTCGCCAC GCACTACTTC GAAATGACCG CGTTGACGGA GCCCTACGAC AGCGTGGCCA ACGTGCACTT GACCGCCGCC GAGCACCGGG ACGGCATCGT CTTCATGCAC CGCGTCGAGG AAGGGCCGGC GAGCCAGAGC TACGGGCTGC AGGTGGCACA ACTCGCCGGC GTGCCGCCCC GGGTCGTCGC CCGGGCACGC GAAAAACTCG CCACCCTGGA ACAACAGGAA GTTCACCAGG CCGGCACGCG TGGCGGCCTG GGTGACAGCG ACGCTGCCGC GCCGCAGCAG GCGGACCTGT TCGCCAGCGC GCCGCACCCG GTCGTCGAGG CGCTGGAAAA ACTCGACCTC GACGCGATAT CCCCGCGCCA GGCCATGGCC TTGCTCTACG AGTGGCGCGA ACAATGCTGA
|
Protein sequence | MSQASAQHTP MMAQYLKIKR EHPEVLLFYR MGDFYELFYD DAKRAASLLD ITLTQRGQSA GQPIPMAGVP YHSAESYLAR LVKSGESVAI CEQIGDPATA KGPVERKVVR IVTPGTLHDE ALLDARRDNL VLAVHPQGDR WGLAWLELSS GHFSVLEVDG ESDLLSEIQR LDPAELLAAE SLSLPPALAE RPGFRRQSDW LFDLESATRL LCDQFGVADL RGFGCAHLTT ALTAAGVLID YARDTQRSRL PHVTGIAVET RDEAVVIDAA SRRNLEIDTN LGGGFDNTLA SVLDTTATAM GSRQLKRWLN RPLRDIAQIQ SRQAAVQCLI DADRHATLRD ALKAIGDIER ILARVALYSA RPRDLARLRD ALNALPALEH DLAELDEGTA IDALKHHIRP YPELAETLSR ALVDNPPVVI RDGGVIGTGF DAELDEYRGL AEHAGDYLVE LETRERERTG LVGLKVGYNR VHGYYIEIPR AQAREAPAEY IRRQTLKNAE RFIIPELKEF EDKALSAKSR ALAREKLLYD GLLETLNVDL QALQGTARAL ATLDVLACFA ERALALDFVR PRLSDQPGLR IRGGRHPVVE HVSDHPFVPN DLMLDETRRL LVITGPNMGG KSTYMRQAAL IALLAHTGSC VPADEAEIGP VDRIFTRIGS SDDLAGGRST FMVEMTETAT ILHNATEHSL VLMDEIGRGT STFDGLSLAW ASAEHLVERR AFTLFATHYF EMTALTEPYD SVANVHLTAA EHRDGIVFMH RVEEGPASQS YGLQVAQLAG VPPRVVARAR EKLATLEQQE VHQAGTRGGL GDSDAAAPQQ ADLFASAPHP VVEALEKLDL DAISPRQAMA LLYEWREQC
|
| |
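The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for internal codons. The alternative start codons that table 11 allows are not handled, since this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
print(translate("ATGAGCCAGG CCTCTGCCCA GCATACCCCG"))  # MSQASAQHTP
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function would be the natural way to check the complete 859-residue translation.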