Gene Csal_0621 details


Gene Information

Locus tag: Csal_0621
Symbol:
ID: 4029090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 695721
End bp: 698300
Gene Length: 2580 bp
Protein Length: 859 aa
Translation table: 11
GC content: 68%
IMG OID: 637965789
Product: DNA mismatch repair protein MutS
Protein accession: YP_572682
Protein GI: 92112754
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
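
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the Gene Length includes a terminal stop codon; the function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information fields above.
length = gene_length_bp(695721, 698300)
print(length, protein_length_aa(length))  # prints: 2580 859
```

Under these assumptions the computed values agree with the reported Gene Length (2580 bp) and Protein Length (859 aa).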


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.947929
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGG CCTCTGCCCA GCATACCCCG ATGATGGCCC AGTACCTGAA GATCAAGCGC 
GAGCATCCCG AGGTGCTGCT CTTCTATCGC ATGGGCGATT TCTACGAGCT GTTCTACGAC
GACGCCAAGC GCGCCGCCTC GCTGCTCGAC ATCACCCTCA CCCAGCGCGG CCAGTCGGCG
GGACAGCCGA TTCCCATGGC CGGTGTGCCC TATCATAGCG CGGAGAGCTA TCTGGCGCGG
CTGGTCAAGT CGGGTGAGTC GGTGGCCATC TGCGAGCAGA TCGGCGACCC GGCCACCGCC
AAGGGCCCGG TGGAGCGCAA GGTCGTGCGC ATCGTCACGC CCGGCACCCT CCATGACGAG
GCCCTGCTCG ACGCGCGCCG CGACAACCTC GTCCTCGCCG TGCATCCGCA GGGAGACCGC
TGGGGCCTTG CCTGGCTGGA ACTGTCCAGC GGCCACTTCA GCGTGCTCGA AGTCGATGGC
GAGAGCGATC TGCTCTCCGA GATTCAACGT CTCGATCCCG CCGAGCTGCT TGCCGCGGAA
AGCCTGTCGT TACCGCCGGC ACTCGCCGAG CGCCCCGGCT TCCGTCGCCA GAGCGATTGG
CTGTTCGATC TGGAAAGCGC CACTCGCCTG TTATGCGACC AGTTCGGCGT GGCCGACCTG
CGCGGCTTCG GCTGCGCTCA CCTCACCACG GCCCTGACCG CCGCCGGCGT GCTCATCGAC
TACGCCCGCG ACACCCAGCG CTCGCGGCTG CCGCACGTCA CCGGCATCGC CGTGGAGACA
CGCGACGAAG CGGTGGTGAT CGATGCCGCC AGCCGCCGCA ACCTCGAGAT CGACACCAAC
CTGGGGGGCG GCTTCGACAA TACGCTGGCC AGCGTGCTCG ACACCACCGC CACCGCCATG
GGATCGCGTC AGCTCAAGCG CTGGCTCAAC CGACCGCTGC GCGACATCGC CCAGATCCAG
TCGCGCCAGG CCGCCGTGCA GTGCCTCATC GACGCCGACC GCCATGCCAC CCTGCGCGAC
GCGCTCAAGG CCATCGGCGA CATCGAACGC ATCCTCGCGC GAGTGGCGCT CTACAGCGCC
CGGCCGCGCG ACCTGGCCCG TCTGCGCGAT GCCCTCAACG CCCTGCCCGC CCTGGAACAC
GATCTGGCCG AACTCGACGA GGGCACGGCC ATCGACGCGC TCAAGCACCA CATTCGTCCC
TATCCCGAAT TGGCCGAGAC GCTGAGCCGG GCGCTGGTCG ACAACCCGCC CGTGGTGATC
CGCGATGGCG GCGTGATCGG CACGGGCTTC GACGCAGAGC TCGACGAATA CCGCGGCCTG
GCCGAGCACG CCGGGGACTA TCTGGTCGAG CTCGAGACTC GGGAACGCGA GCGCACCGGC
CTGGTCGGCC TCAAGGTGGG GTACAACCGC GTCCATGGCT ACTACATCGA GATCCCCCGC
GCCCAGGCCC GCGAAGCACC GGCCGAGTAC ATCCGTCGCC AGACGCTGAA AAACGCCGAA
CGCTTCATCA TCCCCGAGCT CAAGGAATTC GAGGACAAGG CGCTGTCGGC CAAGTCGCGC
GCGCTGGCGC GCGAAAAGCT GCTCTACGAC GGGCTGCTGG AAACGCTCAA CGTCGACCTC
CAGGCGCTTC AGGGTACCGC GCGCGCGCTG GCCACCCTCG ACGTGCTGGC GTGTTTCGCC
GAACGCGCGC TGGCGCTGGA TTTCGTGCGT CCACGCCTGA GCGATCAGCC CGGGCTGCGC
ATTCGCGGAG GCCGCCATCC CGTGGTCGAA CACGTCAGCG ACCACCCGTT CGTGCCCAAC
GACCTGATGC TCGATGAAAC GCGGCGCCTG CTGGTGATCA CCGGCCCCAA CATGGGCGGC
AAGTCCACCT ACATGCGCCA GGCGGCCTTG ATCGCGCTGC TCGCGCACAC CGGCAGCTGC
GTGCCCGCCG ACGAGGCAGA AATCGGCCCC GTGGACCGGA TCTTCACGCG CATCGGTTCC
AGCGACGACC TCGCCGGCGG CCGTTCGACC TTCATGGTCG AGATGACCGA GACGGCCACC
ATCCTGCACA ATGCCACCGA ACACAGCCTG GTGCTGATGG ACGAGATCGG GCGCGGCACC
AGCACCTTCG ACGGCCTCTC ATTGGCCTGG GCCAGTGCCG AGCACCTGGT CGAACGGCGC
GCCTTCACCC TCTTCGCCAC GCACTACTTC GAAATGACCG CGTTGACGGA GCCCTACGAC
AGCGTGGCCA ACGTGCACTT GACCGCCGCC GAGCACCGGG ACGGCATCGT CTTCATGCAC
CGCGTCGAGG AAGGGCCGGC GAGCCAGAGC TACGGGCTGC AGGTGGCACA ACTCGCCGGC
GTGCCGCCCC GGGTCGTCGC CCGGGCACGC GAAAAACTCG CCACCCTGGA ACAACAGGAA
GTTCACCAGG CCGGCACGCG TGGCGGCCTG GGTGACAGCG ACGCTGCCGC GCCGCAGCAG
GCGGACCTGT TCGCCAGCGC GCCGCACCCG GTCGTCGAGG CGCTGGAAAA ACTCGACCTC
GACGCGATAT CCCCGCGCCA GGCCATGGCC TTGCTCTACG AGTGGCGCGA ACAATGCTGA
 
Protein sequence
MSQASAQHTP MMAQYLKIKR EHPEVLLFYR MGDFYELFYD DAKRAASLLD ITLTQRGQSA 
GQPIPMAGVP YHSAESYLAR LVKSGESVAI CEQIGDPATA KGPVERKVVR IVTPGTLHDE
ALLDARRDNL VLAVHPQGDR WGLAWLELSS GHFSVLEVDG ESDLLSEIQR LDPAELLAAE
SLSLPPALAE RPGFRRQSDW LFDLESATRL LCDQFGVADL RGFGCAHLTT ALTAAGVLID
YARDTQRSRL PHVTGIAVET RDEAVVIDAA SRRNLEIDTN LGGGFDNTLA SVLDTTATAM
GSRQLKRWLN RPLRDIAQIQ SRQAAVQCLI DADRHATLRD ALKAIGDIER ILARVALYSA
RPRDLARLRD ALNALPALEH DLAELDEGTA IDALKHHIRP YPELAETLSR ALVDNPPVVI
RDGGVIGTGF DAELDEYRGL AEHAGDYLVE LETRERERTG LVGLKVGYNR VHGYYIEIPR
AQAREAPAEY IRRQTLKNAE RFIIPELKEF EDKALSAKSR ALAREKLLYD GLLETLNVDL
QALQGTARAL ATLDVLACFA ERALALDFVR PRLSDQPGLR IRGGRHPVVE HVSDHPFVPN
DLMLDETRRL LVITGPNMGG KSTYMRQAAL IALLAHTGSC VPADEAEIGP VDRIFTRIGS
SDDLAGGRST FMVEMTETAT ILHNATEHSL VLMDEIGRGT STFDGLSLAW ASAEHLVERR
AFTLFATHYF EMTALTEPYD SVANVHLTAA EHRDGIVFMH RVEEGPASQS YGLQVAQLAG
VPPRVVARAR EKLATLEQQE VHQAGTRGGL GDSDAAAPQQ ADLFASAPHP VVEALEKLDL
DAISPRQAMA LLYEWREQC
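
The sequences above are laid out in blocks of 10 characters, 60 per line, so they need the whitespace stripped before any analysis. The sketch below shows one way to do that and to compute a GC percentage for a DNA string. The helper names are hypothetical, and how the page itself derived its GC content figure (for example, its rounding) is not stated in the record.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence above, as it appears in the record.
first_line = "ATGAGCCAGG CCTCTGCCCA GCATACCCCG ATGATGGCCC AGTACCTGAA GATCAAGCGC"
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
```

Applied to the complete gene sequence, the cleaned length should match the Gene Length field (2580 bp), and the result of `gc_percent` can be compared with the reported GC content of 68%. The same `clean_sequence` helper applied to the protein sequence should give a length matching the Protein Length field (859 aa).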