Gene CHU_2648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_2648 
SymbolmutS 
ID4184770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp3029080 
End bp3031473 
Gene Length2394 bp 
Protein Length797 aa 
Translation table11 
GC content42% 
IMG OID638072640 
ProductDNA-mismatch repair protein 
Protein accessionYP_679242 
Protein GI110639033 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.270427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATATC CATCTGATCT GGAAAGTAAA ATAGGATTTG ACCAAATCCG TGAATTGCTT 
AAAACAACAT GCAAAGGATT GGCCGCGATG AATATGCTGG AAGAGTTAAC ACCTTCGTAT
GATATTGCAA CAATCGAAAA AGAATTACAG GAGACAGACG AGTTTAAAAA AATTATTGAA
TCGGGAATGT TATTTCCCGA TCGTGATTTT GTTGATATAA CGCTGCTGGC TAATAAAATT
CAGGTGCAGG GAAATTACCT GGAGATTGAA GAATTATTAC AATGCAAATT GTTTTTAACA
ACATTGATTG CCTGCCAGTC TTTTTTCAAA CGTACAGAAA CAGATATTTA TGTACGTTTG
CAGGAACGTG CCAATGCCGT TAAAATTGAT TCGTCGTTAT TCAAAGCCAT CAACCGCGTG
ATCGATGATC ATGGTGTAAT CCGTGATACG GCTTCCGATG AATTACGCAT GATCCGCGAG
GAATACCGGG AAGAACAAAT TAAGCTGCGC AAAGAGGTAG ACCGATTGCT GCGTGTGTTT
AAAAAAGAAG GCTATACCGT TGAAGATACC GAAGCAACAA TCCGTTCAGG CCGTTTGGTA
TTACCTGTAC TGGCAGAATA TAAACGTAAA GTACAGGGGA TCATTCACGA TGAGTCTGGT
ACGGGACAAA CGGTTTTTAT GGAACCTATA TCTTTATTGC CGTACAACAA CAGCGTACGC
GAACTGGAGA TCCGTGAACG TAAAGAGATC GTGCGCATAC TCATGCAGCT TACGGATTAT
ATCCGTCCGT TTGCAGAGGT GATGCTTGAA GCAAATCAAT TGGTCGGCTG GTTTGATTTT
ATCCGTGCAA AAGCAACGTT TGCGTGTTCA GTAAACGGCG TGATGCCTGG TCTGCAGAAA
CGGCCGCTCA TTAAATGGCG CAATGCACGC CACCCGTTAT TGTGGCTCAA AAATAAAAAG
ATAAAAAAAG AAGTTATGCC GTTGTCTATT CAGGTAGACG AACAAAACCG CATCTTACTT
CTTTCAGGTC CGAATGCAGG TGGTAAGTCT GTTTGTATGA AAACACTGGG CTTGCTGCAG
TACATGCTGC AATGCGGCCT GCTCATTCCC GTAGAGGAGG GCAGTGTATC CGGCGTGTTT
GAAAATTTCT TTATTGATAT CGGTGATTCA CAATCGCTGG ATAATGATCT GAGTACCTAT
AGTTCGCACA TTAAGAACCT GACGTTTTTC TTAGAACATG CAGATGCTAA AACTTTATTA
CTGATCGATG AATTCGGCAG CGGTACAGAT CCGATGTATG GTTCTGCGAT TGCTGAAGCT
GCACTTGAGA AACTGAACGA GCGCAAGTCG ATGGGTATCA TTACCACGCA CTATGCAGGC
TTAAAAGCAT TGGCATCTAA AACAGAAGGG TTGATAAATG GTTCCATGCG TTTTGATACA
GACAAATTAA TTCCGTTATA CATACTGGAT ATCGGCGTAC CGGGGAGCTC CTTCACACTG
GAGATTGCAG AGAAATCCGG TTTGGCCAAA TCACTTATTG AACAGGCACG TACAAAACTG
GATCAGGAAC AGGTTGATCT TTCAACCTTA TTAAGAGACA TTGAACGTGA GCGTACAACG
CTTCAACAAG AGATTTTATC AGGCAGGGAA CTGAAAGTAA AACATGAAAA ACTTTCAAAG
GAGTTTGAAG AAAAGCTAGC GGAGTTACAG GATAAGCGCA GACGTCTGTT GCTTGAAGCG
AAGGAAGAAG CATATCGCAT TGTTCAGAAA GCAGACGGAA AAGCAGAAGA GCTGATCCGT
TCCATTTCAA ATGCAAAAGA TAAACACGGT GCACAAAAGC ATCGTCAGGA AATACGTGAA
GTAGGAAAGG CATTGGAAAA AGAACTGGAA CCTGAAATAA ACATTGCTGA TAATGAATCG
ATCCGATACG ATTGGAAAAC AGGTGATATT GTGCGTATAC GCAGCAATGG CGCGGTAGGT
AAAATAGAAG CCGTGAAAGG TAAGCTGGCA GAATTATTTA TCGGTGATCT GAAAGCTACG
GTTCATTTTT CAGAACTAAG CAGTGCTTCT GCAAAAGAGT TGAAAGCTAA AACAGATGAA
CGCAGACCAA AAGCGGGTGG TGTTGATCTG GTACAGAAGC ATCAGATATT TTCAATGCAG
CTGGATTTGC GCGGCAAGCG GACCGAAGAA GCCATTGCAT TTACAGACCG CTGGATTAAC
GATGCATTTA TTCTGGGTAT TGAAGAAGCA CGGATTCTGC ACGGAAAAGG AGATGGTATA
CTACGCAAAA TGCTGCGCGA ACATTTAAAA CAATATAAGC AGATCGTTGC AATGAACGAT
GAACACATTG ATAGCGGCGG CGCAGGCATA ACGGTGCTGA GTATGCGGTA TTAA
 
Protein sequence
MIYPSDLESK IGFDQIRELL KTTCKGLAAM NMLEELTPSY DIATIEKELQ ETDEFKKIIE 
SGMLFPDRDF VDITLLANKI QVQGNYLEIE ELLQCKLFLT TLIACQSFFK RTETDIYVRL
QERANAVKID SSLFKAINRV IDDHGVIRDT ASDELRMIRE EYREEQIKLR KEVDRLLRVF
KKEGYTVEDT EATIRSGRLV LPVLAEYKRK VQGIIHDESG TGQTVFMEPI SLLPYNNSVR
ELEIRERKEI VRILMQLTDY IRPFAEVMLE ANQLVGWFDF IRAKATFACS VNGVMPGLQK
RPLIKWRNAR HPLLWLKNKK IKKEVMPLSI QVDEQNRILL LSGPNAGGKS VCMKTLGLLQ
YMLQCGLLIP VEEGSVSGVF ENFFIDIGDS QSLDNDLSTY SSHIKNLTFF LEHADAKTLL
LIDEFGSGTD PMYGSAIAEA ALEKLNERKS MGIITTHYAG LKALASKTEG LINGSMRFDT
DKLIPLYILD IGVPGSSFTL EIAEKSGLAK SLIEQARTKL DQEQVDLSTL LRDIERERTT
LQQEILSGRE LKVKHEKLSK EFEEKLAELQ DKRRRLLLEA KEEAYRIVQK ADGKAEELIR
SISNAKDKHG AQKHRQEIRE VGKALEKELE PEINIADNES IRYDWKTGDI VRIRSNGAVG
KIEAVKGKLA ELFIGDLKAT VHFSELSSAS AKELKAKTDE RRPKAGGVDL VQKHQIFSMQ
LDLRGKRTEE AIAFTDRWIN DAFILGIEEA RILHGKGDGI LRKMLREHLK QYKQIVAMND
EHIDSGGAGI TVLSMRY