Gene Cphy_1160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1160 
Symbol 
ID5742883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp1472040 
End bp1473704 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content34% 
IMG OID641292265 
ProductDNA mismatch repair protein MutS domain-containing protein 
Protein accessionYP_001558277 
Protein GI160879309 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCAT TTTTTATAGT ACTTGGAATC TTTTTATTTG GAGTAATATT AGCTTTGATT 
ATTCAAGAAT ATTTCTTACA AAAAAGTCGT AGACAATACA TACAAAAGAG ATTTGGTACG
ATACCACCAG AGCGTAATTA TGATTTTGAG GGAATAGCAT CCTATTGGGA AGAAATTAAA
AAACAGATTC CAAACCAAGT ACAAGTGGAT AATGTTACTT GGAATGATTT AGATATGGAC
AAAGTGTATG CAAGAATAAA TCAATGTTGT TCCTCTGTAG GTGAGGAATA CCTTTATGCA
ATACTTCATC AGCTTCATTG GTCAGAAGAA GAGCTATCCG GGTTAGAAGA AAAGATTCAG
TATTTTGAAT CAAGGGACGA GGAACGAATT AGAATGATGG AATATTTGCT CGCTTTAGGT
AAAAAACCGA GAAACTTCTT AATTCCTTTT TTAAATCATA CTGATATTTT TGAGATTCCC
CACATCCTAA TTTATCGAAT ACTTCAGTTT CTTCTATTTG GAAGTTTTAT TGGAGCATTG
GTACTACAAA ACGCATTATG GATTACGATA CTTATAGTAA TAGCGGGAAT TAATCTTGTA
GTGTTTGAAT GTCAGCACGC CAAGTATGAA ACAAGTGTTG TAACCCTATC TTATATTACA
CGATTACTTT TTACAGCGAA AAAGATTGCT AAGAGTAAAG AAACCTCATT TGAGGAGACG
TTTCATGAAT TGCATGAACC TTCTTCTAGT TTTCGCAGTT TTTCCAAAAG TGTACAATCA
CTAGAATTGC ATACACAGAC TAGACTGAGT GGGGTTGGTT TAGAATTATT ATTTGAATAT
CTTCGAGGAA TTACTTTAGT TGACTTTACG AAATACCATA AGGTTATGAA AGCCTTAAAA
GGGAAAAAGG ATTTGTTTTA TAAAATTTAC CATAGTATCG GGCAGCTTGA TTGTGCGATA
TCAATTTTAT CATTTCGTCA TAGTATACCT TTCTACTGTA AACCGGAATT TAGTTCTAGC
ACAGATTCAA TAAAAGCGAA GGAAATCTAT CATCCGCTTA TTCAGGATGC GATAACTAAC
ACGATTTCAT TTGACCGATG TATTATACTT TCAGGTTCAA ATGCATCTGG TAAATCAACT
TTTATAAAGA CGATTGCAAT CAATGCTATT CTCGCTCAAA GTATCCAAAC TTGTCTAGCG
ACACAATTTT GTATGCCAAG ATCTTCTATT GTTTCCTCAA TGGCAGTGCG AGATGATATT
ATAAGTGGCG ACAGCTATTT TATCACGGAA CTAAATTACT TAAACCGCAT TCTAAGTAAT
CTAAATGAAG ACAGAACTAC CCTTTGTTTT ATCGATGAGA TTCTACGAGG AACCAATACA
GCAGAAAGAA TAGCAGCTTC GATTGCGGTG GTAAAATATT TAGTTCAAAA AAATTGTATT
GCGATTGTAG CAACCCATGA TGTGGAACTT ACAGAAGAAT TAAAAGAAAT CTGTGACAAC
TATCACTTTA GAGAGGTTTT AAATGAGGGG GACGTAGTAT TTGATTATAA ATTGCATGAT
GGACCGACAA CAACACGCAA TGCTATTCTT CTATTAGAGC GAATGGGATA TCCGGAAGAG
ATAATTAAAA CAGCTAACCG CTATGTGAAT TCAATCAATA ATTAG
 
Protein sequence
MNSFFIVLGI FLFGVILALI IQEYFLQKSR RQYIQKRFGT IPPERNYDFE GIASYWEEIK 
KQIPNQVQVD NVTWNDLDMD KVYARINQCC SSVGEEYLYA ILHQLHWSEE ELSGLEEKIQ
YFESRDEERI RMMEYLLALG KKPRNFLIPF LNHTDIFEIP HILIYRILQF LLFGSFIGAL
VLQNALWITI LIVIAGINLV VFECQHAKYE TSVVTLSYIT RLLFTAKKIA KSKETSFEET
FHELHEPSSS FRSFSKSVQS LELHTQTRLS GVGLELLFEY LRGITLVDFT KYHKVMKALK
GKKDLFYKIY HSIGQLDCAI SILSFRHSIP FYCKPEFSSS TDSIKAKEIY HPLIQDAITN
TISFDRCIIL SGSNASGKST FIKTIAINAI LAQSIQTCLA TQFCMPRSSI VSSMAVRDDI
ISGDSYFITE LNYLNRILSN LNEDRTTLCF IDEILRGTNT AERIAASIAV VKYLVQKNCI
AIVATHDVEL TEELKEICDN YHFREVLNEG DVVFDYKLHD GPTTTRNAIL LLERMGYPEE
IIKTANRYVN SINN