Gene Cpin_3053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3053 
Symbol 
ID8359218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp3769229 
End bp3770581 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content45% 
IMG OID644965231 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_003122727 
Protein GI256422074 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTAA TGACGGATAA ACAAACACTG AATGATCTGA ATATCTTCGG TAAAAGCGGT 
ACGAACGGTG GTATATATTC CATCTTTAAC CGTACGCATA CAAGAGGAGG ATCAGAGCTG
CTGGAAGAAA TGTTCCGTAA TCCATTGTCG GAAGTAGATG CAGTGAATGC ACGTAGTACT
ACTATCCGTT TATTTGGAGC AAACGGGATC GCGTTTCCTT ATGACGCTAC CTGGTTTGAC
GGTGCAGAAC AATATCTGCT GGATACGGAT GAACGCAGCC GTTTGAGTGA TCAACAGGAC
AATCTGAAGA GAAAATTAAA CCAGCTCATA GCAGGAGACA ATGCGTATAA GGCGATTGAG
AAAGGTGTGG AAAGCCTGAT CGCGATCTTG CAGACTACTA TTCAGTTTGC CGATACAATA
CGCCGGCAGA TGACAGGCAC TCCTTACAGT AAAGAGTTGG CCGTTATAGA AGAACTTCTG
AAGGAGAATG AACTACAGGC GATCCTGCAG GAACCAGCTA AACAAAAATT ACCTTTTGCC
AAGGTGGCGG AGTATGATAA AATACTCCGC TTCCGGCAAC GGGATGTTAT GAACAGGCTG
TTACGGCTAC TTTATCAGAC AGACGTTTAT ATTTCTGTAG CACGTATCGC GGCAGAGAAG
CAATTCTCTT TCCCGCTCGC TTTGCCGCGT GAACAGCAAA CCGTTATCCT GGAGGATTTT
TACCATCCTT CACTAACGAC GCCTGTGGCG AACACCATCA ACATTTCACC ATCCAGTAAT
GTCATATTTC TGACGGGTGC GAACATGGCG GGTAAATCCA CTTTCATGAA GGCGCTGGGT
ATCTGTATGT ACCTGGGGCA GATGGGATTC CCGGTACCCG CTTCAAAAAT GGAGTTTTCA
GTGAGAGATG GCATCTTCAC CACCATCAAC CTGCCGGATA ACCTGAGCAT GGGCGCCAGC
CACTTCTATG CGGAAGTACT GCGTATCAAA AACGTGGCGA GGGAGTTAAG CAGGGATAAA
TACCTGTTTG TCATCTTTGA TGAACTGTTC CGCGGAACAA ACGTAAAAGA TGCACATGAA
GCTACAATTG CCGTTACCAG TGCAATTGCC CGCAGAAAGA ACTGCATGTT TGTAGTGTCT
ACACACATTA TTGAAGCGGG TGATGTATTG AGGGAAAAGT GTGCAAATAT TAATTTCGTA
TTTCTGCCGA CGCTGATGGA AGGTAATAAA CCTGTATACA CGCACAAGCT GCAATCAGGT
ATTACGGCTG ACAGACATGG TATGGTAATT ATTAAAAACG AAGGTATCCT GGATATCCTG
GCGAGAAAAA AAACAGTAAA TAAGAACGCA TGA
 
Protein sequence
MGLMTDKQTL NDLNIFGKSG TNGGIYSIFN RTHTRGGSEL LEEMFRNPLS EVDAVNARST 
TIRLFGANGI AFPYDATWFD GAEQYLLDTD ERSRLSDQQD NLKRKLNQLI AGDNAYKAIE
KGVESLIAIL QTTIQFADTI RRQMTGTPYS KELAVIEELL KENELQAILQ EPAKQKLPFA
KVAEYDKILR FRQRDVMNRL LRLLYQTDVY ISVARIAAEK QFSFPLALPR EQQTVILEDF
YHPSLTTPVA NTINISPSSN VIFLTGANMA GKSTFMKALG ICMYLGQMGF PVPASKMEFS
VRDGIFTTIN LPDNLSMGAS HFYAEVLRIK NVARELSRDK YLFVIFDELF RGTNVKDAHE
ATIAVTSAIA RRKNCMFVVS THIIEAGDVL REKCANINFV FLPTLMEGNK PVYTHKLQSG
ITADRHGMVI IKNEGILDIL ARKKTVNKNA