Gene Cpin_3054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3054 
Symbol 
ID8359219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp3770578 
End bp3771891 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content43% 
IMG OID644965232 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_003122728 
Protein GI256422075 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.613085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTTA TTGCAGACAA ACAAATGCTA GACGATTTGT CCTTGTTAGG TAAATTTAAT 
CCTGGTTCTG TTTTCAGTCT TTTTAACCAG GTAAAAACCA GGGGCGCCGA GAAATTACTG
GATGCCATGT TCCTGCATCC CTTATCGGAT GTGGAAGCGA TCAACAACAG AAGTGCTGTT
TTCCGCTATT TCAATCAACA TCCTGTCAGC TTCCCTTTTG ATGAAAAGCA GCTGGAACGT
ATGGAATCCT ATATGGATGA AGGCGGTGAC GGGAGTTATA TCATGGCATT ATGGGAATTA
AGCCGGAAGA AAGTAGCATC AGTACTGGTA AAAGATGATA CCTATGACCT GACCTTAACA
GGTATAGAAA GCAGCATCTC CGTATTAAAA GCCTGTGAAA AGCTACTCAA ACAGCTGGAG
CAGGAAGGAC GCGATAACGA TCAGCCATGG GTAAAATGGT CAGAGATCGT ACGCACGATC
ACTACAGATG ACCGGTTGAA AGATTTCTCT AAGCCTGCCA GGTCTTTAAT GGATAATGTA
CGGCTGCATC ATATGCTGAC AGGCGTATTT CGTAGTCAGC TGAAAACGCT CCTTGAACTG
ATCTATGAAA CAGACCTGTA CCTGGCAGTA GCAGGCGTAG CGAAGGCAAA AGGCTTCTCC
TATGCACAGG CATTACCGAA GGATCGGAAT ATACTGGAAG CTAAAGGACT GAGACATCCG
GGGCTTGATA AGGGCGTCGC TAACTCATTG TCCTTTAATG CCGGTACGAA TGTCCTGTTT
CTTACCGGTG CAAACATGGC GGGTAAGTCA ACATTGATGA AGTCGACAGG TATTCTCATT
TATCTTGCGC ATATGGGATT CCCGGTAGCA GCCACTGAAG TGAAATTTTC TATATTGGAT
GGTATTTATT CTTCCGTGAA TGTGCCCGAT GACCTGAATA AAGGATATAG TCACTTCTAT
GCGGAAGTAC TGCGCGTGAA AAAGGTAGCA GAAGAAGTAG CTACAGATAA ATCTTTGTTT
GTCATCTTTG ATGAGCTGTT CAAAGGTACA AACGTAAAAG ATGCTTATGA CGCTACCCTG
GCGGTAACTG AAGCGTTCAC TGATTTTACA AATTGTTTCT TTATCATTTC CACGCATATA
TTTGAAGTCG GACATGCATT GAATAATGGT GGATCACAGA TCGCGTTTGA ATTCCTTCCA
ACCATCATGA ACAATAACGT GCCGCAGTAT ACATATCAGC TGCAGAAAGG TATTACTACT
GACAGACAGG GCATGATCAT TATTGAGAAT GAAGGAATAC TGGATATGTT ATAG
 
Protein sequence
MSFIADKQML DDLSLLGKFN PGSVFSLFNQ VKTRGAEKLL DAMFLHPLSD VEAINNRSAV 
FRYFNQHPVS FPFDEKQLER MESYMDEGGD GSYIMALWEL SRKKVASVLV KDDTYDLTLT
GIESSISVLK ACEKLLKQLE QEGRDNDQPW VKWSEIVRTI TTDDRLKDFS KPARSLMDNV
RLHHMLTGVF RSQLKTLLEL IYETDLYLAV AGVAKAKGFS YAQALPKDRN ILEAKGLRHP
GLDKGVANSL SFNAGTNVLF LTGANMAGKS TLMKSTGILI YLAHMGFPVA ATEVKFSILD
GIYSSVNVPD DLNKGYSHFY AEVLRVKKVA EEVATDKSLF VIFDELFKGT NVKDAYDATL
AVTEAFTDFT NCFFIISTHI FEVGHALNNG GSQIAFEFLP TIMNNNVPQY TYQLQKGITT
DRQGMIIIEN EGILDML