Gene Tcr_1594 details


Gene Information

Locus tag: Tcr_1594
Symbol:
ID: 3760854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thiomicrospira crunogena XCL-2
Kingdom: Bacteria
Replicon accession: NC_007520
Strand:
Start bp: 1744784
End bp: 1747399
Gene Length: 2616 bp
Protein Length: 871 aa
Translation table: 11
GC content: 48%
IMG OID: 637786331
Product: DNA mismatch repair protein MutS
Protein accession: YP_391860
Protein GI: 78485935
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
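
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_len_bp // 3 - 1


length = gene_length(1744784, 1747399)
print(length, protein_length(length))  # prints "2616 871"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.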


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0845017
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGACA CAATCAAGCA CACCCCCATG ATGCAGCAAT ACCTTGCTGT TAAAGCAGAC 
TATCCAAATC AATTATTGTT TTATCGAATG GGGGATTTCT ACGAGCTCTT TTATGAAGAT
GCGGTCAAAG CGTCTGAATT ATTGGAAATT ACGTTGACGG CTCGCGGGAA ATCTGGCGGC
AATCCTATTC CGATGGCGGG TATTCCGCAT CATTCGGCCG AAGGGTATTT GGCAAAGTTG
GTAAAACTGG GACAATCGGT CGCGATTTGT GAACAAATTG GTGATCCGTC AATATCAAAA
GGCCCTGTAG AACGTAAAGT GGTGCGCGTC ATCACACCGG GAACCTTGGT AGAGGATGCT
TTATTAGAAG ACAAGTCTGA AAATTTACTT GCCGCCATTT TCCAGCAAGC AGACGAATAC
GGACTGGCCA CTCTGGATGT GGCCAGTGGC CGCTTTGAAG CCACATTGCT GCCAGACTCG
ACACAATTAA GTGCCGAGGT AGAACGACTC AAGCCAGCTG AAATCATCTT GCCGGACGAT
CCACTCTTTA AACAGAACCT GCCTGAAAGC ATCCAAAATC GACCAGGCCT GGTCGATTAC
CCAAGCTGGC ACTTTGAAAA AGACAGCTGT CGAAAACGCT TGATAGACCA TTTTGGCACA
CAAGATTTAG TGGCCTTCGG TTGTGACCAA CTCCCGGCCG TCATCAGCGC GGCCGGCGTG
ATTCTGCATT ACGCACAATC CATGCTTCAA AATACGCTGG CTCATGTATT CAGCCTGCAA
ACCTATCAAG CCGATGATGC GCTGGCATTG GATGCCATGA GTCGACGTAA TCTGGAACTC
GACACTAACC TGACAGGCGG TAAAAATCAC ACCTTGTTTG CCATTTTAGA CAATGCCACA
ACGGCGATGG GTAGCCGCTT GATGAACCGC TGGCTCAACC AGCCCTTGCG AAACCGAGAC
ATTATCAATG ATCGCTTCAA TGCGATTGAA GACATCATTG AACAACACAG CCAAGAAGAA
TTTCGCAGCG CGTTAAAACC CATTGGCGAC TTGGAACGCA TTTTAAGCCG CGTGTCTTTA
TATTCTGCAC GGCCTCGCGA CATCTTACAT TTAGGGCGGT CGCTAAACCA GCTCCCGGAA
CTTCAAGCCT TATTGAAGCA ACAAACGGCC AATAAATGGC AACAGTTGTC CAAACAACTT
GGTCTTTATC CGGAACTGGC AAGTCAACTA GAAACCGCGT TGGTTGAATC GCCCCCCATG
TTGATGCGAG ATGGCGGCGT TTTTGCCGAA GGTTATGACA GCGAACTGGA TGAACTCCGT
AACCTTAAAA ACCAAGCCGG CGATTATTTA TTGGCGCTGG AAGCACGTGA AAAAGAACGC
ACAGGCATCA CCACCTTAAA GGTGGGCTAT AACCGAGTGC ATGGCTATTA CATTGAAGTC
AGTAAACTGC AATCGGATAA TGTGCCGGCA GATTATGTCC GCCGCCAAAC TTTAAAAGCA
CAAGAACGTT ATATCACCCC CGAATTGAAA GAATTTGAAG ACAAAGTTCT CAGTGCGAAT
GAAAAAGCAC TGGCTCGCGA AAAGTGGTTA TATCAACAAT TATTGGAACG CTTAAACCAA
GATTTGCAAG CGCTACAACG AACCGCAGCG GCACTGGCCG AAACGGATGT TTTGGTGTCT
TTAGCCCGTC AGGCCATCAA CCTAAATTTA ACACGACCGA CCTTAAGTTC GGAACCTGGC
ATTGACATCA AACAGGGGCG TCACTTAACC GTGGAAGCGC TATCGAACCA ACCGTTTATT
CCGAACGATA CCTGTTTTGA TGAACAGCGC CGATTACAAA TTATCACCGG GCCCAACATG
GGCGGTAAAT CAACCTTCAT GCGCCAAACC GCTTTGATTG CGATTATGGC GTACATGGGA
AGCTTTGTGC CTGCTGAATC GGCCACTCTA GGACCGATTG ATCGTATTTT CACTCGCATC
GGGGCTTCGG ACGATCTGAC CTCCGGTCGC TCCACTTTTA TGGTGGAAAT GACGGAAACA
GCCAACATTC TTCATCATGC CTCACCGGAA TCTTTAATTC TGATGGACGA AGTCGGACGA
GGTACCTCAA CCTTTGACGG ACTGGCACTT GCCTGGGCCA TTGCCGAACA AATGGCGCAA
AGCATCCAAG GCTATTGCCT GTTTGCCACC CACTACTTTG AGCTCACCAC ACTGGTAGAG
CAGTTCAATA ATACGGTCAA CATTCATCTC AGCGCCATAG AACACCAGGA TAAAATTGTC
TTCATGCATC AGGTCGAAGA AGGTCCAGCC TCTCAAAGTT ACGGACTACA AGTAGCGGCT
TTAGCCGGTG TACCAACTGC CGTCATAGAC AAGGCCAAAA AACACCTACA CCGTTTGGAA
AATCAAACCG CAGCACAACA GCAAACCTCT GGCACAGCCT CTTCAGCAAA AGAATCTGTG
CAACAATTTG ATTTATTCGC TCAACCCGCT TTACCGGAAG CCATAGAAAC CATGCTGACC
GACCTGAAAG CCTTATCGGT TGATGATTTA ACACCGAGAC AAGCACTTGA AAAATTGTAC
GAAGTCACTA ATACAGTTAA AAATGCATCC GAATAA
 
Protein sequence
MADTIKHTPM MQQYLAVKAD YPNQLLFYRM GDFYELFYED AVKASELLEI TLTARGKSGG 
NPIPMAGIPH HSAEGYLAKL VKLGQSVAIC EQIGDPSISK GPVERKVVRV ITPGTLVEDA
LLEDKSENLL AAIFQQADEY GLATLDVASG RFEATLLPDS TQLSAEVERL KPAEIILPDD
PLFKQNLPES IQNRPGLVDY PSWHFEKDSC RKRLIDHFGT QDLVAFGCDQ LPAVISAAGV
ILHYAQSMLQ NTLAHVFSLQ TYQADDALAL DAMSRRNLEL DTNLTGGKNH TLFAILDNAT
TAMGSRLMNR WLNQPLRNRD IINDRFNAIE DIIEQHSQEE FRSALKPIGD LERILSRVSL
YSARPRDILH LGRSLNQLPE LQALLKQQTA NKWQQLSKQL GLYPELASQL ETALVESPPM
LMRDGGVFAE GYDSELDELR NLKNQAGDYL LALEAREKER TGITTLKVGY NRVHGYYIEV
SKLQSDNVPA DYVRRQTLKA QERYITPELK EFEDKVLSAN EKALAREKWL YQQLLERLNQ
DLQALQRTAA ALAETDVLVS LARQAINLNL TRPTLSSEPG IDIKQGRHLT VEALSNQPFI
PNDTCFDEQR RLQIITGPNM GGKSTFMRQT ALIAIMAYMG SFVPAESATL GPIDRIFTRI
GASDDLTSGR STFMVEMTET ANILHHASPE SLILMDEVGR GTSTFDGLAL AWAIAEQMAQ
SIQGYCLFAT HYFELTTLVE QFNNTVNIHL SAIEHQDKIV FMHQVEEGPA SQSYGLQVAA
LAGVPTAVID KAKKHLHRLE NQTAAQQQTS GTASSAKESV QQFDLFAQPA LPEAIETMLT
DLKALSVDDL TPRQALEKLY EVTNTVKNAS E
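
The relationship between the two sequences above can be illustrated by translating the gene sequence codon by codon. The sketch below is a minimal example and not the method used by the source database. It assumes the amino-acid assignments of translation table 11, which match the standard code for sense and stop codons, and builds the codon table by hand. The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGCTGACA CAATCAAGCA CACCCCCATG ATGCAGCAAT ACCTTGCTGT TAAAGCAGAC"
last_line = "GAAGTCACTA ATACAGTTAA AAATGCATCC GAATAA"
print(translate(first_line))  # prints "MADTIKHTPMMQQYLAVKAD"
print(translate(last_line))   # prints "EVTNTVKNASE"
```

The first call reproduces the first 20 residues of the protein sequence. The second reproduces the last 11, because the final sequence line starts on a codon boundary and ends in the stop codon TAA.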