Gene Bind_1011 details


Gene Information

Locus tag: Bind_1011
Symbol:
ID: 6199443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 1160705
End bp: 1162618
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 61%
IMG OID: 641705002
Product: DNA mismatch repair protein MutL
Protein accession: YP_001832143
Protein GI: 182677997
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
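
The coordinate and length fields above are mutually consistent, and the relationship can be sketched in a few lines of Python. This is only an illustration, not part of the record: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues in a CDS whose length includes one stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1160705, 1162618)
print(length)                     # 1914
print(protein_length_aa(length))  # 637
```

Both values agree with the Gene Length and Protein Length fields listed in the record.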


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.266056
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACGATCC GCCGTCTCGA TCCCGTCCTC ATCGATCGCA TCGCCGCCGG CGAGGTCATC 
GAGCGGCCCG CCGCGGCGGT GAAGGAACTC GTCGAAAATG CGCTCGACGC GCAGGCGAGC
GAGATTGATG TCGTGCTTGA AGGCGGCGGC AAGACGCTCA TCCGCGTCAC CGATAATGGC
TGCGGCATGA GCGCCGAGGA TCTGGAACTC TCCGTCGAGC GCCATGCCAC GTCCAAATTG
CCGGATGGCG ATCTCTTCGC CATTGCGACG CTCGGGTTTC GTGGCGAGGC TTTGCCCTCG
ATCGGTTCCG TTTCGGTTCT GTCGCTGACG AGCCGCATGG AAACGGCCAC GCATGGTGTC
GCGCTCAGCG TCGAACATGG ACGCAAACAG ACGGTCATTC CCTGCGGCCA GCCACGTGGC
ACGCGGATCG AGGTCCGTGA ACTGTTTCGC ACGACGCCGG CAAGGCTCAA ATTTTTGAAA
GGTGATCGCG CCGAGGCGCG CGCTGCCGCC GATGCCGTGC AGCGCCTGGC CATGGCGCAT
CCGACCCGGC GTTTTACCTT CACGAGCACG GATACAGCGG GCTTTGATTA TCTGCCTTGT
GCCGAGGGAC CAGAAGGCCT GCTCGCGCGT ATCGGCGCCG TGCTGGGCAA GGATTTCGAG
GCCAATGCTT TGCCGGTCGA GGCGGAACGC GAAGGGATTA TCCTCGAAGG GTTCATCGGC
CTGCCAACCT GGCATCGTGC CAATGGGCTA GCGCAATATC TTTTCGTCAA TGGCCGGCCG
GTGCGCGACA AATTGCTCAC CGGCGCTGTG CGCGCGGCCT ATATGGATTA TCTGCCAGCC
GGGCGTTATC CCGCCTTGGC TCTGTTTCTG CGGTGTGATC CGCAAGAGGT CGATGTCAAT
GTGCATCCGG CCAAGGCGGA AGTTCGTTTC CGCGATCAGG GTCTTGTGCG CGGTCTCCTT
GTTGGCGCCT TGAAACAGAC ATTGCAGCAA GCCATGCATC GGGCGACGCC TGATGGTGGC
AGAGTGGCGC TCGGCCTTCT TGCGATGCAT TCGGCGGGGC AGGGGCAGCG GCCCGTATCA
TCCGCGTCCA TGCCATCCGC GTCAAGGCAA GCGCCGACCA TGCCGCCCAG GGATTGGATC
AAGGAAGGCG TTCAGGATTG GGATTGGCGG CAATCCCCCG CGCGACCGCA AAATCCGCCC
CAAAATCCGC CCCCTGGCGA TCGTATCGAA ATGTCAGGCT CGCTGCCTTT GGGTTTTGCC
GAAACACCGG CTGGTCTGAA CGGGGATAGC GGGAAAGAGC TCAGCGAGGC GGCTTCTGAG
ACGCCGCATG ATGAGCCACT TGGATTTGCG CGGGCGCAAT TGCATGAGAC CTATATTGTC
GCGCAAACGC GCGATGGTTT CGTGCTGGTC GATCAGCATG CGGCGCATGA ACGTCTTGTC
TATGAACGTT TGAAACAGGC GCGCGCGGCG CAAACGGTCG AGCGCCAGAT CCTTCTCCTG
CCGACCATTG TCGAATTGCC TGAGGCTGAT GTCGAACGGC TGGTCGACGC GGCCTCGATG
CTGGCCGATT TTGGCCTCGT GGTTGAAAGC TTCGGTCCCG GCGCGCTAGC CGTGCGGGAA
ATTCCGATCG TTTTGAAGGA TGGCTCGGTT CCGGCCTTGA TCCATGATCT CGCGAACCAG
TTGCAAGAAG ACGACAAAGC CTTGATCCCG CTCGAACGCA AACTCGATCA TGTGCTGGCG
ACCTTTGCCT GCCATCATTC CGTGCGTGCC GGCCGTCGTC TCGGTATCGA GGAGATGAAT
GCGCTCCTGC GCGAGATGGA ACGCACGCCG GGCTCCGGCC AATGCAATCA CGGGCGCCCG
ACCTATATCG AATTGAAGCT CGGTGACATC GAGAGATTAT TCGGGCGTGG ATGA
 
Protein sequence
MTIRRLDPVL IDRIAAGEVI ERPAAAVKEL VENALDAQAS EIDVVLEGGG KTLIRVTDNG 
CGMSAEDLEL SVERHATSKL PDGDLFAIAT LGFRGEALPS IGSVSVLSLT SRMETATHGV
ALSVEHGRKQ TVIPCGQPRG TRIEVRELFR TTPARLKFLK GDRAEARAAA DAVQRLAMAH
PTRRFTFTST DTAGFDYLPC AEGPEGLLAR IGAVLGKDFE ANALPVEAER EGIILEGFIG
LPTWHRANGL AQYLFVNGRP VRDKLLTGAV RAAYMDYLPA GRYPALALFL RCDPQEVDVN
VHPAKAEVRF RDQGLVRGLL VGALKQTLQQ AMHRATPDGG RVALGLLAMH SAGQGQRPVS
SASMPSASRQ APTMPPRDWI KEGVQDWDWR QSPARPQNPP QNPPPGDRIE MSGSLPLGFA
ETPAGLNGDS GKELSEAASE TPHDEPLGFA RAQLHETYIV AQTRDGFVLV DQHAAHERLV
YERLKQARAA QTVERQILLL PTIVELPEAD VERLVDAASM LADFGLVVES FGPGALAVRE
IPIVLKDGSV PALIHDLANQ LQEDDKALIP LERKLDHVLA TFACHHSVRA GRRLGIEEMN
ALLREMERTP GSGQCNHGRP TYIELKLGDI ERLFGRG
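
The gene sequence begins with TTG while the protein sequence begins with M, which is consistent with translation table 11, where TTG can serve as an initiation codon. The following Python sketch shows one way to check the translation and the GC content from the sequence blocks above. It is an illustration only: the helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, the codon table is written out by hand from the NCBI table 11 definition, and ambiguous bases such as N are not handled (they raise a `KeyError`).

```python
# NCBI translation table 11: amino acids listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Codons that table 11 allows as initiators; an initiator is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons are translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "TTGACGATCC GCCGTCTCGA TCCCGTCCTC ATCGATCGCA TCGCCGCCGG CGAGGTCATC"
print(translate_cds(first_row))  # MTIRRLDPVLIDRIAAGEVI
```

The output for the first row of the gene sequence matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.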