Gene Bpro_3209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_3209 
Symbol 
ID4014133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp3394010 
End bp3395992 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content68% 
IMG OID637942876 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_550021 
Protein GI91789069 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.230102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCAA CCATGAGCGC CCTGCCGTCA ACCCTTTCGC CGCTCCAGAC CTCCCCGCCC 
CCGCGCAAAC CCATCCGCGA GTTGCCGGAC GAGCTGATCA GCCAGATCGC CGCCGGCGAG
GTGGTCGAAC GGCCGGCTTC GGTGGTGCGC GAACTGGTGG ATAACGCGCT GGACGCAGGG
GCCACGCAGG TGACGGTGCG ACTGCTGGCC GGCGGCGTGC GGCTGATCCT CGTGGAGGAC
GACGGCCAGG GCATCCCGCG CGAAGAATTG CCGGTGGCCC TGCGGCGCCA CGCCACCAGC
AAGATCGCCT CGCTGCAGGA CCTCGAAGCC GTGGGCACCA TGGGCTTTCG GGGTGAGGCG
CTGGCCGCCA TCAACTCGAT TGCCGACATG AGCCTGCTGT CAAGAACACT TGACGGAGCC
AGCGGGAACG CCGGCGAAGC CGCCCATGCC TGGCAACTCG ATGGCCGCAC CGGCGAGTTG
AAGCCGGCCG CGCGCTCCCG CGGCACCAGC GTGGAAGTGC GCGAACTCTT TTATGCCACC
CCGGCGCGCC GCAAGTTTTT GAAAACCGAC GCCACCGAAC TGGCCCATTG CATTGAAGCC
GTGCGCCGCC ATGCGCTGGT GCGGCCAGAC GTCGGCTTTG CCATCTGGCA CGAGGGCAAG
CTGGTGGAGC AATGGCGCGC CTGCCCTGGT GAACCGGCTG CGGCCCACAC GCAGCGTCTG
GCCGATGTGC TGGGCAGCGA CTTTGTCGAG CAATCCGTCG CGGTCTATTA CGAAAGCGCG
GCGCGGCGGA CCGACGGGCT GCCCGCAGTG CGCGTGTGGG GCCGCGCCGG CATTCCGGAT
GCTGCGCGCT CGCGCGCCGA CCAGCAGTTT GCCTATGTCA ATGGCCGCTA TGTACGCGAC
AAGGTGCTGA CCCACGCGGC ACGCAGCGCC TATGAAGACG TGCTGCACGG CCATCGCCAG
CCGGTGTATG CGCTGTATGT CGAGATGGAC CCGGCCCGCG TCGACGTGAA CGTGCACCCG
ACCAAAATCG AAGTGCGCTT TCGCGACAGC CGCGAGGTGC ACCAGGCAGT GCGCCACGCC
ACCGAAAACG CACTGGCCAC GCCTCGCTCG GCCGCTGCGG CCAGCCCCGA CGGTGCTGCA
GCCGACACTG CCGCCCCCCT GATTTCCAGC GAATTTTCAG CATCAAATAC CGGCTTTACC
CAGAAAACCT GGGGGCAGCC AACGATCAAC TTCGCAGCAA ATGGTGGCCA CCGGGCGTCG
GACTTCGAGG CGATGTGGCC GGTACCGGTG CAGCCTGGCA GGCCAGCGGC AAGCGACGGC
TTTTCACCCT CGCCAAGCCT CCCCCAAGGC GCCTCCTCCG CCGGCCCGGC AGACAGCCTG
CCGCCCGGCG ACTGGCCGCT GGGCCGCGCC ATCGCCCAAC TGCAGGGCAT TTACGTACTG
GCCGAGAACG CGCAGGGCCT GGTCATCGTG GACATGCACG CGGCCCACGA GCGCATCGTC
TATGAACGCC TGAAAAGCCA GATGGACAGC AGCGAAGGCG CGCACATTGC CAGCCAGCCC
CTGCTGATTC CGGCCACCTT TGCCGCCAGC CCGCAGGAAG TGGCCACCGC CGAGGCTTGC
ATTGAAACGC TGGCCACCCT GGGCCTGGAA ATCACGCCGT TTTCCCCCAG GACCCTGGCG
GTGCGCGCCG TGCCGACCAG CCTGGCACAG GGGGACGCGG TGGAACTGGC GCGTAGCGTG
CTGGCCGAGC TGGCCCAGCA CGACGCCAGC ACCGTGATCC AGCGGGCCCA GAATGAGCTG
CTCTCTACCA TGGCCTGCCA TGGCGCCGTG CGGGCCAACC GCAAGCTCAC GATTGACGAG
ATGAACGCCC TGCTGCGCCA GATGGAAGCC ACCGAGCGCT CCGACCAGTG CAACCACGGG
CGGCCCACCT GGCGGCAGGT GAGCATCCGG GAGCTGGACG CCCTGTTTCT GCGCGGGCGC
TGA
 
Protein sequence
MAATMSALPS TLSPLQTSPP PRKPIRELPD ELISQIAAGE VVERPASVVR ELVDNALDAG 
ATQVTVRLLA GGVRLILVED DGQGIPREEL PVALRRHATS KIASLQDLEA VGTMGFRGEA
LAAINSIADM SLLSRTLDGA SGNAGEAAHA WQLDGRTGEL KPAARSRGTS VEVRELFYAT
PARRKFLKTD ATELAHCIEA VRRHALVRPD VGFAIWHEGK LVEQWRACPG EPAAAHTQRL
ADVLGSDFVE QSVAVYYESA ARRTDGLPAV RVWGRAGIPD AARSRADQQF AYVNGRYVRD
KVLTHAARSA YEDVLHGHRQ PVYALYVEMD PARVDVNVHP TKIEVRFRDS REVHQAVRHA
TENALATPRS AAAASPDGAA ADTAAPLISS EFSASNTGFT QKTWGQPTIN FAANGGHRAS
DFEAMWPVPV QPGRPAASDG FSPSPSLPQG ASSAGPADSL PPGDWPLGRA IAQLQGIYVL
AENAQGLVIV DMHAAHERIV YERLKSQMDS SEGAHIASQP LLIPATFAAS PQEVATAEAC
IETLATLGLE ITPFSPRTLA VRAVPTSLAQ GDAVELARSV LAELAQHDAS TVIQRAQNEL
LSTMACHGAV RANRKLTIDE MNALLRQMEA TERSDQCNHG RPTWRQVSIR ELDALFLRGR