Gene Bcep18194_A3853 details

Gene Information

Locus tag: Bcep18194_A3853
Symbol: mutL
ID: 3749037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007510
Strand:
Start bp: 759905
End bp: 761890
Gene Length: 1986 bp
Protein Length: 661 aa
Translation table: 11
GC content: 70%
IMG OID: 637762131
Product: DNA mismatch repair protein
Protein accession: YP_368096
Protein GI: 78065327
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
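
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (span_length, coding_codons, gc_fraction) are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 759905
END_BP = 761890
GENE_LENGTH_BP = 1986
PROTEIN_LENGTH_AA = 661


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Translated codons, assuming one terminal stop codon is excluded."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks hold for this record: 761890 - 759905 + 1 = 1986 bp, and 1986 / 3 - 1 = 661 aa. gc_fraction can be applied to the gene sequence from the Sequence section to compare against the reported GC content.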


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.558268
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.998912
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGATA TCACCGAAAC GGCGGCGGGC GCCGCGCCCG CCCCTGCTCC GCGCCCGCTG 
CGCGCGATCC AGCCCCTGCC CGACCAGCTG ATCAGCCAGA TCGCCGCCGG CGAGGTCGTC
GAACGCCCGG CGTCGGTCGT CAAGGAGCTG CTCGAGAACG CGATGGACGC CGGCGCCAGC
ACGCTGCGCA TCGTGCTGGA AGAAGGCGGC GTCAAGCGCA TCTCGATCAC CGACGACGGC
TGCGGGATTC CGCCGGACGA GCTGGCGCTC GCGCTGATGC GCCACGCGAC CAGCAAGATC
CGCTCGCTCG AGGAACTCGA GGCAGTCGCC ACGCTCGGGT TCCGCGGCGA AGCGCTGGCG
TCGATCGCGT CGGTCGCCGA GATGTCGATC ACGAGCCGGA CGGCCGATGC CGCGCATGCG
ATGAAGATCG ACGCGCAGAC GGGCGTGCTG TCCCCCGCCG CCGGCTCGAC CGGCACGACG
ATCGAGGTCC GCGAGCTGTA CTTCAATACG CCCGCGCGCC GCAAGTTCCT GAAGAGCGAG
CAGACCGAGT TCGGACACTG CCTGGAAATG ATCCGCCGCG CAGCGCTCGC GCGGCCGGAC
GTCGCGATCT CGGTACTGCA CAACGGCAAG GCGGTCGAGC ACTGGAATGC GACCGAGCCC
GCGCAGCGCG TCGCGAAAAT CCTCGGCGAC AGCTTCGCGA CCGCGCACCT GCCGCTCGAC
GAGCAGGCCG GCCCGCTTGC GGTCTACGGC TGCGCAGGCC TGCCGACCGC GAGCCGCGGG
CGCGCCGACC AGCAATACTT CTTCGTCAAC GGCCGCTTCG TGCGCGACAA GCTGCTGACG
CACGCGGTGC GCGCTGCCTA TGAGGACGTG CTGCACGGCG ACCGCTACCC GTCTTACGTG
CTGTTCCTCG ACCTGCCGCC CGAAGCCGTC GACGTGAACG TGCACCCGTC GAAGATCGAG
GTGCGTTTCC GCGATTCGCG CTCGATCCAC CAATATGTTT TCCATGCGGT CCAGCGCGCG
CTGGCGCGCC ACGCGGGCGC GTCGCCGGAG ACCACGGCGG GCGGCCATGC CGCGCAACTC
TCACCGGCGC CGCGCGGGCC CGCTTCGTTT CTGGACACCC CGCTCGGCCA GAGCCAGCAG
GGCAACGCGA TCGGCGGCAG CGGCTTCTCG CCGTCGTCGT CGTCATCGTC GGGCAACACC
TGGATGCGCC AGGCGCGGAT GACCCAGGGC ACGCTGCCCG TCGCGCAGCC GCTCGCGCTC
TATGACGCGC TGTTCGGCCG CAAGGACACG GGCGCGGGCA CGCCGGACGG CACGACGACC
ATCGCCCGCG ATTCGGCCGA CGCGCCGCTT GCGCCGCTGC CGGGCTTCCT GGCTTCGCCG
ATCGCGGCCA CTGCTCACGA CGAGCAGCCG CTCGGCTTCG CGCTCGGCCA GATCCACGGC
ATCTACGTGC TCGCGCAGAA CGCGCACGGC CTCGTGATCG TCGACATGCA CGCAGCCCAC
GAGCGGATCC TGTACGAACA GTTCAAGAAC GCGCTTGCCG ACCGCTCGGT CGCCGTGCAG
GCGCTGCTGC TGCCGATCTC CATGACGGCC ACGCCGGTCG AGATCGGCAC GGTCGAGGAA
GAACGCGACA CGCTCGAATC GCTCGGCTTC GACCTGGCGG TACTGTCGCC GACGACGCTC
GCGATCCGCG CGGTCCCGGC GTTGCTGAAG GATGCCGACC TGCAGTCGCT CGCGCGCGCG
GTGCTCGCCG ATCTTCACGC GTTCGGCGGC TCTCGCGTGC TCACCGAGCG TCAGCACGAA
CTGCTCGGTA CGCTCGCCTG CCATCACGCG GTACGTGCGA ACCGGCGCCT GACGCTCGAC
GAGATGAATG CACTGCTGCG GCAGATGGAG GCGACCGAAC GCGCGGACCA GTGCAATCAC
GGCCGGCCGA CGTGGTATCA GCTCACGCTG AACGATCTCG ACCGCCTCTT CATGCGCGGC
CAATGA
 
Protein sequence
MSDITETAAG AAPAPAPRPL RAIQPLPDQL ISQIAAGEVV ERPASVVKEL LENAMDAGAS 
TLRIVLEEGG VKRISITDDG CGIPPDELAL ALMRHATSKI RSLEELEAVA TLGFRGEALA
SIASVAEMSI TSRTADAAHA MKIDAQTGVL SPAAGSTGTT IEVRELYFNT PARRKFLKSE
QTEFGHCLEM IRRAALARPD VAISVLHNGK AVEHWNATEP AQRVAKILGD SFATAHLPLD
EQAGPLAVYG CAGLPTASRG RADQQYFFVN GRFVRDKLLT HAVRAAYEDV LHGDRYPSYV
LFLDLPPEAV DVNVHPSKIE VRFRDSRSIH QYVFHAVQRA LARHAGASPE TTAGGHAAQL
SPAPRGPASF LDTPLGQSQQ GNAIGGSGFS PSSSSSSGNT WMRQARMTQG TLPVAQPLAL
YDALFGRKDT GAGTPDGTTT IARDSADAPL APLPGFLASP IAATAHDEQP LGFALGQIHG
IYVLAQNAHG LVIVDMHAAH ERILYEQFKN ALADRSVAVQ ALLLPISMTA TPVEIGTVEE
ERDTLESLGF DLAVLSPTTL AIRAVPALLK DADLQSLARA VLADLHAFGG SRVLTERQHE
LLGTLACHHA VRANRRLTLD EMNALLRQME ATERADQCNH GRPTWYQLTL NDLDRLFMRG
Q