Gene Mpe_A1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1960 
SymboluvrC 
ID4784746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2097535 
End bp2099556 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content69% 
IMG OID640090530 
Productexcinuclease ABC subunit C 
Protein accessionYP_001021153 
Protein GI124267149 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0403452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCG CTTCCCCCGA CGACACGGTA CCTCCGCCCG ACGCCGCCGC AACAGCGGCA 
GCCAGCGACG AGGCGCGGGA CCGCCTGGCG ATCGAGGTGG CGGCCTTGCC CAGCCTGCCG
GGCGTCTACC GCTACTTCGA CGCGCAGGGG ACGCTGCTCT ATGTGGGCAA GGCCGGCAAC
CTGAAGAAGC GGGTGTCGAG CTACTTTCAG AAGAACCATG GCGGCACGCG CACCGGCCAC
ATGGTCGGCA GGATCGCTCG GCTGGAGACC ACGGTGGTGC GCTCCGAGGC CGAGGCGCTG
CTGCTCGAGA ACAACCTGAT CAAGACGCTG AACCCGAAGT TCAACATCCT GTTCCGCGAC
GACAAGAGCT ATCCCTACCT CAAGATCACC GGCGCCGCGG TCGCCGGCGG GCCGTCGCCT
GCCGCTGGCA CGGCGGCCGC CGTCGACTAT CCGCGCATGG CCTACTACCG CGGTGGGGTC
GACAAGCGGC ATCGCTACTT CGGCCCCTAC CCGAGCGCCT GGGCTGTGAA GGAATCGATC
CAGTTGCTGC AGAAGGTGTT CCGCCTGCGC ACCTGCGAGG ACACGGTGTT CAACCACCGC
ACGCGGCCCT GCCTGCTGTA CCAGATCAAG CGCTGCTCGG GGCCCTGCGT GCAGGCCATC
GGGGCCGCGG ACTACGCACG CGATGTCCGC AACGCCGAGC GCTTTCTGCT CGGTGAGCAG
CAGGAGGTGA TGGGTACGCT GCAGGCGCAG ATGATGGCGT ACGCCGATGC GCTGGAGTTC
GAGAAGGCCG CCGAGATCCG GAATCAGCTC GGGGCCCTGT CGCGGGTGCT GCACCAGCAG
TCGGTCGAGA CCAACGCCGA CGGCGCCTTC GACAAGGACG CCGACATCCT GGCCGCGGTG
GTGCAGGGGG GGCGCGCCTG CGTCAATCTG GCCATGGTGC GGGGCGGCCG GCACCTGGGC
GACCGGCCCT ATTTCCCGAC GCATGTCGAG GAGGCGAGTG CACTCGAGGG GATGGAGGAC
GCCGCCGGGC CGGTCGCCGC GGTGGCCCCG CCCCAGGCCC GCGTCGAGGT GCGGGTACTG
GAGGCCTTCA TCGCCCAGCA CTACCTCGAC GGTGCGCCAC CGCCACTGTT GATCACCAGC
CATGCGATCG ACAAGGCGCT CGTCGAGGCG CTGACCGACG CCAGCGGCGT GCGCGTTACG
GCCCAGCACC AGCCGCGCGA CCAGCGCCGG CAGTGGCTGG AGATGGCGGA GAAGAACGCA
CGCATCAAGC TCACGCAGTT GCTGGGCGAG GAAGGCTCGC AGCAGGCCCG CACGCGGGCC
CTGGTCGATG CGCTCGACCT GGCACCCGAC CAGCTGGAGC GCTTCCGCAT CGAATGCTTC
GACATAAGTC ACACGGCCGG CGAGGCGACC CAGGCCTCGT GCGTGGTATT CGAGGAGCAC
AAGATGCAGC CGGCCCAGTA CCGGCGTTAC AACATCACCG GCATCACCCC CGGCGACGAC
TACGCGGCCA TGCGCCAGGT GCTGACGCGC CGCTACGCCA AGCTCGCCGA GGCATCCGCC
CAGGGAACGG CGCGGCTGCC CGACCTCGTG CTGGTGGACG GCGGCAAGGG CCAGGTGGCG
ATGGCGCGCG ACGTGTTCGA GGAGCTCGGG CTCGACCTGT CCCTGATCGT CGGGGTCGAG
AAGGGGGAAG GCCGCAAGGT CGGCCTGGAA GAACTGGTAT TCGCCGATGG CCGCGCCAAG
GTCTACCTTG GCAAGGACTC GGCCGCGCTG ATGCTGGTGG CACAGATCCG TGACGAGGCC
CACCGCTTCG CGATCACCGG CATGCGCGCC CGGCGTGCCA GTGTGCGCAC CGGCGGCAGC
CGGCTGGAGG ACATCGCCGG CATCGGGCCC AAGAAACGTG CGAAGCTGCT GCAGCGCTTC
GGCGGCGCAC GCGGCGTGGC CAGTGCGAGC GTGGACGACC TGTCGAGCGT CGATGGCATT
TCGAGAGAAC TGGCGGAGGA GATCTACCGT GTCCTGCACT GA
 
Protein sequence
MTAASPDDTV PPPDAAATAA ASDEARDRLA IEVAALPSLP GVYRYFDAQG TLLYVGKAGN 
LKKRVSSYFQ KNHGGTRTGH MVGRIARLET TVVRSEAEAL LLENNLIKTL NPKFNILFRD
DKSYPYLKIT GAAVAGGPSP AAGTAAAVDY PRMAYYRGGV DKRHRYFGPY PSAWAVKESI
QLLQKVFRLR TCEDTVFNHR TRPCLLYQIK RCSGPCVQAI GAADYARDVR NAERFLLGEQ
QEVMGTLQAQ MMAYADALEF EKAAEIRNQL GALSRVLHQQ SVETNADGAF DKDADILAAV
VQGGRACVNL AMVRGGRHLG DRPYFPTHVE EASALEGMED AAGPVAAVAP PQARVEVRVL
EAFIAQHYLD GAPPPLLITS HAIDKALVEA LTDASGVRVT AQHQPRDQRR QWLEMAEKNA
RIKLTQLLGE EGSQQARTRA LVDALDLAPD QLERFRIECF DISHTAGEAT QASCVVFEEH
KMQPAQYRRY NITGITPGDD YAAMRQVLTR RYAKLAEASA QGTARLPDLV LVDGGKGQVA
MARDVFEELG LDLSLIVGVE KGEGRKVGLE ELVFADGRAK VYLGKDSAAL MLVAQIRDEA
HRFAITGMRA RRASVRTGGS RLEDIAGIGP KKRAKLLQRF GGARGVASAS VDDLSSVDGI
SRELAEEIYR VLH