Gene Mext_4439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4439 
Symbol 
ID5834231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4939263 
End bp4942406 
Gene Length3144 bp 
Protein Length1047 aa 
Translation table11 
GC content73% 
IMG OID641370232 
Productdouble-strand break repair protein AddB 
Protein accessionYP_001641878 
Protein GI163853835 
COG category[L] Replication, recombination and repair 
COG ID[COG3893] Inactivated superfamily I helicase 
TIGRFAM ID[TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.522195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.331635 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCCA GCGCCGCCCC CCGCGTCTTC ACCATTCCCC CCGGTGCGCC GTTCCTGCCG 
ACGCTGGCGG ACGCGCTCGT GTCGGGCCGC CTCGTCGGCG CGGTCGGCGG CGATCCCTTC
GCGCTCGCCT CGGTGACGCT CTACCTGCCG ACGCAGCGCG CGGTGCGCGC GCTCTCGACC
GTGCTGGCGC AACGCCTCGG CGGAGCGGCT CTGCTGCCGC GGATGATCCC GTTGGGCGAG
GCCGACGAGG CGGAGCTCGA CCTCTCGTCC AACACGCTGC TGGAGACGCC GGAAGACCTG
CTCTACCCCT CCATCCCCGC GCTGGAGCGC CGGCTGATCC TGGCTCGGCT CGTCCAGAAA
TGGGCGGAGA CGGTGGACCG CGAACTCCTG CCCATCGACG ACGAGGTGCC CTTCCTCGTC
CCGTCCTCCC CGGCCGACGC GGTGGCGCTC GCCGCCGACC TCGAAGGGCT GATGGACGCG
CTCACCGTCG AGGGCCTGCC CTGGAGCGAG ATCGCCGCGG CGGTGGAGGC CGAGCATTCG
CGCTATTTCC GCCTCACCCT CGACTTCCTG AAGATCGCCG CCGAGCACTG GCCCGACATC
CTCGCCGCCC GTTCGCTCGC CGACCCCACC GCTCGCGCCC GCCGCCTCGT GCTCGCGGAA
GCCGACCGGC TGTTCCGCGA GCGGCCGGGC GATCCGGTGA TCGTGGCCGG CTCCACCGGC
TCGGTGCCGG CCACCGCCCG GCTGATCGCG GCGGTGGCCC GCCTGCCCCG CGGCGCGGTG
GTGCTGCCGG GGCTCGACCT GCACCTCGAC GCGGAGGGCT GGGACGCGAT CGACGGCGCC
GGCGGCAAGC ACGACGAGAT CGCCCACGGC CATCCGCAGG CGATCCTTCG GGCGCTGACC
GGACCGAGGG GCTTTGCCGT CGAGCGCCGG GATGTCGAGA CGCTCGGAGA TCTCTCGCCG
GAGGCGGTGG CGCGGGAAAA GCTGCTGTCG CAGGCGCTAC GCCCGGCCGA GACCACCGAT
GCCTGGGCCG GCCTCGACGC GGCAGAGCGG ATGACCCTCG CACGGGAGGG CATGGCGGGC
CTCGCCGTGG TCGAGGCGGC GGACGAGCGC GAGGAGGCGC TGGTCGCCGC GCTCGCCCTG
CGCGAGACGC TGGAGACGCC CGGCGCCACC GCGGCCCTCG TCACGCCCGA CCGCGGGCTG
GCGCTACGCG TCTCGGCCGA ACTCGCCCGC TGGGGCATCG CGGCCGAGGA TTCGGCGGGC
TTGAGCTTGG CGCGCTCGCA GGCCGGGCGC TTCGCCCGGC TCGTCGCCGA ACTCGCCGCC
GAGGTGGCGC CGGCCCGCGT CATCGCGCTT CTCGCCCACC CTTTCGTCCG ACTCGGGCTC
ACCCGCAGCG AGGTCGTGCG CGCGGCCGCG GCCCTTGAGA TCGGCGGCCT GCGCGGCCCC
GCCCCGATCC CGAAGAACAG CTTCGACGGA ATGCGGGCGG CGGTGGCGCT GCAACGCAAC
GCCACCGAGC GGGTGCCGCG CGCCAAGAAG CGGCTGAAGC CGGTCGATTG GGATCTCGCC
GAAGACATCC TGGACCGGCT GGAGATCGCG CTCGATGCGT TCCGCACCGA CCTGCAGCCC
GAGATCGGCA ACCTCGTGGC GCTCGCCGCC GGGCACCGCG AGGCCTGCGA GCGGATGATG
GGCGGGGCCG AAGAGGGTGA CGCGGACGAG AGCGACGATC CCTCCCTCGA CACGCTCGAC
GGCCTGTTCG ACGATCTCGA ATCCGCCGAG CAGGAAGAGC TGCCGGGCCG GTTCTCCGAC
TACGCCGCCT TCTTCACCGC GCTTGCCCGC GACCGCACCG TCGCCTGCGC GCAGCGGAGC
GCGCATCCGC GCCTGCGCAT CCTCGGGCCG CTCGAAGCGC GCCTGCTCTC CGTCGATCGC
ATCGTGCTGG GCGGGCTCGA CGAGACGGTC TGGCCGGTGC GCCAGACCAC CGACGCCTTC
CTCAACCGGC CGATGCGCGG AGATGTGGGC CTGAGCCCGC CCGAGCGGCG GATCGGACAG
GCCGCCCACG ACTTCGTGCA GGGCTTGGGC ACGCATGATG CCGTGGTCAC CCGCGCGGCC
AAGCGCGAGG GCTCGCCGAC CGTGCCGTCA CGCTTCCTCC AGCGCCTGCG CGCCTTCGGC
GGCGATGCGG TCTGGGCCGA CGCTATCGCC CGCGGCCAGC GCCTGCGCGG GCTCGCCGCG
CGGCTCGATC GCGGCCAGGC GGCCCCGCCC CCGCGCCTCA AGCGCCCCGC GCCGAAGCCC
GACCCGGCTC TGTTCCCGGA GCGCCTCAGC GTCACCGAGA TCGAGACGCT GGTGCGCGAT
CCCTACTCGA TCTACGCCCG GCACATCCTG GGCTTGGAGG CGCTGGAGCC GATCGCCGTG
GTTCCGGGCG CGGCCGAGCG CGGCAGCCTG ATCCACAAGG TCCTCGGCGA TTTCTCGCAA
GCTCATCCCG GCGCGCTGCC GGCGGACGCG GAAACCCTGC TCTACGCGAT CGCCTTCGAT
GCGTTCGGCC CGCTGCAGGA CCAGTACCCG GAGCTCTACG CCGAGTGGTT TCCCCGCTAC
GAGCGCATGG CGGTCGCCTT CCTCGAATGG GAAGCGAGGC AGCGCCAGGG CTTGCACACG
ATCCACGCCG AGCGCTCGGG CAAGGTAACG ATTCCTCTGG GCGAGCGAAC CTTCACGCTC
TCGGCCCGTG CCGACCGGAT CGAGGCCCGC GCCGACGGCA GCTACTGCAT CGTCGACTTC
AAAACCGGCA CGCCGCCGAG CAACCGCACC GTCTTTGCCG GTTTCTCGCC CCAGCTCACC
CTGGAGGCGG CGATGCTGAT GCATGGCGCC TTCGAGGGGC TGAAGGCCGC CTCCCCGCCG
GACCTCCTCT ACGTCTACGC CTCGGGCGGG CGCGAGCCGT TCCGGCCGAT CCCGGTCAAG
CCGCCGCCCG GCGATGCGCG GGCGGTGGAG GCCGTCGTCG AGGAGCATTG GCAGCGCCTG
CGCGGATTGA TCGCCCGCTA CATGACCGGC GAGGCCGCCT ACCTCTCGCG CCCCTATCCG
CAATACGAGC GCGCCTACAA CGAGTACGAT CACCTCGCCC GCGTGCTCGA ATGGTCGCTG
GCCGGGCAAG GAGAAGCCGC GTGA
 
Protein sequence
MSASAAPRVF TIPPGAPFLP TLADALVSGR LVGAVGGDPF ALASVTLYLP TQRAVRALST 
VLAQRLGGAA LLPRMIPLGE ADEAELDLSS NTLLETPEDL LYPSIPALER RLILARLVQK
WAETVDRELL PIDDEVPFLV PSSPADAVAL AADLEGLMDA LTVEGLPWSE IAAAVEAEHS
RYFRLTLDFL KIAAEHWPDI LAARSLADPT ARARRLVLAE ADRLFRERPG DPVIVAGSTG
SVPATARLIA AVARLPRGAV VLPGLDLHLD AEGWDAIDGA GGKHDEIAHG HPQAILRALT
GPRGFAVERR DVETLGDLSP EAVAREKLLS QALRPAETTD AWAGLDAAER MTLAREGMAG
LAVVEAADER EEALVAALAL RETLETPGAT AALVTPDRGL ALRVSAELAR WGIAAEDSAG
LSLARSQAGR FARLVAELAA EVAPARVIAL LAHPFVRLGL TRSEVVRAAA ALEIGGLRGP
APIPKNSFDG MRAAVALQRN ATERVPRAKK RLKPVDWDLA EDILDRLEIA LDAFRTDLQP
EIGNLVALAA GHREACERMM GGAEEGDADE SDDPSLDTLD GLFDDLESAE QEELPGRFSD
YAAFFTALAR DRTVACAQRS AHPRLRILGP LEARLLSVDR IVLGGLDETV WPVRQTTDAF
LNRPMRGDVG LSPPERRIGQ AAHDFVQGLG THDAVVTRAA KREGSPTVPS RFLQRLRAFG
GDAVWADAIA RGQRLRGLAA RLDRGQAAPP PRLKRPAPKP DPALFPERLS VTEIETLVRD
PYSIYARHIL GLEALEPIAV VPGAAERGSL IHKVLGDFSQ AHPGALPADA ETLLYAIAFD
AFGPLQDQYP ELYAEWFPRY ERMAVAFLEW EARQRQGLHT IHAERSGKVT IPLGERTFTL
SARADRIEAR ADGSYCIVDF KTGTPPSNRT VFAGFSPQLT LEAAMLMHGA FEGLKAASPP
DLLYVYASGG REPFRPIPVK PPPGDARAVE AVVEEHWQRL RGLIARYMTG EAAYLSRPYP
QYERAYNEYD HLARVLEWSL AGQGEAA