Gene Mext_3936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3936 
Symbol 
ID5830985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4375245 
End bp4376675 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content72% 
IMG OID641369727 
ProductDNA repair protein RadA 
Protein accessionYP_001641378 
Protein GI163853335 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1066] Predicted ATP-dependent serine protease 
TIGRFAM ID[TIGR00416] DNA repair protein RadA 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.075328 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAA TCCAACAGAC CTTCGTCTGC CAGTCCTGCG GTGCGGTCTA CAACCGCTGG 
CGCGGGCGCT GCGAGGCCTG CAACGGCTGG AACACGATCC AGGAGGAGGT CGCCTCCGCC
GGCCCGCAAT CGGGCCCGGC CGCGACCCGG CCCTCGCGGG CGCGGGGCCG CGTCTTCCCC
CTCGAAGGCC TGACCGGTGA GGCCAAGGAG GCGCCGCGCA CGCCCTCGGG CATCAACGAA
CTCGACCGGG TGACCGGCGG CGGCTTCGTG CGCGGCTCGG TGATCCTGCT CGGCGGTGAT
CCCGGCATCG GCAAGTCGAC CCTGCTGATG CAGGCCTCCG CCGCGATGGC CAAGAGCGGC
GAGCGCGTCG CCTACATCTC CGGCGAGGAG GCGGTGGGGC AGGTGCGCCT GCGCGCCGAG
CGCCTCGGGC TGGCCAAACA CCCGGTGGAG CTGGCGGCGC AAACGAATGT CGAGGACATC
GTCGAGACGC TGTCGCAGGG CCACCCGCCG GCGCTGACCA TCATCGACTC GATCCAGACC
ATGTGGACCG AGACGGTGGA ATCGGCGCCG GGCACCGTCA CGCAGGTGCG CTCATCGGCC
CAGGCCCTGA TCCGATTCGC CAAGACCACG GGCACGGCCG TCATCCTCGT CGGCCACGTC
ACCAAGGACG GGCAGATCGC GGGCCCCCGC GTGGTCGAGC ACATGGTCGA TGCGGTCGCC
TCGTTCGAGG GCGACCAGGG CCACCATTTC CGAATCCTGC GCGCCGTGAA GAACCGCTTC
GGGCCGACCG ACGAGATCGG CGTGTTCGAG ATGACCGATG CCGGGCTCGC CGAGGTGCCG
AATCCCTCCG CCCTGTTCCT GGCCGGGCGC GACCACGCCG CACCGGGCAC CGCGGTCTTT
GCCGGGATGG AGGGCACGCG TCCGCTGCTG GTCGAGATCC AGGCGCTCGT CGCCCCCTCC
TCGCTCGGCA TGCCGCGCCG CGCCGTGGTC GGCTGGGACC CCAACCGCCT GTCGATGGTG
CTCGCCGTGC TGGAGGCCCA TGGCGGCATC CGGCTCGGCG GCCACGACGT CTACCTCAAC
GTCGCCGGGG GCCTGCGCAT CACCGAACCC GCCGCGGATC TCGCGGTCGC CGCCGCCCTC
GTCTCCTCGC TCTCGGGCGC GGCGCTTCCC TCCGACTCGG TCTATTTCGG CGAACTCGGC
TTGTCGGGGG CGGTGCGGCC GGTCTCGCAG GCGCCGGCCC GGCTGAAGGA GGCATTGAAG
CTCGGCTTTG GCAAAGCGAT CACCCCGCAA GGGCGCGGCG AGGCCGGAGA CCGGGCGCTG
CCGACGGATG CGCTGCGCCA CATCGCCGAC CTCGTCGCCG GCATCGCCTC GGGCGCACCG
CGGAAGGGCG GGGGCCGCCC GCGGGCGGTG CGGTTCGAGG AGGAGATGTA G
 
Protein sequence
MAKIQQTFVC QSCGAVYNRW RGRCEACNGW NTIQEEVASA GPQSGPAATR PSRARGRVFP 
LEGLTGEAKE APRTPSGINE LDRVTGGGFV RGSVILLGGD PGIGKSTLLM QASAAMAKSG
ERVAYISGEE AVGQVRLRAE RLGLAKHPVE LAAQTNVEDI VETLSQGHPP ALTIIDSIQT
MWTETVESAP GTVTQVRSSA QALIRFAKTT GTAVILVGHV TKDGQIAGPR VVEHMVDAVA
SFEGDQGHHF RILRAVKNRF GPTDEIGVFE MTDAGLAEVP NPSALFLAGR DHAAPGTAVF
AGMEGTRPLL VEIQALVAPS SLGMPRRAVV GWDPNRLSMV LAVLEAHGGI RLGGHDVYLN
VAGGLRITEP AADLAVAAAL VSSLSGAALP SDSVYFGELG LSGAVRPVSQ APARLKEALK
LGFGKAITPQ GRGEAGDRAL PTDALRHIAD LVAGIASGAP RKGGGRPRAV RFEEEM