Gene Mext_3914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3914 
Symbol 
ID5835138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4349068 
End bp4350165 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content73% 
IMG OID641369705 
Productplasmid encoded RepA protein 
Protein accessionYP_001641356 
Protein GI163853313 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0766872 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGGGC TTGAGGCCAA GATCGCCGCG ATCCGTGACC CGGATCTGCA GGCGGAGCTG 
GAGGCGGCGC GGGGCGGGTT CCTGTTCGCG CCGATCGTCG AGCACCTGCT GTTCCGCCAG
CGCGAGCGTG ACGCCGCCCG GGCCCAGGAG GGGGCGCAGG CCGAGGCCCG CGAGGCCATG
GGCCGCGACC GCCGCCGCCG CGACGCCGTG CGCGAGGTGA TCGAGAGCGA GCCCACCGGC
CCGGAAAACC TCCAGCACCT GCACTCGGTG CTGGCGCTCT GCGGCCTGCC CTACCGCGAT
CCCGGTGATG CCCGCGACTT CGTGCGCGAA TACGGCCGCA ACTCGCTCAG CCTCTCGGCG
GGGCGCCTCA AGAACCCGAT CACCGGCGAG ATGGAGCTGC AGGGCCTGCC CTACGGCCCC
AAGGCCCGGC TCGTGCTGCT GCACCTCTGC ACCGAGGCGG TGCGCCAGCG CAGCCCGACC
ATCGAGGTCG CCGACAGCCT CTCGGGCTTC ATGAAGGCGA TGGGGTTCGC CGTCACCGGC
GGCGAGCGCG GCACCATCGG CGCCTTCAAG GAACAGCTCA ACCGGCTCGC CGCCTGCTCG
ATGCAGCTCG GCCTGTGGGA CGGGGAGGGG CAGGCCTCGA CCCTCAACGT GCCGCCCTTC
CGCCAGCTCG AATTGTGGCG GCGCGGCGAT GACGGCCTCG TCTGGCAGCG CACCGTCTCG
TTCCATCAGG ATTTCTACGA CAGCCTGATC CGGCACGCCC TGCCGGTCGA TATCCGCGCC
GCGCGGGCCT TCTCCGGCTC GGCGCGCAAG CTCGACCTCC TGTTCTGGAC CGGCTACCGC
CTGCGCGCCC TGCAGCGCCC CCTGCGGCTG ACCTGGGACA ACCTGCACCG CCAGTTCGGC
GCCGAGAACG CCAGCCTGCG CAGCTTCCGC CAGGCCTTCA AGGCGGATCT CGCCGGCCTG
CTCGAAGTGT TTCCGCGGCT GCGGATCGAC CTCGACGAGG GCGGCATGCT GCTCCACCCG
GCCGATCCCG GCAGCCTGCT GGTGCCGCCC AAGGCCGCCC GCACCGCGCG CGCGGCGGCC
TCTGCGGCCC GCGCCTGA
 
Protein sequence
MSGLEAKIAA IRDPDLQAEL EAARGGFLFA PIVEHLLFRQ RERDAARAQE GAQAEAREAM 
GRDRRRRDAV REVIESEPTG PENLQHLHSV LALCGLPYRD PGDARDFVRE YGRNSLSLSA
GRLKNPITGE MELQGLPYGP KARLVLLHLC TEAVRQRSPT IEVADSLSGF MKAMGFAVTG
GERGTIGAFK EQLNRLAACS MQLGLWDGEG QASTLNVPPF RQLELWRRGD DGLVWQRTVS
FHQDFYDSLI RHALPVDIRA ARAFSGSARK LDLLFWTGYR LRALQRPLRL TWDNLHRQFG
AENASLRSFR QAFKADLAGL LEVFPRLRID LDEGGMLLHP ADPGSLLVPP KAARTARAAA
SAARA