Gene Mext_3397 details


Gene Information

Locus tag: Mext_3397
Symbol:
ID: 5835370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3767307
End bp: 3768725
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 74%
IMG OID: 641369196
Product: sporulation domain-containing protein
Protein accession: YP_001640854
Protein GI: 163852811
COG category:
COG ID:
TIGRFAM ID:
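
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
START_BP = 3767307
END_BP = 3768725

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1419
print(protein_length_aa(length))  # 472
```

Both results agree with the Gene Length and Protein Length fields listed in the record.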


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0891472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGGAC ACGCTTCGCG CGCGACGGTC GACTTCGATG CCTTCGAGCG CGAGCTGCGT 
CAGACGTCGC AGGAGGCGAT CCGGGCGAAG GCGCCGCAGG CCGCCCCCAA GGGCGCACCC
AAGGGCGACC CGCTCGCCGA GCTTGCCCGC ATCGTCGGGC AGGACGATCC CTTCCGCGCC
CTGCTGGAAG CGCGGGAGAA GGGTGCCGCC CAGGAGGCGG CTCCGGTGAC GCGCGCATCG
GAGACGGGCC GTCCGGCCCG CGTCGAGCCG ACCTTCGTGG ACGAACCCGC CCACGACCCG
GCCCGGACTC AGGCGCATGC CGACATGCAC GGTCAGTCCC AGAGCCCCGC GGACGCCTTC
GACCAGTATC TCGCCTCCGT CGAGCAGGGC ATGTACGCCG ACGGCACGAC CGATCCGGCG
GCCTTCGCCG AGGCCGACGA GACTTACCGG ACGCGGTCTG CGGACCGTCC GCGCGGCCGC
AACCGCCTCG TCCAGGTCGG CGCCGGCCTC GCCGTGGTCG CCGTCTGCGT CGGCGGCGCC
CTGGCGTGGC GTGGCACCCA TGGCGGCGGC AGCGGCGGCC CGATCACCGT GCTCGCCGAC
AAGACCCCGC TGAAGGTGCA GCCGACTGCG ACCGACGGCG TCGAGATTCC CGACCAGAAC
AAGCAGATCT ACGACCGCAA CGCCAAGGAC GGTCAGATCA AGATCGTCAA CCGCGAAGAG
CAGCCGCTCG ACGTCAACCA AGCCGCCCGC TCCGCGGCCG CCCGCAGCGA CGGCGGCGAG
CCGGGGCAGG GCGGGGCGAC CCCCGGCGGC ACCTTGTCCG ACACGTTCGG CGAGCCGCGC
CGGGTGCGGA CCGTCTCGGT CAAGCCGGAC ACCCCGGTCC ACCAGCCGCC GGCGCCCCCG
GCCGAGACCG CCCAGGCTCC TGCCTCGGCA ATCCCGACCA TGACGATGCC CGACACCGCT
GCGAGCACCG CAACGCCGTC GTCGGAGCCC CGTCGCTCCG CGTCGCGCAC CCTGGCCACG
GCGCCGGCGA CCACGCCCGT CGCCGAGGCA CCGGCGGAGC CGCCGGCCGC GCCCGCCGCG
CGCCCGAAGG CCCCGCAGCG CGTCGCCTCC GTCTCGCCCG AGACCACCGC CAGCACCTCC
GAGCCCGCCC CCACCACCGC TTCGCTCACG GCGCCGGTCA GCGGCTACTC GGTCCAGCTC
GGCGTGCGCG GAAGCGAGAG CGAGGCGCGG GCCGCCTTCC GCGAGATGCA GGGCAAGTAC
AGCCAGCTCT CCGGCAAGCC CGAGCTGATC CGGCAGGCCG AGGTGAACGG CAAGACCCTG
TTCCGCGTCC GCGTCGGGCC GCTCGCCAAG AACGAGGCCT CCAGCCTGTG CAGCGCGCTG
CAGGGCGCGG GCGGCCAGTG CTTCGTCGCC AAGAACTGA
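
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before it can be measured. The following is a minimal sketch, with hypothetical helper names, of how the block could be joined into one string and how its length, GC percentage, and terminal stop codon could be checked. It assumes the sequence contains only the letters A, C, G, and T, and uses the stop codons TAA, TAG, and TGA of translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# Last line of the gene sequence above, used as a short example.
last_line = "CAGGGCGCGG GCGGCCAGTG CTTCGTCGCC AAGAACTGA"
seq = clean_sequence(last_line)
print(len(seq))             # 39
print(ends_with_stop(seq))  # True
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_percent` gives a value that can be compared with the GC content field of the record.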
 
Protein sequence
MTGHASRATV DFDAFERELR QTSQEAIRAK APQAAPKGAP KGDPLAELAR IVGQDDPFRA 
LLEAREKGAA QEAAPVTRAS ETGRPARVEP TFVDEPAHDP ARTQAHADMH GQSQSPADAF
DQYLASVEQG MYADGTTDPA AFAEADETYR TRSADRPRGR NRLVQVGAGL AVVAVCVGGA
LAWRGTHGGG SGGPITVLAD KTPLKVQPTA TDGVEIPDQN KQIYDRNAKD GQIKIVNREE
QPLDVNQAAR SAAARSDGGE PGQGGATPGG TLSDTFGEPR RVRTVSVKPD TPVHQPPAPP
AETAQAPASA IPTMTMPDTA ASTATPSSEP RRSASRTLAT APATTPVAEA PAEPPAAPAA
RPKAPQRVAS VSPETTASTS EPAPTTASLT APVSGYSVQL GVRGSESEAR AAFREMQGKY
SQLSGKPELI RQAEVNGKTL FRVRVGPLAK NEASSLCSAL QGAGGQCFVA KN