Gene Mext_3479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3479 
Symbol 
ID5834384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3859151 
End bp3860272 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content71% 
IMG OID641369278 
Productregulatory protein LysR 
Protein accessionYP_001640935 
Protein GI163852892 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1910] Periplasmic molybdate-binding protein/domain 
TIGRFAM ID[TIGR00637] ModE molybdate transport repressor domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGCGC CGTTCCGAAA CAAGCCCCGA CAAGGGAAAC CGGTTGTGGA CGACAACTAT 
GACGTCATCA ATATGAATCA AGATTCCGAA GAGGTGGCCG TCTCCGTCGT CCTCGGCCTG
GGTGGCACGA TTCGGATTGG CGCGAGCGCG CTCACGGTGT CCGACCTGAC GGTGATGTTC
GACGCGATCA CGCGGACCGG CTCGGTGCAG GGCTTCGCCG ACGCCCTCGG CCTGTCCTAT
CGCGCGGCCT GGGCGCGCCT CCAGGCCTAC GAGACGGCGC TCGGCCGGCC CTTGGTGCGC
AAGACACGTG GGCACGGCAC CGCCCTGACG GAGTTCGGGG CCGCCTTGGC AGACGCCTTC
ACCGCCGCCT CGGCAGCCTT GGAGGCCAGC CTCGGCCGTG AGACCCGCGC CGTCGAGCAT
CGCCTGCGTC TTCTCATGAG CGGCGGGGCC GGAGCACTGA CACTGGCCGC GAGCCACGAT
CCGCTGCTGG TCGAGGTCCT GACCGAGGTC ATGAGCGGAG AACCGGGTGC AGAGGCGGGC
ATCGAACTCT CCGTGACGGG CAGCAGCGCG GCGGTGCAGC GCCTCCTCGA TGGCGGAGCC
GATGCGGCGG GCTTCCATTG CGGGGCTCTC GCGCCGGAAG CGGCGGGCGC TCCGTTCTCG
GCGATCAATG CCGGCGCCGG CCTCGTGCTG CACCCCCTGT TCGAGCGCGA GCAAGGCCTG
CTGCTGGCCC CCGGCAACCC GCGCGGCATC CGCACGCTGG CCGACCTGGC CGCGCCCGGC
CTGCGCTACG TCAACCGCCA GAAGGGATCG GGCACGCGGG ACTGGTTCGA CCGCATGTTG
GCGCAGGCCG GCCTGCCGGC TGGCGCGATC CAGGGCTACA CGGTCGAGGA GTTCACCCAT
CAGGCGGTCG CGGCGGTCAT CGCCTGCGGC GCGGCGGATG CGGGACTCGG TGTACGGGCG
GCGGCCGACC GGCTCGGCCT CGATTTCCTC TCGGTCGGCT GGGAGACCTA TTATCTCGCC
GCCAGCCGCT CCCTCGCCAG CCCAGCACTC GACGCCCTCG TCGCCGCAGC GAGACGGCGC
GCGAGCCGCA CCCCCGGCTA CCGCGCCGCC GCGGATCTTT GA
 
Protein sequence
MAAPFRNKPR QGKPVVDDNY DVINMNQDSE EVAVSVVLGL GGTIRIGASA LTVSDLTVMF 
DAITRTGSVQ GFADALGLSY RAAWARLQAY ETALGRPLVR KTRGHGTALT EFGAALADAF
TAASAALEAS LGRETRAVEH RLRLLMSGGA GALTLAASHD PLLVEVLTEV MSGEPGAEAG
IELSVTGSSA AVQRLLDGGA DAAGFHCGAL APEAAGAPFS AINAGAGLVL HPLFEREQGL
LLAPGNPRGI RTLADLAAPG LRYVNRQKGS GTRDWFDRML AQAGLPAGAI QGYTVEEFTH
QAVAAVIACG AADAGLGVRA AADRLGLDFL SVGWETYYLA ASRSLASPAL DALVAAARRR
ASRTPGYRAA ADL