Gene Mext_1911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1911 
Symbol 
ID5835735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2128071 
End bp2129366 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content69% 
IMG OID641367711 
Productarsenical pump membrane protein 
Protein accessionYP_001639381 
Protein GI163851338 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1055] Na+/H+ antiporter NhaD and related arsenite permeases 
TIGRFAM ID[TIGR00935] arsenical pump membrane protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCGC TCGCCATCTT CCTCGTCACC CTGGTGTTCG TCATCTGGCA GCCCAGGGGC 
CTCGGAATCG GGTGGAGCGC GCTCGCCGGC GCTGGCGTCG CGCTGGCCAC GGGCGTGATC
CACCCGGGCG ACATCCCGGT GGTCTGGCAC ATCGTCTGGG ACGCCACCTT CACGTTCGTG
GCGCTCATCA TCATCTCGCT GCTGCTGGAC GAGGCCGGGT TCTTCCATTG GGCCGCCCTG
CACATCGCTC GCTGGGGCGG TGGCCGGGGC CGGCGGCTGT TTCCCCTGGT GATCCTGCTC
GGCGCGGCCA TCGCGGCGGT CTTCGCGAAC GACGGCGCCG CGCTGTTGCT CACCCCCATC
GTGCTGGCGG TCCTGCTGCG GCTCGACTTC AAGCCGGCGG CGGCGCTCGC GTTCATCGTC
GCCTGCGGGT TCGTGGCGGA TTCGACGTCC CTGCCGCTGG TGATCTCGAA CCTCGTCAAC
ATCGTCTCGG CCAACTTCTT CGACGTGACC TTCGGCCGGT ACGCAGCCGT CATGGTGCCC
GTGGACCTCG TGTCCCTGGC GGCGACGTTA TTGGTACTGT GGGCCTACTT CCGGCGTGAC
GTGCCGGCGA CCTATCCCGT GGACGCCCTG GAACGCCCGG CCGAGGCGAT CCGCGACCCG
CTCGTGTTCC GTGCGGCGTT CCCTCTGCTC GGCGTCCTGC TGCTCGCCTA CTTCGTCACC
GCGCCGTTCG GGGTGCCGGT GTCGGTCGTG ACCTGTGCAG GCGCCGCGGT GCTGCTGCTG
CTCGCGAACC GCGGCGGGAC CATCCCGATC CGCAAGGTTC TGACCGGGGC GCCCTGGCAG
ATCGTCCTGT TCAGCCTCGG CATGTACCTC GTGGTCTACG GCCTGCGGAA CGCCGGCCTG
ACCGACGAGC TGGCCAAGGG CTTGGTCTGG CTCGCGGGCC ACGGCCCATG GGTCGCCACG
GTCGGCACCG GCTTCGCGGC GGCCATCCTA TCGTCGGTGA TGAACAACAT GCCGAGCGTG
CTGATCGGCG CGCTCTCGAT CCAGCAGGCC CCGGACCTGT CGCCGCTGAC CCGCGAACTG
ATGGTCTACG CCAACGTCAT CGGCTGCGAC CTCGGGCCGA AGTTCACGCC CATCGGCAGC
CTCGCCACGC TGCTCTGGCT GCACGTCCTC GACAGCAAGG GCCAGAGGAT CACCTGGGGC
CAGTACATGA AGGTCGGCCT CGTCATCACC CCGCCGGTGC TGCTGGTGAC GCTCCTCGCG
CTCGCCGTCT GGCTCCCGGT CCTCGGTCCC CAATGA
 
Protein sequence
MLALAIFLVT LVFVIWQPRG LGIGWSALAG AGVALATGVI HPGDIPVVWH IVWDATFTFV 
ALIIISLLLD EAGFFHWAAL HIARWGGGRG RRLFPLVILL GAAIAAVFAN DGAALLLTPI
VLAVLLRLDF KPAAALAFIV ACGFVADSTS LPLVISNLVN IVSANFFDVT FGRYAAVMVP
VDLVSLAATL LVLWAYFRRD VPATYPVDAL ERPAEAIRDP LVFRAAFPLL GVLLLAYFVT
APFGVPVSVV TCAGAAVLLL LANRGGTIPI RKVLTGAPWQ IVLFSLGMYL VVYGLRNAGL
TDELAKGLVW LAGHGPWVAT VGTGFAAAIL SSVMNNMPSV LIGALSIQQA PDLSPLTREL
MVYANVIGCD LGPKFTPIGS LATLLWLHVL DSKGQRITWG QYMKVGLVIT PPVLLVTLLA
LAVWLPVLGP Q