Gene Mext_3966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3966 
Symbol 
ID5835623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4406632 
End bp4407900 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content72% 
IMG OID641369757 
Productarsenical pump membrane protein 
Protein accessionYP_001641408 
Protein GI163853365 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1055] Na+/H+ antiporter NhaD and related arsenite permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.317509 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCGC TGATCCCGAA CCCGAACGCG GCGACCTGGG GTATCGCCGC GCTGGCGACG 
CTCGGCGTGA TCCTGCGCCC GTTCTCCTGG CCGGAGGCGA TCTGGGCGGT GCTCGGCTCG
GTGCTCCTCG TCCTCCTCGG CCTGATCCCC TGGCAGAATG CCCTGGAGGG CGCGGCCAAG
GGCACCGACG TCTATCTCTT CCTCGTGGGG ATGATGCTGC TCTCCGAGAT CGCCCGGAAG
CAGGGCTTGT TCGATTGGCT CGCCGCCCAC GCGGTGCGGG CCGCGAAGGG GTCGCCGACG
CGGCTGTTCT CGCTCGTCTA CGTCGTCGGC ACGGTGGTCA CGGTCTTCCT CTCGAACGAT
GCCTGCGCGG TGGTGCTGAC GCCCGCCGTC TTCGCCGCGA CGCGGGCCGC CGGGGTGAAG
CAGCCCCTGC CCTACCTGTT CGTCTGCGCC TTTATCGCCA ACGCGGCGAG CTTCGTGCTG
CCGATCTCGA ACCCGGCCAA CCTCGTCGTC TTCGCCGAGC ACATGCCGCC GCTCGGCCGA
TGGCTGGCGA CCTTCACCCT GCCCTCCCTC CTCGCCATCG TCGCGACCTA TCTCGTCCTG
CGCCTGACCC AGAACGCGCG GCTGAAGGCC GAGACGGTCG CGACCGACGT CGCGATCCCG
AGGCTCGGGC TCGGCGGCAC GATCGCGGCC GGGGGCATCG TCGCCACCGG CGCGGCCCTG
ATCGGCGCCT CGGCCGCCGG GATCGAACTC GGCCTGCCGA CCTTCATCGC CGGGCTCGCC
ACGACCCTCG TCGTGCTCGC AATCAACCGG GGCGGGCTGG TCGCGGTCGC TCGGGACGTC
TCCTGGGGCG TGCTGCCGCT GGTCGCCGGG CTCTTCGTCC TCGTCGAGTC CCTGGAGAAA
ACCGGCCTGC TCGCGAAACT CGCCGACCTC CTGGGCCGCG CCGCGCAGGG CGATCCCGCC
GCGACGGCCT GGGCCGGCGG CGCGCTCGTC GCCTTCGGAT CGAACCTCGT GAACAACCTG
CCGGCGGGTC TCCTGGCGGG CGCGGCGGTG CAGGCCGCCC ATGTGCCGGA GACGGTGGCG
GGGGCGATCC TGATCGGCGT CGATCTCGGG CCGAACCTCT CGGTCACGGG CTCGCTCGCC
ACGATCCTCT GGCTCACCGC GATCCGCCGC GAGGGCCAAA ACGTCTCCGC CTGGGCTTTC
CTGAAGCTCG GCGCCCTGGT CATGCCCCCG GCGCTGGCGC TGGCCCTCGC GGCTCTGATC
CTCGCCTGA
 
Protein sequence
MGALIPNPNA ATWGIAALAT LGVILRPFSW PEAIWAVLGS VLLVLLGLIP WQNALEGAAK 
GTDVYLFLVG MMLLSEIARK QGLFDWLAAH AVRAAKGSPT RLFSLVYVVG TVVTVFLSND
ACAVVLTPAV FAATRAAGVK QPLPYLFVCA FIANAASFVL PISNPANLVV FAEHMPPLGR
WLATFTLPSL LAIVATYLVL RLTQNARLKA ETVATDVAIP RLGLGGTIAA GGIVATGAAL
IGASAAGIEL GLPTFIAGLA TTLVVLAINR GGLVAVARDV SWGVLPLVAG LFVLVESLEK
TGLLAKLADL LGRAAQGDPA ATAWAGGALV AFGSNLVNNL PAGLLAGAAV QAAHVPETVA
GAILIGVDLG PNLSVTGSLA TILWLTAIRR EGQNVSAWAF LKLGALVMPP ALALALAALI
LA