Gene M446_1991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1991 
Symbol 
ID6135634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp2224411 
End bp2225682 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content74% 
IMG OID641642222 
Productarsenical pump membrane protein 
Protein accessionYP_001768890 
Protein GI170740235 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1055] Na+/H+ antiporter NhaD and related arsenite permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.722795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.961968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCGC TCGGCGTCAC ACCCCACCTC GCCACCTGGG GCATCGCCGC GCTGGCGACG 
CTCGGCGTGA TCCTGCGCCC CCTCGGCTGG CCGGAAGCGG TCTGGGCCGT GCTCGGGGCG
ATCGTGCTCG TCGCCCTCGG CCTGCTCCCC GCCGGCACCG CCTGGGACGG GGTCCTGAAG
GGCACGGACG TCTACCTCTT CCTGATCGGC ATGATGCTGC TCGCCGAGGT CGCCCGGAAG
GAGGGGCTGT TCGACTGGCT CGCCGGCATC GCGGTGCGGC GGGCGCGGGG CTCGGCGACG
CGGCTCTTCA CCCTCGTCTA CGCGGTCGGC ACGGTCGTGA CGGTGTTCCT GTCGAACGAT
GCCTGCGCGG TGGTGCTGAC GCCGGCGGTC GCGTGCGCCG CCAAGGCCGC CCGGGTGCGC
GACCCGCTGC CCTACCTGCT GGTCTGCGCC TTCATCGCCA ACGCGGCGAG CTTCGTGCTG
CCGATCTCGA ACCCGGCCAA CCTCGTCGTC TACGCGGCGC ACATGCCCCC GCTCGCCGAG
TGGCTCGCCC GCTTCACCTT GCCCTCCGCG CTGGCGATCC TGGCGACCTA CGCGGCCTTG
CGCCTCACGC AGGGCCCCAC CCTGCGCGCC CAGGAGGTGG CGACCGACGT GCCGCGCGCC
GACCTGTCCC GCACCGGGCT CGTCGCCGGG CTCGGCATCC TGGCGACGGG GCTCGTGCTG
ATCGCCGCCT CGGCGCGCGG CCTCGCTCTC GGCCCGCCGA CCTGCCTTGC GGGACTCGCC
ACCGCGCTCC TCGTCCTGGC GCTGCGGCGG GAGGGTTTGG CCGAACTGGT CCGGGACGTG
TCCTGGAGCG TGCTGCCACT CGTCGCCGGG CTGTTCGTGC TGGTCGAGGC CCTGGAGAGG
ACCGGGGTGC TGCGCCTCGT CGCCGACACC CTGAGGGTGC AGGCGGGCGC CCATCCGGCC
GGCACCGCCT GGGGGGCGGG CGCGCTCGTC GCTTTGCTCT GCAACCTCCT CAACAATCTG
CCGGCCGGGC TGATCGCCGG CGCGGCGGTG CAGGCGGCGG AGGTCTCCGA CAGGATCGCG
GGCGCGATCC TGATCGGGGT CGATCTCGGC CCCAACCTCT CGGTCACGGG CTCCCTCGCC
ACGATCCTCT GGCTCACCGC GATCCGGCGG GAGGGGCAGC ATGTCGGGGC GTGGCGCTTC
CTGGCGCTGG GAGCGCTGGT GATGCCGCCC GCCCTCCTGC TGGCGCTCGC CGGCCTCCTC
CTCGTGCCCT GA
 
Protein sequence
MGALGVTPHL ATWGIAALAT LGVILRPLGW PEAVWAVLGA IVLVALGLLP AGTAWDGVLK 
GTDVYLFLIG MMLLAEVARK EGLFDWLAGI AVRRARGSAT RLFTLVYAVG TVVTVFLSND
ACAVVLTPAV ACAAKAARVR DPLPYLLVCA FIANAASFVL PISNPANLVV YAAHMPPLAE
WLARFTLPSA LAILATYAAL RLTQGPTLRA QEVATDVPRA DLSRTGLVAG LGILATGLVL
IAASARGLAL GPPTCLAGLA TALLVLALRR EGLAELVRDV SWSVLPLVAG LFVLVEALER
TGVLRLVADT LRVQAGAHPA GTAWGAGALV ALLCNLLNNL PAGLIAGAAV QAAEVSDRIA
GAILIGVDLG PNLSVTGSLA TILWLTAIRR EGQHVGAWRF LALGALVMPP ALLLALAGLL
LVP