Gene M446_0520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_0520 
Symbol 
ID6129261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp618052 
End bp619107 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content72% 
IMG OID641640842 
Productarsenical-resistance protein 
Protein accessionYP_001767517 
Protein GI170738862 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00313463 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGGCACCT TCGAACGCTA CCTGACCCTC TGGGTCGCCC TCTGCATCGT CGTCGGCGTC 
GCGCTCGGCC ACGCCGTGCC GGGCGTCTTC CGCGCCGTCG GCGCGGCCGA GATCGCCAAG
GTCAACCTGC CGGTGGCGGG GCTGATCTGG CTCATGGTCG TGCCCATGCT GCTCAAGATC
GACTTCGCCG CGCTGCGCCA CGTCGGGCGG CACTGGCGCG GCATCGGCGT GACGCTCCTG
GTCAACTGGG CCGTGAAGCC GTTCTCGATG GCGGCGCTGG GCTGGCTGTT CATCGGCCAC
CTGTTCCGGC CGCTGCTGCC GGCCGACCAG ATCGACGGCT ACGTCGCCGG GCTGATCATC
CTGGCCGCGG CGCCCTGCAC CGCGATGGTG TTCGTGTGGT CGAACCTCAC GCGGGGCGAG
CCGCACTTCA CGCTGAGCCA GGTGGCGCTC AACGACGGCA TCATGGTGGT GGCCTTCGCG
CCCATCGTCG GGCTGCTGCT CGGGCTCTCG GCCATCACGG TCCCGTGGGG CACGCTGGTC
CTCTCGGTCG TGCTCTACAT CGTCATCCCG GTCGTGGTCG CGCAGGCGGC CCGGCGCAGC
CTGCTCGCCG CGGGCGGCCA GCCCGCCCTC GACCGCCTCC TGGCGCGGCT GGGGCCGGCC
TCGCTGGCGG CGCTGCTGGC GACCCTCGTC CTGCTGTTCG GCTTCCAGGG CGAGCAGATC
CTGGCCCAGC CGGCCGTGAT CGCGCTCCTG GCGGTCCCCA TCCTGATCCA GGTCTACCTG
AACAGCGGGC TCGCCTACCT CCTGAACCGG CTCGCCGGGG AGCAGCACTG CGTCGCCGGC
CCCTCCGCGC TGATCGGGGC CTCCAACTTC TTCGAACTCG CGGTGGCGGC GGCCATCTCG
CTGTTCGGGT TCGAGTCGGG CGCGGCGCTC GCCACGGTGG TGGGCGTGCT GATCGAGGTC
CCGGTCATGC TCAGCGTGGT CCGGATCGTG AACCGCTCCA AGGACTGGTA CGAGCGCGGG
GCGGCGCGGG CGCGGCTCGC GCCCCAGGAG GGGTGA
 
Protein sequence
MGTFERYLTL WVALCIVVGV ALGHAVPGVF RAVGAAEIAK VNLPVAGLIW LMVVPMLLKI 
DFAALRHVGR HWRGIGVTLL VNWAVKPFSM AALGWLFIGH LFRPLLPADQ IDGYVAGLII
LAAAPCTAMV FVWSNLTRGE PHFTLSQVAL NDGIMVVAFA PIVGLLLGLS AITVPWGTLV
LSVVLYIVIP VVVAQAARRS LLAAGGQPAL DRLLARLGPA SLAALLATLV LLFGFQGEQI
LAQPAVIALL AVPILIQVYL NSGLAYLLNR LAGEQHCVAG PSALIGASNF FELAVAAAIS
LFGFESGAAL ATVVGVLIEV PVMLSVVRIV NRSKDWYERG AARARLAPQE G