Gene Mchl_1049 details

Gene Information

Locus tag: Mchl_1049
Symbol:
ID: 7118552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 1064720
End bp: 1065787
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 66%
IMG OID: 643523842
Product: arsenical-resistance protein
Protein accession: YP_002419884
Protein GI: 218529068
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
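
The listed gene and protein lengths follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive (which is consistent with the listed 1068 bp) and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Coordinates as listed in the record; assumed 1-based and inclusive.
START_BP = 1064720
END_BP = 1065787


def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                   # 1068 bp
print(protein_length(length_bp), "aa")   # 355 aa
```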


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCTGT TCGAACGTTA CCTGACCCTC TGGGTCGCGC TCTGCATCGT GGCCGGCATC 
GCGCTCGGTT ACGTCATGCC CGGCTTCTTC CACGCCGTGG GCGAGGCGGA GGTCGCCAAG
GTGAACCTGC CGGTGGCCGC CCTGATCTGG CTCATGGTCA TCCCCATGCT GCTCAAGATC
GACTTTGCGT CCTTGCGCCA CGTCGGGCGG CACTGGCGCG GGATCGGCGT GACGCTGTTC
ATCAACTGGG CGGTGAAGCC GTTCTCGATG GCCGCGCTCG GCTGGCTGTT CATCGGCTAT
CTTTTTCGGT CCTACCTGCC CGCCGACCAG ATCGACAGCT ACATCGCCGG GCTCATCATC
CTGGCGGCGG CCCCCTGCAC CGCCATGGTG TTCGTCTGGT CGAACCTGAC GCGGGGCGAA
CCGCACTTCA CCCTGAGCCA GGTGGCGTTG AACGACAGCA TCATGGTGGT GGCCTTCGCC
CCCATCGTCG GGTTGCTGTT GGGCCTCTCG GCGATCACGG TGCCCTGGGG AACGCTAGTC
CTGTCGGTGG TGCTCTACAT CGTCATCCCG GTCATCATCG CGCAGGTGGT CCGCGGCAGC
CTTCTCGCCT CGGGCGGCCA AGCCGCCCTC GATCGACTCC TTGCCAAGCT CGGTCCGGTC
TCGCTGGTGG CGTTGCTGGC CACCCTGGTG CTGCTGTTCG GCTTCCAGGG CGAGCAGATC
CTGGCGCAGC CCGCGGTCAT CGGCCTGCTC GCGGTGCCCA TCCTCATCCA GGTCTACTTG
AACTCAGGGT TGGCTTACCT GCTGAACCGC GTCGCCGGCG AGCAGCACTG CGTCGCCGGA
CCCTCGGCCC TGATCGGTGC CTCAAACTTC TTCGAGCTTG CGGTGGCCGC CGCCATCAGC
CTGTTCGGTT TCAACTCGGG CGCGGCGCTC GCCACCGTTG TCGGCGTGCT CATCGAGGTC
CCCGTGATGC TGTCCGTGGT CTTGATCGTG AACCGCAGCC AGGGTTGGTA CGAGCGCGGC
GCGGCAGGCA AGGGAGCGGC CCTGAAGCCC GCCTCCCGTG AGACCTGA
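
The GC content field in the gene information can be recomputed from a sequence block laid out like the one above (groups of ten bases separated by spaces). The following is a minimal sketch; `gc_content` is a hypothetical helper, and how the database itself computes and rounds its percentage is not stated in the record.

```python
def gc_content(seq_block: str) -> float:
    """Percentage of G and C bases in a whitespace-formatted sequence block."""
    # Remove the spaces and line breaks used for display grouping.
    seq = "".join(seq_block.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: pass in the full gene sequence block; only its first line is shown here.
first_line = "ATGTCCCTGT TCGAACGTTA CCTGACCCTC TGGGTCGCGC TCTGCATCGT GGCCGGCATC"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```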
 
Protein sequence
MSLFERYLTL WVALCIVAGI ALGYVMPGFF HAVGEAEVAK VNLPVAALIW LMVIPMLLKI 
DFASLRHVGR HWRGIGVTLF INWAVKPFSM AALGWLFIGY LFRSYLPADQ IDSYIAGLII
LAAAPCTAMV FVWSNLTRGE PHFTLSQVAL NDSIMVVAFA PIVGLLLGLS AITVPWGTLV
LSVVLYIVIP VIIAQVVRGS LLASGGQAAL DRLLAKLGPV SLVALLATLV LLFGFQGEQI
LAQPAVIGLL AVPILIQVYL NSGLAYLLNR VAGEQHCVAG PSALIGASNF FELAVAAAIS
LFGFNSGAAL ATVVGVLIEV PVMLSVVLIV NRSQGWYERG AAGKGAALKP ASRET
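
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this, assuming the sequence is given in the coding orientation. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in the set of permitted start codons; since this gene starts with ATG, the sketch does not handle alternative starts. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid map shared by the standard table and table 11,
# built from the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_block: str) -> str:
    """Translate a whitespace-formatted DNA block, stopping at the first stop codon."""
    seq = "".join(seq_block.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        # Raises KeyError on ambiguous bases such as N.
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGTCCCTGT TCGAACGTTA CCTGACCCTC"))  # MSLFERYLTL
```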