Gene Mvan_3222 details

Gene Information

Locus tag: Mvan_3222
Symbol:
ID: 4647622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3431296
End bp: 3432399
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 66%
IMG OID: 639806698
Product: arsenical-resistance protein
Protein accession: YP_954029
Protein GI: 120404200
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
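
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but the listed numbers are consistent with both. The function names are hypothetical and used for illustration only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Mvan_3222.
length = gene_length(3431296, 3432399)
print(length)                  # 1104
print(protein_length(length))  # 367
```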


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACA CGTCCACGGC GTCGCCGGCG GTCGTCGCGA AGCTGTCCAC CCTCGACCGC 
TTCCTGCCCC TGTGGATCGG CGCCGCGATG GTCGCCGGTC TGCTTCTGGG CCGCTGGATA
CCGGGCCTGA ACACGGCGCT GGAAAGCGTT GCCATCGACG GTGTTTCATT GCCGATCGCG
CTGGGTCTGC TGATCATGAT GTACCCGGTG CTGGCGAAGG TCCGCTACGA CCGCCTCGAC
ACCGTCACCG GCGACCGCAG GCTGCTGATC TCCTCGCTGG TGCTGAACTG GGTGTTCGGC
CCGGCGCTGA TGTTCGCGCT CGCCTGGCTG ATGCTGCCGG ACCTGCCCGA ATACCGCACC
GGGCTGATCA TCGTCGGCCT GGCGCGCTGC ATCGCGATGG TCATCATCTG GAACGACCTC
GCCTGCGGCG ACCGCGAAGC CGCCGCCGTG CTGGTCGCGC TGAACTCGAT CTTCCAGGTG
TTCATGTTCG CGGTGCTCGG CTGGTTCTAC CTGTCGGTGC TTCCCGGCTG GCTGGGGCTT
GAACAGACCA CCATCGACAC GTCGCCGTGG CAGATCGCGA AATCGGTGCT GATCTTCCTG
GGCATCCCGC TGCTCGCCGG ATACCTGTCG CGCCGCATCG GCGAGAAGAC CAAGGGCAGG
GCGTGGTACG AGTCGAGGTT CCTGCCCGTC ATCGGACCGT GGGCGCTCTA CGGACTGCTG
TTCACCATCG TGATCCTCTT TGCGCTGCAA GGCGATCAGA TCACCAACCG ACCCTGGGAC
GTGGCCCGCA TCGCGTTGCC GCTGCTGGTC TACTTCGCGG TCATGTGGGG CGGCGGCTAC
GCCCTCGGCG CCGCGCTGCG CCTGGGCTAC GAGCGGACCA CCACGCTGGC CTTCACCGCC
GCGGGGAACA ACTTCGAACT CGCCATCGCC GTCGCCATCG CCACCTACGG CGCCACCTCC
GGGCAGGCGC TCGCCGGGGT GGTCGGACCG TTGATCGAGG TCCCGGTGCT CGTCGCATTG
GTCTACGTCT CGCTGGCGCT GCGGCGTAGA TTCCCCGATA GCACCACCGA TTCGACGAGG
ACGAAGGAAG TAGTCGATGA CTGA
 
Protein sequence
MSDTSTASPA VVAKLSTLDR FLPLWIGAAM VAGLLLGRWI PGLNTALESV AIDGVSLPIA 
LGLLIMMYPV LAKVRYDRLD TVTGDRRLLI SSLVLNWVFG PALMFALAWL MLPDLPEYRT
GLIIVGLARC IAMVIIWNDL ACGDREAAAV LVALNSIFQV FMFAVLGWFY LSVLPGWLGL
EQTTIDTSPW QIAKSVLIFL GIPLLAGYLS RRIGEKTKGR AWYESRFLPV IGPWALYGLL
FTIVILFALQ GDQITNRPWD VARIALPLLV YFAVMWGGGY ALGAALRLGY ERTTTLAFTA
AGNNFELAIA VAIATYGATS GQALAGVVGP LIEVPVLVAL VYVSLALRRR FPDSTTDSTR
TKEVVDD
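
As a rough way to relate the gene sequence to the protein sequence and to the GC content field, the sketch below translates a DNA string and computes its GC fraction. It is an illustration under stated assumptions, not the database's own method: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, whose amino acid assignments are shared by translation table 11 (table 11 differs only in permitted start codons, which is not relevant here because this gene starts with ATG).

```python
# Standard codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record.
first_line = "ATGAGTGACA CGTCCACGGC GTCGCCGGCG"
print(translate(first_line))  # MSDTSTASPA
```

Pasting the full gene sequence into `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the 66% GC content field.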