Gene Mkms_3341 details

Gene Information

Locus tag: Mkms_3341
Symbol:
ID: 4611267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 3503553
End bp: 3504818
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 71%
IMG OID: 639793014
Product: arsenite-transporting ATPase
Protein accession: YP_939325
Protein GI: 119869373
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
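
The coordinates and lengths listed above agree with one another, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3503553
end_bp = 3504818

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1266 421
```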


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0464361
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.354276
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCCCA GTGGCGCCGC GGCCCGCATC AGCCTGTTCG TCGGCAAGGG CGGGGTAGGT 
AAGTCCACGC TGGCGACGGC CACCGCCGTC CGCGAGGCGC GGGCGGGTCG TCGTGTGCTC
ATCGTGTCCA CCGACCAGGC GCACTCCACC GGTGACGTGC TGGGGGAGAC GGTCACCCCG
ACCGGGCGGC GGGAACCGAC CCGGATCCTC GCCGACCTCG ACGCGGGCAC CCCGGACGCC
GGTGGCACAC TCGACGCGCT GGCCCTCGAC ACGCTCGCGC TGCTCACCGA GCGGTGGCGG
GAGATCGCCG GGCCGGTCAC CGCCAGGTTC CCCGACTCGG ACCTGGGTGA TGTTGCGCCA
GAAGAACTCT CGGCCCTGCC CGGGGTGCAG GAGGTGCTCG GACTGCACGA GGTCGCCGAG
CTGGCGGCGA GCGGTCTGTG GGAGCACGTC GTCGTCGACT GCGCCTCCAC CGCGGATGCG
CTGCGCATGC TGACGCTGCC CGGCACGCTC GCGCTCTACC TGGAGCGGGC GTGGCCCAGG
CACCGCCGGC TGTCGCGCAG CGCCGACGAT GCCGCGTCGG CCGCGATGGT GGACCTCGTC
GAACGCATCG ACGCGGCGAC CGGGCGGTTG ACCGCCCTGC TCGCCGACGC GTCACAGGTC
AGTGCGCATC TGGTGCTCAC CGCCGAACGG GTGGTGGCCG CCGAGGCGTC GCGCACGCTG
GGCTCGCTTT CGCTGATGGG TGTGCGGGTG GCCGAGCTGA TCGTCAATCA AGTTCTGCTG
CAAGATGATT CGTTTGAGTA TCGGAACCTG CCCGAACATC CGGCGTTCGA CTGGTACGCC
GAACGCATCT CCGAGCAGAA GTCGGTGCTC GACCACCTCG ACACCGCGAT CGGGGACGTG
GCGCTGGTGC TGGTGCCCCA CCTGCCCGGG GAGCCGATCG GCCCCAAGGC GTTGGGCGAA
CTGCTCGACG CCGCGCGCAG GCGTGACGGA TCGGCCCCGC CGGCGCCGGT GCGGCCGATC
GTCGACCGGG AGTCGGGCAC CGGACTCGAT GCGGTGTACC GGTTGCGGTT AGAGTTGCCG
CAGGTCGATC CCGGCGAACT CACGTTGGGC CGGGTCGACG ACGACCTGAT CATCGGCGCA
GGCGGTATGC GGCGCCGCGT CCGACTCGCG TCCGTGCTGC GCAGGTGCAT CGTCACCGAT
GCGGCGCTGC GGGGAAGCGA GCTGACCGTG CGATTTCGAC CGAATCCGGA GGTGTGGCCG
GCGTGA
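
The GC content field in the gene information could be recomputed from a sequence laid out like the one above. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the full gene sequence, with the spaces and line breaks of the 10-base layout ignored. The function name gc_content is hypothetical and is not part of the source database.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (the spaces and newlines of the 10-base block layout)
    is removed before counting.
    """
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Example on the first 10-base block of the gene sequence only;
# the whole-gene value requires passing the full sequence.
first_block = "GTGAACCCCA"
print(f"{gc_content(first_block):.0%}")  # prints: 60%
```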
 
Protein sequence
MNPSGAAARI SLFVGKGGVG KSTLATATAV REARAGRRVL IVSTDQAHST GDVLGETVTP 
TGRREPTRIL ADLDAGTPDA GGTLDALALD TLALLTERWR EIAGPVTARF PDSDLGDVAP
EELSALPGVQ EVLGLHEVAE LAASGLWEHV VVDCASTADA LRMLTLPGTL ALYLERAWPR
HRRLSRSADD AASAAMVDLV ERIDAATGRL TALLADASQV SAHLVLTAER VVAAEASRTL
GSLSLMGVRV AELIVNQVLL QDDSFEYRNL PEHPAFDWYA ERISEQKSVL DHLDTAIGDV
ALVLVPHLPG EPIGPKALGE LLDAARRRDG SAPPAPVRPI VDRESGTGLD AVYRLRLELP
QVDPGELTLG RVDDDLIIGA GGMRRRVRLA SVLRRCIVTD AALRGSELTV RFRPNPEVWP
A
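
The protein sequence starts with M although the gene sequence starts with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons, and an initiating codon is read as methionine. The sketch below illustrates this under stated assumptions: the input is a complete CDS in frame, the codon assignments follow the NCBI ordering for table 11, and the names translate_cds, CODON_TABLE and START_CODONS are hypothetical helpers rather than anything provided by the source database.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiating start codon is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAACCCCA GTGGCGCCGC GGCCCGCATC"))  # prints: MNPSGAAARI
```

The printed peptide matches the first ten residues of the protein sequence. Applied to the full gene sequence, the same function should stop at the terminal TGA codon.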