Gene Mvan_5095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5095 
Symbol 
ID4644147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5461251 
End bp5463440 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content68% 
IMG OID639808569 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_955872 
Protein GI120406043 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGG TGGAAGTGTT CGAGCCGGCC CTGTGCTGCG CCACCGGAGT CTGCGGCGAG 
GACGTCGACC AGCAGTTGGT GACGTTCTCC GCGGACCTGG ATTTCGTCTC CGGCCGTGGC
GGCGACATTT CTCGCTACAA CCTGGCCGGC GAGCCTCGCG CGTTCGCCGA GAACGACACC
GTGCGAGCCT TTCTCCACGT CGCCGGATCC GTCGGCCTAC CGCTGGTACT GGTGGACGGA
GTCACCGTGA TGACCGGGCG CTACCCCGAC CGCAGCCAGC TCGCCGCCTG GGCGGGGATC
GACACGCCGG ACGGCAAGAT CCCGCTCGAC ATCGAGGACA CCACCGCCGC TGGAGGCGGT
TGCTGCGGCA GCGACAGTTC CGGCGCCACC CAGCTGCTGT TAGAGAGTGC CCGACCGATG
ACCTTGAGGT TCCTCGACGC TCCACCACGG TTCTTGTTCT TCACCGGCAA AGGCGGGGTG
GGCAAGACCT CGATCGCGTG TGCCGCCGCA ATCCACCTGG CGCGCAACGG CAAACGCGTT
CTGCTCGTCA GCACCGACCC GGCGTCCAAC GTCGGCCAGG TGTTCGGACT GCGCGTCGGC
AACGCCATCA CGACGGTTCC CGCCGTCGAG GGCCTGTCGG CGCTGGAGAT CGATCCCGAA
CAGGCCGCAG AGGCCTACCG GGAACGCATC GTCGCCCCCG TGCGAGGCCT GTTGCCCGAC
AGGGAGGTGC AGTCGATCAC CGAACAACTC TCCGGCTCAT GCACCACCGA GATCGCCTCG
TTCAACGAAT TCACCGAACT GCTCACCGAT TCCGACGCCC TGACGGGCCA GTTCGACCAC
GTGCTGTTCG ACACCGCACC AACCGGGCAC ACCATCCGAC TACTGCAGCT GCCGGGCTCG
TGGACGGACT TCCTCAACGA AGGCAAGGGC GACGCGTCCT GCCTCGGACC GCTGTCGGGT
CTGGACAAGC AGCGAGCCAT CTACGCCGAG GCAGTCGAGG CGTTGGCAGA TCCCCTGCGC
ACCCGCCTGG TCCTCGTCGC CCGCGCCCAG CGTTCCACTC TCGCCGAGAT CACCCGCACA
CACCGGGAAC TCGCTGCGAT CGGCCTGACC CGCCAGTACG TCGTGATCAA CGGGGTGCTT
GCTTCGCCCG CCGGAAACGG CGACCCGCTC GCGGCGGCGA TCCACACGCG CGAGCAACAG
GCCATCGCGG CTCTTCCCGA CGAGCTTCGG GGACTACCGC TCGATCAGGT CGAGCTGAAG
GCCACCAACA TCGTGGGCGT GGACGCGCTG GACTCGCTGT TCACCTCGGA CGCCCGCATC
CCCGAAGGCG ATGACGGTAC GGGTCTCCCC GTGGTGGACG CCCCGCTGTC GGCCCTGATC
GACGAACTCG CCGAGGGCGA CCACGGTTTG ATCATGTGCA TGGGCAAGGG CGGCGTCGGC
AAGACGACCA TCGCCGCCGC GATCGCCGTT GCCCTCGCCG ACCGCGGCCA CCCCGTGCAC
CTGACGACCA CCGATCCCGC CGGGCACCTC TCCGGGACGC TGCACGGAAC ACTGCCGAAT
CTGCATGTGT CGAGCATCGA CCCGGTCGAG GCGACCCGTG CATACCGCGA TCATGTGCTC
GCCACCAAAG GCGCCGCGCT CGACGAGCAG GGCCGTTCGA TGCTCGAAGA GGACCTCCGC
TCGCCCTGCA CCGAGGAAGT TGCTGTGTTC CAGGCATTTT CGCGGGTGAT CAGTGAGTCG
AGAAACAAGT TCGTCGTAGT CGACACCGCA CCGACCGGAC ACACCCTGCT GCTGCTCGAT
GCCACCGGCT CCTACCATCG CGAGGTGGCT CGGCAGCTCG GCAACCGGCA TTTCACCACG
CCCCTGATGC GCTTGCAGGA CCCCGAGCTC ACCAAGGTCA TCGTCGTCAC ACTCGCCGAG
ACGACGCCGG TCCTGGAGGC CGCCGGGCTG CAGAGCGAAC TCGAACGCGC CCAGATCCAG
CCGTGGGCGT GGGTGGTCAA CAACTCGCTC GCCGCCGCGC ACCCGACGTC ACCGCTGCTG
CGTCAGCGCG CGGTCGCCGA GCTACCGCAA ATCGACAAGG TCCGAACGGA TTACGCGGAT
CGGGTCGCGG TCATCCCACT GCTGGCCTGC GAGCCCGTCG GCATCCCCGC GCTGGAAGCG
CTCGCCGGGT CACGGACCGC CGCGGTATAG
 
Protein sequence
MSKVEVFEPA LCCATGVCGE DVDQQLVTFS ADLDFVSGRG GDISRYNLAG EPRAFAENDT 
VRAFLHVAGS VGLPLVLVDG VTVMTGRYPD RSQLAAWAGI DTPDGKIPLD IEDTTAAGGG
CCGSDSSGAT QLLLESARPM TLRFLDAPPR FLFFTGKGGV GKTSIACAAA IHLARNGKRV
LLVSTDPASN VGQVFGLRVG NAITTVPAVE GLSALEIDPE QAAEAYRERI VAPVRGLLPD
REVQSITEQL SGSCTTEIAS FNEFTELLTD SDALTGQFDH VLFDTAPTGH TIRLLQLPGS
WTDFLNEGKG DASCLGPLSG LDKQRAIYAE AVEALADPLR TRLVLVARAQ RSTLAEITRT
HRELAAIGLT RQYVVINGVL ASPAGNGDPL AAAIHTREQQ AIAALPDELR GLPLDQVELK
ATNIVGVDAL DSLFTSDARI PEGDDGTGLP VVDAPLSALI DELAEGDHGL IMCMGKGGVG
KTTIAAAIAV ALADRGHPVH LTTTDPAGHL SGTLHGTLPN LHVSSIDPVE ATRAYRDHVL
ATKGAALDEQ GRSMLEEDLR SPCTEEVAVF QAFSRVISES RNKFVVVDTA PTGHTLLLLD
ATGSYHREVA RQLGNRHFTT PLMRLQDPEL TKVIVVTLAE TTPVLEAAGL QSELERAQIQ
PWAWVVNNSL AAAHPTSPLL RQRAVAELPQ IDKVRTDYAD RVAVIPLLAC EPVGIPALEA
LAGSRTAAV