Gene SeAg_B2121 details

Gene Information

Locus tag: SeAg_B2121
Symbol:
ID: 6796146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 2041456
End bp: 2043216
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 58%
IMG OID: 642776335
Product: arsenical pump-driving ATPase
Protein accession: YP_002146960
Protein GI: 197249294
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
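
The coordinate and length fields in this record can be checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 2041456
END_BP = 2043216


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under those assumptions the results agree with the listed Gene Length (1761 bp) and Protein Length (586 aa).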


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000832973
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGATGC TTCGGCAGGT GCCCCCCTTT TTGTTTTTTA CCGGTAAAGG CGGTGTGGGT 
AAAACATCGC TGGCCTGTGC TACCGCGATT CATCTGACCG CCTCTGGCAA ACGTGTACTG
TTGGTCAGCA CCGATCCGGC TTCCAATGTC GCGCAGGTGT TTGAACAGAC CATCGGCCAT
CAGATAACCT CGGTTGCAGC GGTTAACAGA CTGTCCGCGC TGGAAGTCGA TCCATCTGCT
GCCGCAGCGG CATACCGTGA ACGCATTGTC GGGCCGGTAC GGGGGATCCT GCCAGACGAT
ATCATGGCGG GTATTGAGGA ACAGCTCTCC GGTGCCTGTA CCACCGAGAT TGCCGCCTTT
GACGAATTCA CCGCGCTGCT GACTAATCAG CAACTGCGCG ATGAGTACGA TCATATTGTG
TTCGACACCG CGCCAACCGG GCATACGCTA CGAATGCTGC AACTGCCCGG CGCGTGGAGC
GGTTATCTCG ACAACAGCCA GCACGGTGCG TCCTGTCTCG GCCCTCTGGT CGGGCTGGAA
AAACAGCGCA GCCAGTATCG CGCCGCCGTA GACGCACTGG CGAATGCGGA ACTGACGCGG
ATGGTGCTGG TCGCCAGAGC ACAAACCGCA ACGCTGAAAG AAGTATCGCG TACCTATGAC
GAACTGGCCG CCATCGGTCT GACGCAGCAG TATCTGGTCA TTAACGGCCT GTTGCCTGAG
CAGGAAACCG TACGCGATAA GCTGGCGCAG GCACTGTATC AGCGTGAACA ACAGGCGCTG
CAACATTTAC CTGATAACCT GCGCGCATTG CCCTGCGATC GCCTGCCGTT AAAACCGTTC
AATATGGTAG GGCTGGCGGC ATTACGGGGT CTGTTGGACG ACAGTTCAAC TGGCTCCCCG
GCGGAAGTCG GACATATCTC CCCCGTAGAT CTTCCTTCAT TGTCATCACT GATCGACGGA
TTCGCGTCGC AGGGACATGG CCTGATCATG CTGATGGGCA AAGGCGGCGT GGGGAAAACT
ACGCTGGCAG CGGCGATTGC CGTTGAACTG GCTCGTCGCG GCTATCCTGT TCACCTGTCC
ACATCCGATC CGGCGGCGCA CCTGACTGAC ACGCTGGACG GCTCATTCGA CGGTCTGAGC
GTCAGCCGTA TCGACCCGCA GGCTGAAACC GAGCGTTACC GGCAGCAGGT GATGGCTGAG
CAGGGTAAAA ACCTCGACGA ACAGGGGCGT GCCGTTCTTG AAGAAGATCT GCGTTCTCCC
TGTACGGAAG AAATTGCCGT GTTTCAGGCT TTTTCACGCA TCATTCAGGA GGCAGGTAAG
CAGTTTGTCG TTATGGATAC AGCGCCAACC GGCCATACAT TGCTACTGCT TGACGCCACC
GGCGCTTACC ACCGTGAGAT TGCCCGACTG GCCGGTGAGC ACGGTCAGCC TGTACTGACG
CCCATGATGC GTCTACAGGA CAGCGAACAG ACAAAAGTTC TCATCGCCAC GCTGGCGGAA
ACCACACCGG TACTGGAAGC CGCTCATTTG CAGGACGATC TGCGCCGTGC AGGGATTGAA
CCGTGGGGCT GGGTAATCAA CAACAGCCTG ATCAATACGC CGACTACATC GCCGCTGCTG
CGCCAGCGTG CTGAACGTGA ACGGTCGCAG ATTGATGCCG TTTGTACCCA CCATGCCCGC
CGCTGCGCGC TGGTGCCGTT ACAGGCCGAA GAGCCTGTTG GTGTCGAACG TTTGCTACAA
CTGAGCACAA CGGGAAAATA A
 
Protein sequence
MLMLRQVPPF LFFTGKGGVG KTSLACATAI HLTASGKRVL LVSTDPASNV AQVFEQTIGH 
QITSVAAVNR LSALEVDPSA AAAAYRERIV GPVRGILPDD IMAGIEEQLS GACTTEIAAF
DEFTALLTNQ QLRDEYDHIV FDTAPTGHTL RMLQLPGAWS GYLDNSQHGA SCLGPLVGLE
KQRSQYRAAV DALANAELTR MVLVARAQTA TLKEVSRTYD ELAAIGLTQQ YLVINGLLPE
QETVRDKLAQ ALYQREQQAL QHLPDNLRAL PCDRLPLKPF NMVGLAALRG LLDDSSTGSP
AEVGHISPVD LPSLSSLIDG FASQGHGLIM LMGKGGVGKT TLAAAIAVEL ARRGYPVHLS
TSDPAAHLTD TLDGSFDGLS VSRIDPQAET ERYRQQVMAE QGKNLDEQGR AVLEEDLRSP
CTEEIAVFQA FSRIIQEAGK QFVVMDTAPT GHTLLLLDAT GAYHREIARL AGEHGQPVLT
PMMRLQDSEQ TKVLIATLAE TTPVLEAAHL QDDLRRAGIE PWGWVINNSL INTPTTSPLL
RQRAERERSQ IDAVCTHHAR RCALVPLQAE EPVGVERLLQ LSTTGK
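
The gene and protein sequences above can be cross-checked by translating the DNA. The sketch below is one way to do that in plain Python, assuming the space-separated 10-base blocks are pasted in as-is. It uses the residue assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the code does not handle ambiguous bases such as N.

```python
BASES = "TCAG"
# Residues for codons in the order TTT, TTC, TTA, TTG, TCT, ... (NCBI layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    first_line = "ATGCTGATGC TTCGGCAGGT GCCCCCCTTT TTGTTTTTTA CCGGTAAAGG CGGTGTGGGT"
    print(translate(first_line))  # MLMLRQVPPFLFFTGKGGVG
```

Translating the first line of the gene sequence gives the first 20 residues of the listed protein. Running `translate` and `gc_percent` on the full gene sequence allows a comparison with the Protein sequence and GC content fields. This gene starts with ATG, so the table 11 alternative start codons, which the sketch does not treat specially, do not come into play here.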