Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B2121 |
Symbol | |
ID | 6796146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 2041456 |
End bp | 2043216 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642776335 |
Product | arsenical pump-driving ATPase |
Protein accession | YP_002146960 |
Protein GI | 197249294 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000832973 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGATGC TTCGGCAGGT GCCCCCCTTT TTGTTTTTTA CCGGTAAAGG CGGTGTGGGT AAAACATCGC TGGCCTGTGC TACCGCGATT CATCTGACCG CCTCTGGCAA ACGTGTACTG TTGGTCAGCA CCGATCCGGC TTCCAATGTC GCGCAGGTGT TTGAACAGAC CATCGGCCAT CAGATAACCT CGGTTGCAGC GGTTAACAGA CTGTCCGCGC TGGAAGTCGA TCCATCTGCT GCCGCAGCGG CATACCGTGA ACGCATTGTC GGGCCGGTAC GGGGGATCCT GCCAGACGAT ATCATGGCGG GTATTGAGGA ACAGCTCTCC GGTGCCTGTA CCACCGAGAT TGCCGCCTTT GACGAATTCA CCGCGCTGCT GACTAATCAG CAACTGCGCG ATGAGTACGA TCATATTGTG TTCGACACCG CGCCAACCGG GCATACGCTA CGAATGCTGC AACTGCCCGG CGCGTGGAGC GGTTATCTCG ACAACAGCCA GCACGGTGCG TCCTGTCTCG GCCCTCTGGT CGGGCTGGAA AAACAGCGCA GCCAGTATCG CGCCGCCGTA GACGCACTGG CGAATGCGGA ACTGACGCGG ATGGTGCTGG TCGCCAGAGC ACAAACCGCA ACGCTGAAAG AAGTATCGCG TACCTATGAC GAACTGGCCG CCATCGGTCT GACGCAGCAG TATCTGGTCA TTAACGGCCT GTTGCCTGAG CAGGAAACCG TACGCGATAA GCTGGCGCAG GCACTGTATC AGCGTGAACA ACAGGCGCTG CAACATTTAC CTGATAACCT GCGCGCATTG CCCTGCGATC GCCTGCCGTT AAAACCGTTC AATATGGTAG GGCTGGCGGC ATTACGGGGT CTGTTGGACG ACAGTTCAAC TGGCTCCCCG GCGGAAGTCG GACATATCTC CCCCGTAGAT CTTCCTTCAT TGTCATCACT GATCGACGGA TTCGCGTCGC AGGGACATGG CCTGATCATG CTGATGGGCA AAGGCGGCGT GGGGAAAACT ACGCTGGCAG CGGCGATTGC CGTTGAACTG GCTCGTCGCG GCTATCCTGT TCACCTGTCC ACATCCGATC CGGCGGCGCA CCTGACTGAC ACGCTGGACG GCTCATTCGA CGGTCTGAGC GTCAGCCGTA TCGACCCGCA GGCTGAAACC GAGCGTTACC GGCAGCAGGT GATGGCTGAG CAGGGTAAAA ACCTCGACGA ACAGGGGCGT GCCGTTCTTG AAGAAGATCT GCGTTCTCCC TGTACGGAAG AAATTGCCGT GTTTCAGGCT TTTTCACGCA TCATTCAGGA GGCAGGTAAG CAGTTTGTCG TTATGGATAC AGCGCCAACC GGCCATACAT TGCTACTGCT TGACGCCACC GGCGCTTACC ACCGTGAGAT TGCCCGACTG GCCGGTGAGC ACGGTCAGCC TGTACTGACG CCCATGATGC GTCTACAGGA CAGCGAACAG ACAAAAGTTC TCATCGCCAC GCTGGCGGAA ACCACACCGG TACTGGAAGC CGCTCATTTG CAGGACGATC TGCGCCGTGC AGGGATTGAA CCGTGGGGCT GGGTAATCAA CAACAGCCTG ATCAATACGC CGACTACATC GCCGCTGCTG CGCCAGCGTG CTGAACGTGA ACGGTCGCAG ATTGATGCCG TTTGTACCCA CCATGCCCGC CGCTGCGCGC TGGTGCCGTT ACAGGCCGAA GAGCCTGTTG GTGTCGAACG TTTGCTACAA CTGAGCACAA CGGGAAAATA A
|
Protein sequence | MLMLRQVPPF LFFTGKGGVG KTSLACATAI HLTASGKRVL LVSTDPASNV AQVFEQTIGH QITSVAAVNR LSALEVDPSA AAAAYRERIV GPVRGILPDD IMAGIEEQLS GACTTEIAAF DEFTALLTNQ QLRDEYDHIV FDTAPTGHTL RMLQLPGAWS GYLDNSQHGA SCLGPLVGLE KQRSQYRAAV DALANAELTR MVLVARAQTA TLKEVSRTYD ELAAIGLTQQ YLVINGLLPE QETVRDKLAQ ALYQREQQAL QHLPDNLRAL PCDRLPLKPF NMVGLAALRG LLDDSSTGSP AEVGHISPVD LPSLSSLIDG FASQGHGLIM LMGKGGVGKT TLAAAIAVEL ARRGYPVHLS TSDPAAHLTD TLDGSFDGLS VSRIDPQAET ERYRQQVMAE QGKNLDEQGR AVLEEDLRSP CTEEIAVFQA FSRIIQEAGK QFVVMDTAPT GHTLLLLDAT GAYHREIARL AGEHGQPVLT PMMRLQDSEQ TKVLIATLAE TTPVLEAAHL QDDLRRAGIE PWGWVINNSL INTPTTSPLL RQRAERERSQ IDAVCTHHAR RCALVPLQAE EPVGVERLLQ LSTTGK
|
| |