Gene SNSL254_A2187 details

Gene Information

Locus tag: SNSL254_A2187
Symbol:
ID: 6485443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2101175
End bp: 2102935
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 58%
IMG OID: 642737536
Product: arsenical pump-driving ATPase
Protein accession: YP_002041278
Protein GI: 194443345
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
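
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


print(gene_length(2101175, 2102935))  # 1761
print(protein_length(1761))           # 586
```

Both results agree with the Gene Length (1761 bp) and Protein Length (586 aa) values listed in the record.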


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGATGC TTCGGCAGGT GCCCCCCTTT TTGTTTTTTA CCGGTAAAGG CGGTGTGGGT 
AAAACATCGC TGGCCTGTGC TACCGCGATT CATCTGACCG CCTCTGGCAA ACGTGTACTG
TTGGTCAGCA CCGATCCGGC TTCCAATGTG GCGCAGGTAT TTGAACAGAC TATCGGTCAT
CAGATAACCC CGATTGCAGC GGTCAACGGA CTGGCTGCGC TGGAAGTCGA TCCGTCCGCT
GCCGCAGCGG CATACCGTGA GCGCATTGTT GGACCGGTGC GTGGAATTCT GCCGGACGAC
ATCGTAGCGG GTATTGAGGA GCAGCTTTCC GGTGCCTGTA CCACCGAAAT AGCCGCCTTT
GACGAATTCA CCGCGCTGCT GACTAATCAG CAACTGCGCG ATGAGTACGA TCATATTGTG
TTCGACACCG CGCCAACCGG GCATACGCTA CGAATGCTGC AACTGCCCGG CGCGTGGAGC
GGTTATCTCG ACAACAGCCA GCACGGTGCG TCCTGTCTCG GACCGCTGGC CGGGCTGGAA
AAACAGCGCA GCCAGTATCG TGCCGCCGTA GACGCACTGG CGAATGCGGA ACTGACGCGA
ATGGTGCTGG TCGCCAGAGC GCAAACCGCA ACATTGAAGG AAGTATCGCG CACTTATGAC
GAACTAGCCG CCATCGGCCT GACGCAGCAG TATCTGGTCA TTAACGGCCT GTTGCCTGAG
CAGGAAACCG CACGCGATAA GCTGGCGCAG GCACTGTATC AGCGTGAACA ACAGGCGCTG
CAACATTTAC CTGATAACCT GCGCGCATTG CCCTGCGATC GCCTGCCGTT AAAACCGTTC
AATATGGTAG GGCTGGCGGC ATTACGGGGT CTGTTGGACG ACAGTTCAAC TGGCTCCCCG
GCGGAAGTCG GAGATATCTC CCCCGTAGAT CTTCCTTCAT TGTCATCACT AATCGACGGG
TTCGCATCTC AGGGACATGG CCTGATCATG TTGATGGGCA AAGGCGGCGT GGGGAAAACC
ACGCTGGCGG CGGCTATTGC CGTTGAACTG GCCCGTCGCG GTTATTCGGT TCACCTGTCC
ACATCCGATC CGGCGGCGCA CCTGACCGAC ACGCTGGACG GCTCATTCGA CGGTCTGACC
GTCAGTCGTA TCGATCCGCA GGCCGAAACC GAGCGTTACC GGCAGCAGGT AATAGCTGAA
CAGGGTAAAA ACCTCGACGA ACAGGGGCGT GCCGTTCTTG AAGAGGATCT GCGTTCTCCC
TGTACGGAAG AAATTGCCGT GTTTCAGGCT TTTTCACGCA TCATTCAGGA AGCGGGTAAG
CAGTTTGTCG TTATGGATAC AGCGCCAACC GGCCATACAT TGCTACTGCT TGACGCCACT
GGCGCTTACC ACCGTGAGAT TGCCCGACTG GCCGGGGAAC ACGGTCAGCC CGTACTGACG
CCCATGATGC GCCTACAGGA CAGCGAGCAG ACGAAAGTTC TCATCGCCAC GCTGGCGGAA
ACCACGCCGG TGCTGGAAGC CGCTCATTTG CAGGACGACC TGCGTCGCGC GGGGATTGAA
CCGTGGGGCT GGGTCATTAA TAACAGCCTG ATCAATACGC CGACTACATC GCCGCTGCTG
CGCCAGCGGG CCGAACGCGA ACGGTCGCAG ATTGATGCCG TCTGTACCCA CCATGCCCGA
CGCTGTGCGC TGGTGCCGCT ACAGGCGGAA GAGCCTGTCG GCGTTGAGCG TTTACTACAA
CTGAGCACAA CGGGAAAATA A
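
The gene sequence above is laid out in blocks of 10 bases, so the spacing has to be removed before any computation. As a rough sketch of how the GC content field could be recomputed from this block, assuming the sequence is pasted in as a plain string (the helper names `clean_sequence` and `gc_percent` are hypothetical):

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used here only as a small example.
fragment = clean_sequence("ATGCTGATGC TTCGGCAGGT")
print(len(fragment), gc_percent(fragment))
```

Running `gc_percent` on the full cleaned gene sequence gives a value that can be compared against the GC content reported under Gene Information.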
 
Protein sequence
MLMLRQVPPF LFFTGKGGVG KTSLACATAI HLTASGKRVL LVSTDPASNV AQVFEQTIGH 
QITPIAAVNG LAALEVDPSA AAAAYRERIV GPVRGILPDD IVAGIEEQLS GACTTEIAAF
DEFTALLTNQ QLRDEYDHIV FDTAPTGHTL RMLQLPGAWS GYLDNSQHGA SCLGPLAGLE
KQRSQYRAAV DALANAELTR MVLVARAQTA TLKEVSRTYD ELAAIGLTQQ YLVINGLLPE
QETARDKLAQ ALYQREQQAL QHLPDNLRAL PCDRLPLKPF NMVGLAALRG LLDDSSTGSP
AEVGDISPVD LPSLSSLIDG FASQGHGLIM LMGKGGVGKT TLAAAIAVEL ARRGYSVHLS
TSDPAAHLTD TLDGSFDGLT VSRIDPQAET ERYRQQVIAE QGKNLDEQGR AVLEEDLRSP
CTEEIAVFQA FSRIIQEAGK QFVVMDTAPT GHTLLLLDAT GAYHREIARL AGEHGQPVLT
PMMRLQDSEQ TKVLIATLAE TTPVLEAAHL QDDLRRAGIE PWGWVINNSL INTPTTSPLL
RQRAERERSQ IDAVCTHHAR RCALVPLQAE EPVGVERLLQ LSTTGK
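
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. That table uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons, so a standard codon table is enough for a sketch. The code below is an illustrative, dependency-free version; `translate` is a hypothetical helper, and it assumes the input begins in frame at the first codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence.
print(translate("ATGCTGATGC TTCGGCAGGT GCCCCCCTTT"))  # MLMLRQVPPF
```

The output matches the first block of the protein sequence, and the final codons of the gene (`CTG AGC ACA ACG GGA AAA TAA`) translate to the closing `LSTTGK` followed by the stop codon.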