Gene Dtpsy_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtpsy_2139 
Symbol 
ID7383074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax ebreus TPSY 
KingdomBacteria 
Replicon accessionNC_011992 
Strand
Start bp2285265 
End bp2287028 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content61% 
IMG OID643655456 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_002553592 
Protein GI222111328 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAT TCCTGCAACT TCCTTCCCGC TTTCTGTTTT TCACGGGCAA GGGCGGCGTC 
GGCAAAACCT CGATTGCCTG TGCCACGGCT ATTCAATTGG CCGAAGCCGG AAAGCGCGTC
CTCCTGGTCA GTACCGACCC GGCATCCAAC GTTGGGCAGG TATTTGGTGT TGATATCGGT
AATCGCGTCA CACCGATTCC GGCGGTTCCA CGTCTTTCTG CTCTGGAGAT TGATCCCGAG
GCAGCGGCCA GCGCCTATCG GGAGCGCCTG GTCGGCCCGG TACGCGGCGT GCTTCCTGAT
GACGTGGTGA AGGGCATCGA AGAATCGTTG TCCGGCGCGT GCACCACCGA AATCGCCGCA
TTTGACGAGT TCACCGCACT GCTGACCAAC ACGGCACTTA CGGCTGATTA CGAGCACATC
ATCTTTGATA CTGCGCCCAC CGGCCACACC ATCCGCTTGC TGCAACTGCC GGGCGCGTGG
AGTGGTTTCC TGGAAGCTGG CAAGGGTGAT GCCTCGTGCC TCGGCCCGCT GGCCGGTCTG
GAAAAGCAGC GGAACCAGTA CAAGGCGGCT GTTGAAGCCT TGGCCGATCC GCTGCACACC
CGTCTGGTGC TGGTCGCTCG CGCCCAGCAG GCGACCTTGC GCGAGGTAGC CCGAACCCAC
GAAGAACTGG CAGCCATAGG CCTCAAACAG CAACATCTCG TCATCAACGG CATCCTGCCG
CACGTCGAAG CCGCTACCGA CCCGCTGGCC GCAGCAATCC ACGAACGGGA ACAAACGGCG
CTGAAGAACA TCCCGACTAC GTTGACTGCG CTTCCGCGTG ATCATGTAGA ACTCAAGCCC
TTCAATCTCG TCGGCCTTGA AGCACTGCGG CAGTTGCTGA CCGACCTTCC TCCACAAGCA
CCCGCAGCGG TTGATTCCCC GATCGAACTC GACGAGCCCG GCATGGCCGA CCTGATCGAC
GGCATCGCGG CGGATGGACA CGGGCTGGTC ATGTTGATGG GCAAAGGTGG TGTAGGCAAG
ACGACCCTGG CGGCCGCCAT CGCGGTCGAA CTGGCACATC GTGGCTTGCC GGTGCATCTG
ACGACCTCCG ATCCTGCGGC CCATTTGACC GATACCCTGG AAGCCTCGCT CGATAATCTG
ACCGTGAGCC GGATCGATCC GCACGCCGAG ACCGAGCGCT ATCGCCAGCA CGTGCTGGAA
ACCCAGGGCG CTCAACTCGA TGCCGAAGGC CGCGCGCTGT TGGAAGAGGA TTTGCGTTCG
CCCTGCACGG AAGAGATTGC GGTCTTCCAG GCGTTCTCCC GCATCATTCG CGAGGCCGGG
AAAAAGTTCG TCGTCATGGA CACGGCCCCG ACCGGGCACA CCTTGCTCCT GCTCGACGCG
ACGGGTGCGT ATCACCGCGA AGTGTCACGA CAAATGGGCA AGACCGGCAT GCACTTCACG
ACGCCGATGA TGCAATTGCA GGATCCGAAA CAAACGAAGG TACTCGTCGT CACGCTGGCG
GAGACGACGC CGGTACTGGA GGCCGCCAAC CTGCAAGCTG ATTTGCGCCG TGCCGGGATC
GAGCCCTGGG CCTGGATCAT CAACACCAGC GTGGCGGCAG CTTCGGCCAA GTCGCCGTTA
CTGCGTCAGC GTGCGGCCAA CGAGCTACGC GAAATCAGCG CTGTGGCGAA TCAGCACGCG
GACCGTTACG CGGTTGTCCC GCTGCTGAAG GAAGAACCGA TCGGTACAGA ACGACTGCGT
GCGCTCATCC ATCCTCAAGC ATAA
 
Protein sequence
MMKFLQLPSR FLFFTGKGGV GKTSIACATA IQLAEAGKRV LLVSTDPASN VGQVFGVDIG 
NRVTPIPAVP RLSALEIDPE AAASAYRERL VGPVRGVLPD DVVKGIEESL SGACTTEIAA
FDEFTALLTN TALTADYEHI IFDTAPTGHT IRLLQLPGAW SGFLEAGKGD ASCLGPLAGL
EKQRNQYKAA VEALADPLHT RLVLVARAQQ ATLREVARTH EELAAIGLKQ QHLVINGILP
HVEAATDPLA AAIHEREQTA LKNIPTTLTA LPRDHVELKP FNLVGLEALR QLLTDLPPQA
PAAVDSPIEL DEPGMADLID GIAADGHGLV MLMGKGGVGK TTLAAAIAVE LAHRGLPVHL
TTSDPAAHLT DTLEASLDNL TVSRIDPHAE TERYRQHVLE TQGAQLDAEG RALLEEDLRS
PCTEEIAVFQ AFSRIIREAG KKFVVMDTAP TGHTLLLLDA TGAYHREVSR QMGKTGMHFT
TPMMQLQDPK QTKVLVVTLA ETTPVLEAAN LQADLRRAGI EPWAWIINTS VAAASAKSPL
LRQRAANELR EISAVANQHA DRYAVVPLLK EEPIGTERLR ALIHPQA