Gene Acid345_2410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2410 
Symbol 
ID4071408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2850547 
End bp2852451 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content61% 
IMG OID637984426 
Productarsenite-transporting ATPase 
Protein accessionYP_591485 
Protein GI94969437 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCAAGTT TCACATTCGT GATCGGAAAA GGCGGCGTCG GCAAGACGAC CGTAGCGGCG 
TCGCTTGCGC TGCACACGGC GAACACACAC CCGCGCGCGA AGACGCTGCT GCTCTCGACC
GATCCTGCAC ACTCGCTCGC CGATGTGCTT GAAACCAAGC TCGGCGATAC TCCAAAAAAA
CTGAAAGCCA AAGGTGCGTT GTACGCCCGG GAACTCGATG CCTCGGCGGC AGTCGAAGAG
TTTCTCGCCG CGCAGCGTGA AGGCATTCTA CGTATCCTTG AGAGCGGATC TTTGTTCACG
CGCGACGAGA TCGCGCCACT GCTCGACAGC GCTCTGCCCG GAATGGCGGA AGTCGCGGCG
CTGCTGGCGA TCCACGACCT GCTCGAATCC GATTACGACG AAGTGATCGT AGATACGGCG
CCCATGGGCC ACACCCTGCG CCTATTCGAA CTCCCAGCGC ACCTGGAGCG CTTCTTGCAC
TTACTGGAAG TCTCCGCCGG TCGCGATGCA GTGCTGGCAG CCCACTTCGG CGGAAGCGTA
AGCGAGAACC AGTACGTCGC GCGCTGGCAG GAAATGGTGC GAAAAGTGGC GCAGTCGCTG
GATCACGAAC ACGCGCGATT GCTCCTGGTG ACGTCGTCGG AAAAGTTTTC GTTGAATGAG
GCCATTCGCG CGCGGGAGCA GCTTCAGCGA GCGCCGGTTC CGATGGAGAT CGCAGAGATC
GTGCTGAACC GAGCTGTAAC TGCGGTTTCC GGCTGCAAGC GATGTACAAC GGCGGCGAAG
AAAACGGTGG CAGCGAGACG GTTTCTCGCG AAGGAATTCA AGCGCGTACC GCTGCGGACA
GGCGAAGACC CAGGCAGCCC AATCGCAGGC GTCGACGCGC TGACGGCATT CGGCAAGCAT
GTATTCGAGG GAAGGGCGCT GCGGCTAAAG CAGTCGAAGC CCGTGCGCGA AAAGGCACTC
GATATCGAAG AAGCGCAGTG GCCCGTACTC AACACCCCGC TGACTCTCAC GCTAGGCAAA
GGCGGTGTCG GCAAGACGAC CATCTCTGCG GCAATGGCCT TTCACGCCCG TGCAAAAAAT
GCGAAGGAAG CAGTGTGCAT CTGCTCAATC GATCCCGCGC CGTCACTCGA TGACGTCTTC
CAAACCGAAG TCACGAACCA ACTAGCTCCG GTGTTAGACG ACGCCAAACT CTTCGCTGCG
GAAATCGATG CAGTTGGGGA GTATCAGCGC TGGGCGGAAG AGATGCGCGC AAGGGTCGAA
GACGCTACTT CGACCGAAGT CCGCGGCGTG CATCTCGATC TCAGCTTCGA GCGCGACCTC
TTCCTGGCAA TTCTCGACGT GGTGCCGCCC GGCGTGGACG AACTCTTCGC GACCTTTCGC
ATCCTCGACC TCGTAGAACG CGGCGGTCGG GTGCAGATTG ACATGGCGCC CACCGGCCAC
GCCTTGGAAG TATTGCGCAC CCCGGCACGG CTATTGGGTT GGGCGCGGGT TTTGTTGAAA
ACCCTCGCGC ACCACCGTAC ACTACCCCTC GCGCGCGATG CGGCCGTGGA GATTGCGACA
GTCTCGCAAC GAGTGCGCGA ACTTTCGACA ACGCTCAGCG ATTCCAAACG CAGTCAGGTG
TGGGTGGTCA TGCTGGCAGA ACCGCTGCCG GACCGGGAGA CGCGTCGCTT GCTGTGCGAT
TTGCAGGAAT TGAAAGCGCC GGTGGCGGGA GTTTTCGTCA ACCGCGTCTT GATGGACGAG
ACCCACTGCC CGCGCTGTAG CCGCGCACAG GCGTGGCAGC GGCAAACGCT GGCGAAGATG
AAAGACGGCG CTTTCCCGGT ATTTGTCGTG CCGGAGATGC CAGAGGAAAT CGCAGGAGCG
CGCGGGCTGC AACGGTTCAC CAAATCTCTA TGGCGACTGC AATAA
 
Protein sequence
MPSFTFVIGK GGVGKTTVAA SLALHTANTH PRAKTLLLST DPAHSLADVL ETKLGDTPKK 
LKAKGALYAR ELDASAAVEE FLAAQREGIL RILESGSLFT RDEIAPLLDS ALPGMAEVAA
LLAIHDLLES DYDEVIVDTA PMGHTLRLFE LPAHLERFLH LLEVSAGRDA VLAAHFGGSV
SENQYVARWQ EMVRKVAQSL DHEHARLLLV TSSEKFSLNE AIRAREQLQR APVPMEIAEI
VLNRAVTAVS GCKRCTTAAK KTVAARRFLA KEFKRVPLRT GEDPGSPIAG VDALTAFGKH
VFEGRALRLK QSKPVREKAL DIEEAQWPVL NTPLTLTLGK GGVGKTTISA AMAFHARAKN
AKEAVCICSI DPAPSLDDVF QTEVTNQLAP VLDDAKLFAA EIDAVGEYQR WAEEMRARVE
DATSTEVRGV HLDLSFERDL FLAILDVVPP GVDELFATFR ILDLVERGGR VQIDMAPTGH
ALEVLRTPAR LLGWARVLLK TLAHHRTLPL ARDAAVEIAT VSQRVRELST TLSDSKRSQV
WVVMLAEPLP DRETRRLLCD LQELKAPVAG VFVNRVLMDE THCPRCSRAQ AWQRQTLAKM
KDGAFPVFVV PEMPEEIAGA RGLQRFTKSL WRLQ