Gene Information |
Locus tag | Cpha266_2301 |
Symbol | |
ID | 4569405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2633956 |
End bp | 2635173 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 639766863 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_912717 |
Protein GI | 119358073 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
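
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; the record does not state either convention, although its numbers are consistent with both. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Cpha266_2301.
length = gene_length_bp(2633956, 2635173)
print(length)                     # 1218
print(protein_length_aa(length))  # 405
```

Both results match the Gene Length (1218 bp) and Protein Length (405 aa) fields listed in the record.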
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0024147 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGTATTT TAACTTTTAC AGGTAAAGGC GGAGTAGGAA AAACCAGTGT GTCGGCTGCA ACAGCTGTTC GTTTATCCCA GTTGGGGTAT CGTACTCTTG TGCTTTCAAC TGATCCCGCG CACAGTTTAT CGGATTCATT CAACCTTCCG CTTGGTGCTG AACCAACCAA AATCAAGGAG AACCTGCACG CAATTGAGGT CAATCCCTAT GTTGATTTGA AGCAGAACTG GCAGTCAGTG CAGAAATACT ATACAAGAAT TTTTATGGCC CAGGGGGTTT CAGGGGTTAT GGCCGATGAG ATGACCATTC TTCCCGGTAT GGAAGAGCTT TTTTCCCTCC TCAGAATCAA ACGATATAAA ACCGCCGGAC TTTATGATGT CCTTGTGCTC GATACTGCGC CGACCGGTGA AACGCTCAGG TTGCTTTCCC TTCCCGATAC CCTTGCATGG GGAATGAAGG CGGTAAAAAA TATCAACAAA TATATTGTCA GACCGCTCAG CAAGCCACTG TCAAAAATGT CTGACAGAAT TGCCTTCTAT ATTCCGCCTG AAGATGCTGT TGAGTCTGTT GATCAGGTGT TCGATGAGCT TGAAGATATT CGGGAGATTC TGACCGATAA TGTCAAATCT ACCGTACGTC TTGTCATGAA TGCCGAGAAG ATGTCGATCA AGGAAACCAT GCGTGCACTT ACTTACCTTA ACCTTTACGG CTTCAAGGTG GATATGGTAC TCGTCAACAG GTTGCTTGAT ACAAAAGAGG ACAGCGGGTA TCTGGAGAAA TGGAAAGGCA TTCAGCAGAA ATACCTTGGC GAGATTGAAG AGGGTTTTTC TCCGCTTCCG GTTAAAAAAC TCAGGATGTA TGAACAGGAA ATCGTCGGCC TTGATGCGCT CGAGCTTTTT GCAAAAGACA TGTATGGCGA TTCCGATCCT TCTGATCTTA TGTACGACGA ACCTCCGATC AAGTTTGTAA GAAACGGGGA TGTGTATGAG GTACAACTGA AGCTTATGTT TGCCAATCCT GTCGATATTG ATGTCTGGGT CACAGGCGAT GAACTCTATG TACAGATCGG TAATCAGAGA AAGATCATCA CGCTTCCCAT AAGTTTGACC GGACTTGAGC CCGGCGATGC CGTTTTTAAG GATAAATGGC TGCATATTCC CTTTGACCTT AACCATCAGG GAAAGCACCA GCGGCAGCGG CAGGGAGAGG CCGATTAA
Protein sequence | MRILTFTGKG GVGKTSVSAA TAVRLSQLGY RTLVLSTDPA HSLSDSFNLP LGAEPTKIKE NLHAIEVNPY VDLKQNWQSV QKYYTRIFMA QGVSGVMADE MTILPGMEEL FSLLRIKRYK TAGLYDVLVL DTAPTGETLR LLSLPDTLAW GMKAVKNINK YIVRPLSKPL SKMSDRIAFY IPPEDAVESV DQVFDELEDI REILTDNVKS TVRLVMNAEK MSIKETMRAL TYLNLYGFKV DMVLVNRLLD TKEDSGYLEK WKGIQQKYLG EIEEGFSPLP VKKLRMYEQE IVGLDALELF AKDMYGDSDP SDLMYDEPPI KFVRNGDVYE VQLKLMFANP VDIDVWVTGD ELYVQIGNQR KIITLPISLT GLEPGDAVFK DKWLHIPFDL NHQGKHQRQR QGEAD
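
The relationship between the two sequences above can be illustrated by translating the gene sequence codon by codon. The following is a minimal sketch in Python, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and it assumes the gene sequence is shown in coding orientation, which is consistent with it beginning with ATG even though the gene lies on the minus strand. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
print(translate("ATGCGTATTT TAACTTTTAC AGGTAAAGGC"))  # MRILTFTGKG
```

The output matches the first block of the protein sequence. Passing the full gene sequence, spaces included, should reproduce the whole protein, since the function strips whitespace before reading codons.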