Gene Information |
Locus tag | Cpha266_1248 |
Symbol | |
ID | 4570266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1417025 |
End bp | 1418248 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639765839 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_911705 |
Protein GI | 119357061 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
|
|
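The coordinate and length fields above agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1417025, 1418248)
print(length, protein_length(length))  # prints "1224 407"
```

Both values match the Gene Length and Protein Length fields of this record.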
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.245815 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTATTT TAACGTTTAC CGGAAAAGGC GGCGTAGGAA AGACAAGTGT TTCTGCAGCT ACAGCTGTTC GATTATCGGA GCTTGGTTAT CGTACGCTTG TCCTTTCAAC AGATCCTGCG CATAGTCTGT CGGATTCGTT CAATCTTCCT CTTGGCGCCG AGCCAACAAA GATCAAGGAA AATCTTCATG CCATCGAGAT TAATCCCTAT GTTGATCTGA AGCAGAATTG GCATGCTGTT CAGAAATTCT ATACAGGAAT ATTCAAGCCC CAGGGCGTAT CGGGTGTTGT CGCCGATGAG ATGACCATTC TTCCGGGAAT GGAAGAGCTG TTTTCCCTTT TGAGGATAAA ACGTTATAAA ACTTCAGGAC TCTACGATGT TCTCGTACTC GATACCGCTC CGACAGGTGA AACCCTTCGC TTGCTCTCTC TGCCGGACAC GCTTGCATGG GGCATGAAAG CCGTTAAAAA TGTTACCAAA TATATCGTTC GGCCACTCAG CAAGCCCCTC TCCCGGATGT CTGACAAGAT CGCGCAATAT ATTCCACCTG AAGAAGCGCT GGATTCTGTC GATCAGGTTT TTGATGAACT TGAAGATATT CGCGAGATTC TGACCGATAA TCAGAAATCG ACTGTCCGTC TGGTGATGAA TGCTGAAAAG ATGTCGATAA AGGAGACGAT GCGAGCACTT ACCTATCTCA ATCTGTATGG TTTCAAAGTC GATATGGTGC TGGTAAACCG GTTGCTTGAC ACTAAGGAAA ACAGCGGATA TCTTGAAAAC TGGAAAACCA TTCAGCAGAA ATATCTTGGA GAGATCGAAC AGAGTTTTTC GCCTCTTCCG GTTAAAAAAC TCAGGATGTA TGAAGAAGAG ATTGTTGGTC TCAAGGCACT TGAGCTTTTT GCCCGGGATA TGTATGGCGA AACTGATCCT GCCGATATGA TGTACGACGA ACCGCCGATC AAGTTCGTTC GCACGGGTGA TATTTATGAG GTACAGTTAA AGCTTATGTT TGCCAATCCC GTTGATATCG ATGTATGGGT TACTGGTGAT GAACTCTATG TACATATCGA AAACCAGCGC AAGATTATCA CGCTTCCGAT CAGTTTAACC GGACTTGAAC CGGGAGATGC CTATTTCAAG AACAAGTGGC TGCACATTCC TTTTGATCTT GACAATCATA AACAACACAA GACAACAAAG CAGTACAATA AAGCTCTTAA TTGA
|
Protein sequence | MRILTFTGKG GVGKTSVSAA TAVRLSELGY RTLVLSTDPA HSLSDSFNLP LGAEPTKIKE NLHAIEINPY VDLKQNWHAV QKFYTGIFKP QGVSGVVADE MTILPGMEEL FSLLRIKRYK TSGLYDVLVL DTAPTGETLR LLSLPDTLAW GMKAVKNVTK YIVRPLSKPL SRMSDKIAQY IPPEEALDSV DQVFDELEDI REILTDNQKS TVRLVMNAEK MSIKETMRAL TYLNLYGFKV DMVLVNRLLD TKENSGYLEN WKTIQQKYLG EIEQSFSPLP VKKLRMYEEE IVGLKALELF ARDMYGETDP ADMMYDEPPI KFVRTGDIYE VQLKLMFANP VDIDVWVTGD ELYVHIENQR KIITLPISLT GLEPGDAYFK NKWLHIPFDL DNHKQHKTTK QYNKALN
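The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a minimal, dependency-free Python illustration, not the method used to generate this record. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons), and it strips the spaces that separate the 10-base blocks. The names `translate` and `gc_content` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGCGTATTT TAACGTTTAC CGGAAAAGGC"))  # prints "MRILTFTGKG"
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_content` on the same input can be compared with the GC content field.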