Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_0078 |
Symbol | |
ID | 6355601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 84212 |
End bp | 85402 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642667701 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_001942163 |
Protein GI | 189345634 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.121619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAATA TCGTTTTCAC CGGAAAAGGG GGCGTAGGCA AAACCTCGAT CGCCGCCGCA ACGGCTGTAA AAGCCGCTTC GATGGGCTAC AAGACCCTTG TCATATCCAC CGACCCCGCG CACAGCCTCG GCGACTCCTT CGATATCGAA CTCGGCCCCT CTCCGGTAAA GATCGCCGAA AACCTCTTTG GACAGGAGGT CAGCGTCTAT GGCGATCTCA ATATGAACTG GGAGATCGTG CGGGAGCACT TCGCCCACCT CATGGAAGTA CAGGGCATTC AGGGCATCTA CGTCGAAGAG ATGGGCGTGC TGCCCGGCAT GGAGGAGCTC TTCTCGCTCT CCTACATCAA GCGCTACAAC GAATCGAACG AATACGACCT GCTCGTGGTT GACTGCGCGC CGACCGGAGA AACCCTTCGC CTGCTCTCGC TGCCCGAAAC CTTCGGCTGG ATGCTCAAGC TGATGCGCAA TCTCGAAAAA TATGTGGTCA AACCGCTGAT CCGTCCGCTC TCCAAACGGG TAGGCAAACT GCACGAGCTT GTCCCCGACT CCGACGTTTA CGATCAGGTC GATCACCTCT TTTCCTCCAT CGAGGGGATC ATCGAACTGC TCTCCGATTC GACCAAAACC ACAGTCCGCC TGGTCATGAA CCCCGAAAAA ATGGTGATAA AGGAGTCCAT GCGCGCGCTC ACCTACCTGA ACCTCTACGG CATCACGGTC GATCAGGTTA TCATCAACCG GGTCTTCATG GACGAAGTGG ACGGCCAGTA CATGAAGGAG TGGAAAGAGA TCCAGCACAA GTATATCGAC CAGATCGAAA CATCCTTCGC CCCGGTGCCC ATCACGAAGG TGCCGCTCTT CCGACGCGAG GTGCTCGGTC TGGAGATGCT CAAACAGGTC GGCGAGGTTG TCTATGGCGA TAAGAATCCT CTCGACATTT TCTACCATGA AGAGCATGTC GATATCAAAA AGATTTCGGA CGGCCACTAC GTCATGAAGC TCCGCCTCCC CTTCGTCTTC GACAACAAAA TGGAAGCGAA CGTGGTGCAG ATCGGCGACT CGCTCACCGT GCGCATCGGC AACTACCAGA AAGGGGTGGT GCTGCCGCTC TTTCTGGCCG GCATGCGGGT TGCTGAAGCC GGCTACGAGG AGAAGTGGCT GAAAATCGAT TTCCGGAAAA AAGAGGGCTG A
|
Protein sequence | MRNIVFTGKG GVGKTSIAAA TAVKAASMGY KTLVISTDPA HSLGDSFDIE LGPSPVKIAE NLFGQEVSVY GDLNMNWEIV REHFAHLMEV QGIQGIYVEE MGVLPGMEEL FSLSYIKRYN ESNEYDLLVV DCAPTGETLR LLSLPETFGW MLKLMRNLEK YVVKPLIRPL SKRVGKLHEL VPDSDVYDQV DHLFSSIEGI IELLSDSTKT TVRLVMNPEK MVIKESMRAL TYLNLYGITV DQVIINRVFM DEVDGQYMKE WKEIQHKYID QIETSFAPVP ITKVPLFRRE VLGLEMLKQV GEVVYGDKNP LDIFYHEEHV DIKKISDGHY VMKLRLPFVF DNKMEANVVQ IGDSLTVRIG NYQKGVVLPL FLAGMRVAEA GYEEKWLKID FRKKEG
|
| |