Gene Information |
Locus tag | Clim_2021 |
Symbol | |
ID | 6355525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 2239583 |
End bp | 2240638 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642669619 |
Product | arsenical-resistance protein |
Protein accession | YP_001944032 |
Protein GI | 189347503 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
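
The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `coding_codons` are illustrative and not part of the source database.

```python
# Values copied from the Gene Information rows above.
START_BP = 2239583
END_BP = 2240638
GENE_LENGTH_BP = 1056
PROTEIN_LENGTH_AA = 351


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The strand value does not affect either calculation, since only the span between the two coordinates is used.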
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATGT CAACGAAACA ACTCTCCTTT CTCGACCGGT ACCTTACGCT CTGGATATTC CTCGCCATGG CAGTCGGCGT ATTCTCGGGC TATCTTGTTC CCGGTACGGC GGCGTTCTGG AATACCTTTC AGTCGGGTAC CACCAATATC CCCATCGCCA TAGGGCTTAT CGTCATGATG TATCCGCCTC TGGCCAAGGT GAAGTACGAG GAGCTTGGCG ATGTGTTCCG CAACACCAGA GTGCTCGGGC TTTCCCTTGT GCAGAACTGG GTGATCGGGC CGGTGCTCAT GTTCGTGCTT GCAGTGCTCT TTCTTTCCGA TATGCCGCAT TACATGGCCG GACTCATCCT GATCGGCCTG GCACGCTGCA TTGCCATGGT GATCGTCTGG AACGAGCTGG CAAAAGGCGA TACGGAGTAC GCGGCAGGTC TCGTCGCCTT CAACTCGGTC TTTCAGGTGC TGTTCTTCTC CGTTTACGCG TGGGTATTCC TCACGGTGCT GCCCGGCTGG CTCGGCCTCT CGTCGTTCAG GGTCGATATC ACCATCGCCG AAATCGCTTC GTCGGTTTTC ATCTACCTCG GCATTCCCTT CATTGCGGGT TTTCTTACCC GGTTCGTCAT GCTCCGCATC AAAGGCAGGG AGTGGTACGA AACCGTTTTC GTGCCGCGCA TCAGTCCCCT GACGCTCGTT GCGCTGCTCT TTACCATCGT GGTGATGTTC TCGCTGAAAG GCGAGTATAT CGTGAAAATT CCCATGGATG TGGTGCGCAT CGCCATACCG CTGCTTTGCT ACTTCATCAT CATGTTCTTC GTTTCGTTCT GGATGGGACG CAAGGTCGGT GCGGATTACT CCAAAACAGC CACGCTCTCA TTCACGGCGG CGAGCAACAA CTTCGAGCTT GCCATAGCCG TGGCCGTGGC GGTGTTCGGT ATCGATTCCG GCGAAGCGTT CGCCGCCGTG ATCGGGCCGC TCGTGGAGGT TCCCGCCCTG ATCGGTCTGG TAAATGTATC GCTCTGGTTC AGGGAGAAAT GGTTCGGGGG ATCATCTGCA GCTTAA
|
Protein sequence | MGMSTKQLSF LDRYLTLWIF LAMAVGVFSG YLVPGTAAFW NTFQSGTTNI PIAIGLIVMM YPPLAKVKYE ELGDVFRNTR VLGLSLVQNW VIGPVLMFVL AVLFLSDMPH YMAGLILIGL ARCIAMVIVW NELAKGDTEY AAGLVAFNSV FQVLFFSVYA WVFLTVLPGW LGLSSFRVDI TIAEIASSVF IYLGIPFIAG FLTRFVMLRI KGREWYETVF VPRISPLTLV ALLFTIVVMF SLKGEYIVKI PMDVVRIAIP LLCYFIIMFF VSFWMGRKVG ADYSKTATLS FTAASNNFEL AIAVAVAVFG IDSGEAFAAV IGPLVEVPAL IGLVNVSLWF REKWFGGSSA A
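
The sequences above are printed in space-separated blocks of 10 residues, so the spaces need to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage like the GC content field; the helper names (`clean`, `gc_percent`, `ends_with_stop`) are hypothetical, and the two short strings are only the first and last blocks of the gene sequence, not the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard stop codons, also used by translation table 11


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return clean(seq)[-3:] in STOP_CODONS


if __name__ == "__main__":
    first_blocks = "ATGGGCATGT CAACGAAACA"  # first two blocks of the gene sequence
    last_blocks = "ATCATCTGCA GCTTAA"       # last two blocks of the gene sequence
    print(len(clean(first_blocks)))
    print(gc_percent(first_blocks))
    print(ends_with_stop(last_blocks))
```

Passing the full Gene sequence value to `gc_percent` would give the figure to compare against the reported GC content. Although the gene is on the minus strand, the sequence as listed starts with ATG and ends with TAA, so it appears to be given in coding orientation already.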
|
| |