Gene Information |
Locus tag | Cpha266_0466 |
Symbol | |
ID | 4569391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 514430 |
End bp | 515491 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639765066 |
Product | arsenical-resistance protein |
Protein accession | YP_910948 |
Protein GI | 119356304 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
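The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record or of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that does not appear in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 514430, End bp 515491.
length_bp = gene_length(514430, 515491)
print(length_bp)                  # 1062, matching "Gene Length"
print(protein_length(length_bp))  # 353, matching "Protein Length"
```

Under these assumptions the sketch reproduces the Gene Length and Protein Length values listed in the record.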
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0132809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTATGT CAACAAAACA GCTCTCGTTT CTTGATCGCT ATCTGACGCT CTGGATCTTT CTTGCCATGG GCATTGGTGT GCTCTGGGGT TATCTGTTTC CCGGCATTGC CGGATTCTGG AACCATTTTC AGCGCGGCAC CACCAATATT CCCATTGCCA TTGGGCTGAT CGTTATGATG TATCCGCCGT TGGCGAAGGT TAAATATGAG GAACTGGGCG ATGTGTTTCG TAATACGAGG GTACTGGGGC TCTCGCTGCT GCAGAACTGG GTTGTAGGGC CGCTGCTGAT GTTCGTGCTT GCTGTTATAT TTCTTTCCGA TATGCCGCAC TATATGGCAG GTCTTATCAT GATCGGTCTT GCCCGGTGTA TTGCCATGGT GATTGTCTGG AACGAGCTTG CCAAAGGGGA TACGGAGTAT GCTGCAGGGC TTGTTGCCTT CAACTCGATC TTTCAGGTGC TTTTTTTCTC CCTCTATGCA TGGGTTTTTC TCACCGTTCT GCCGGAGTGG CTTGGAATGA CGACCGTAGC CGTTGATATT ACCATTGGTG AAATTGCAGG CTCCGTGTTT ATCTACCTCG GCATTCCCTT TATTGCAGGG TTTCTGACCC GTTTTTTTCT ATTGCGTCTG AAAGGACGCG AGTGGTATGA GGGTGAATTT ATTCCCCGCA TCAGCCCGCT CACCCTCATA TCGCTGCTTT TCACCATTGT GGTGATGTTT TCACTCAAAG GTGAGTATAT CGTTACCATA CCCATGGATG TCGTGCGCAT CGCCATACCG CTGCTCATTT ATTTCGTGAT CATGTTTCTT CTCTCGTTTT ACATGGCAAG AAAGGCCGGA GCCGATTACC CGAAAACTGC CACGCTCTCC TTTACAGCCG CAAGTAATAA TTTTGAGCTT GCTATTGCTG TGGCGGTTTC GGTATTCGGT ATCAATTCGG GAGAGGCATT TGCCGCCGTT ATCGGTCCGC TGGTTGAGGT TCCCGCATTG ATTGCGCTCG TCAATGTTTC ACTCTGGTTC AGGGGCAGAT TTTTCGCAAC GCAGGAGAGC GGTTTTCAAT GA
|
Protein sequence | MGMSTKQLSF LDRYLTLWIF LAMGIGVLWG YLFPGIAGFW NHFQRGTTNI PIAIGLIVMM YPPLAKVKYE ELGDVFRNTR VLGLSLLQNW VVGPLLMFVL AVIFLSDMPH YMAGLIMIGL ARCIAMVIVW NELAKGDTEY AAGLVAFNSI FQVLFFSLYA WVFLTVLPEW LGMTTVAVDI TIGEIAGSVF IYLGIPFIAG FLTRFFLLRL KGREWYEGEF IPRISPLTLI SLLFTIVVMF SLKGEYIVTI PMDVVRIAIP LLIYFVIMFL LSFYMARKAG ADYPKTATLS FTAASNNFEL AIAVAVSVFG INSGEAFAAV IGPLVEVPAL IALVNVSLWF RGRFFATQES GFQ
|
| |
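The gene sequence above is printed in space-separated blocks of ten bases, so the spacing has to be removed before any calculation. As a hedged illustration, a GC percentage such as the one in the GC content field could be computed along the following lines. The helper name `gc_percent` is hypothetical, and how the database rounds its reported value is an assumption that the record does not state.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the 10-base block spacing used in the record)
    is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, used as a small example.
print(gc_percent("ATGGGTATGT CAACAAAACA"))  # 35.0
```

Passing the full Gene sequence string to the same function would give a value to compare with the reported GC content, subject to the rounding assumption above.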