Gene Information |
Locus tag | Tgr7_2827 |
Symbol | |
ID | 7315264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 2970153 |
End bp | 2971220 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643617727 |
Product | arsenical-resistance protein |
Protein accession | YP_002514887 |
Protein GI | 220935988 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
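The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon; the function names are illustrative and not part of the database.

```python
# Field values copied from the Gene Information table above.
START_BP = 2970153
END_BP = 2971220
STOP_CODON_NT = 3  # assumption: one terminal stop codon that is not translated


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS that includes its stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len_bp - STOP_CODON_NT) // 3


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1068 355
```

Both results agree with the Gene Length (1068 bp) and Protein Length (355 aa) fields in the record.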
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCGC CGCAACCGGC ATCCCGCCTG TCCTTTCTGG ACCGCTACCT TACGCTCTGG ATATTCGCCG CCATGACGCT GGGCGTGGTG CTGGGCATGA TCTTCAAGGG CCTGCCCGAC GCGCTGAACG CGATGTCGGT CGGCACCACC AACATCCCGA TCGCCATCGG CCTGATCCTG ATGATGTACC CGCCGCTGGC GAAGGTGCGC TACGAGGAAT TGCATCGGGT GTTCGCCGAC AAGCGCGTGC TGCTGCTCTC CCTGGTGCAG AACTGGCTGA TCGGGCCGGT GCTGATGTTT GCGCTGGCGG TGATCTTCCT GCGCGACCAT CCCGAGTACA TGACGGGCCT GATCCTGATC GGCCTGGCCC GCTGCATCGC GATGGTGCTG GTCTGGAATC AGTTGGCGCG GGGCGACAAC CAGTATGTCG CGGGCCTGGT CGCGTTCAAT TCCATCTTCC AGATCCTGTT CTTCTCGACC TATGCCTGGT TCTTCCTGTC GGTGCTGCCA CCGCTGTTCG GCTTGCAGGG TAGCGTCATC GACGTCAGCT TCTGGACCAT CACCGAGGCC GTGCTGATCT ACCTGGGCAT TCCATTCCTG GCCGGCTTCC TGACCCGCCG CATCCTGATC GGCAGGAAGG GTGCCGAATG GTACGAGCGC GTCTTCCTGC CCAGGATTAG CCCGATCACG CTGGCGGCGC TGCTGTTCAC CATCGTCGCC ATGTTCAGTC TCAAGGGCGG CGACGTGGTG CGGTTGCCGA TGGACGTGCT GCTGATCGCC GTGCCGCTGA CGATTTATTT CATCGTCCAG TTCCTCGTCA GCTTCTTCAT GGGGAAGCTG ATCGAAATCG ATTATCCGCG CACTACCGCG ATCGCCTTCA CGGCGGCCGG CAACAACTTC GAGCTGGCGA TCGCGGTCGC CATCGCGGCG TTCGGGTTGG CGTCGCCCGT CGCGTTCACG ACCATCATCG GCCCACTGGT GGAAGTGCCG GTGCTGATCA TGCTGGTTCA TGTGGCCCTG CGACTGGGCG AGAAGTGGTT CCCCGCAACC GCTCCCGTGC GGCCATGA
|
Protein sequence | MASPQPASRL SFLDRYLTLW IFAAMTLGVV LGMIFKGLPD ALNAMSVGTT NIPIAIGLIL MMYPPLAKVR YEELHRVFAD KRVLLLSLVQ NWLIGPVLMF ALAVIFLRDH PEYMTGLILI GLARCIAMVL VWNQLARGDN QYVAGLVAFN SIFQILFFST YAWFFLSVLP PLFGLQGSVI DVSFWTITEA VLIYLGIPFL AGFLTRRILI GRKGAEWYER VFLPRISPIT LAALLFTIVA MFSLKGGDVV RLPMDVLLIA VPLTIYFIVQ FLVSFFMGKL IEIDYPRTTA IAFTAAGNNF ELAIAVAIAA FGLASPVAFT TIIGPLVEVP VLIMLVHVAL RLGEKWFPAT APVRP
|
| |
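The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, even though the Strand field is "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which start codons are permitted. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-nt blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Alternative table 11 start codons (e.g. GTG, TTG) are not special-cased;
    this record starts with ATG, so that does not matter here.
    """
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(s), 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 18 nt of the gene sequence in the record above.
head = "ATGGCATCGC CGCAACCGGC ATCCCGCCTG"
tail = "GCTCCCGTGC GGCCATGA"
print(translate(head))  # MASPQPASRL
print(translate(tail))  # APVRP
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.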