Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1873 |
Symbol | |
ID | 8137204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2179142 |
End bp | 2180200 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644869484 |
Product | arsenical-resistance protein |
Protein accession | YP_003021684 |
Protein GI | 253700495 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 104 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCG AAGTTTCCAG GAGGCTGTCG TTTCTCGACC GCTATCTTAC CCTTTGGATC TTCCTCGCCA TGTTTGCAGG GGTGGGAGCG GGGTACTTGT TTCCTGGCGT CGAAAGTTTC ATCAACAGCT TCCAGGTAGG GACCACCAAC ATCCCGATCG CCGCAGGTCT CATTCTCATG ATGTACCCTC CCTTTGCCAA GGTGAAATAC GAGGAGATGC CTGAGGTCTT CCGCAACAAG CGGGTTCTCG GGCTTTCGCT GGTTCAAAAC TGGCTGATAG GGCCGGTTCT GATGTTCATC CTGGCTGTCG CCTTTCTTCC CGACAAGCCT GAGTACATGG TGGGACTCAT CATGATAGGG CTCGCCAGGT GCATCGCCAT GGTCATCGTC TGGAACGATC TGGCAAAGGG AAACACCGAG TACGCCGCGG GTTTGGTCGC TTTCAACAGC ATCTTCCAGG TCCTTTTCTA CAGCGTCTAT GCCTGGTTCT TCATCACGGT CCTCCCGCCT CTGGTCGGTC TGTCGGGCAG CATCGTGGAA ATCGGCATAG GGCAGATCGC CAAGAGCGTC TTCATCTACC TTGGCGTCCC GTTTATCGCC GGCGCCATCA CGCGCCTGGT CGGCGTCAAG GTAATGGGTA GGGAGCGCTA CCATAGGGAG TTCGTGCCCA GGATCGGCCC GATCACGCTG ATTGCGCTTC TGTTCACCAT CGTCGTCATG TTCAGCCTGA AGGGGAACCT CATCGTTCAG CTCCCTCTCG ACGTCGTCCG GATCGCGGTA CCGCTGCTCA TCTATTTCGT CGCCATGTTC TTCGTTTCCT TCTGGATGGG GAAAAAGCTC GGGGCTGACT ACAGCAAGAC GACGACCCTT GCCTTCACCG CCGCGAGCAA CAACTTCGAA TTGGCCATCG CCGTCGCGGT CGCCGTTTTC GGACTCAATT CCGGCGCCGC GTTCGCGGCC GTGATCGGCC CCTTGGTGGA GGTGCCCGTG ATGATTGCCC TGGTCAACGT GGCCTTCCGG TTCCAGCGCC GCTACTTCAC TACTACAACA GGCCAATAG
|
Protein sequence | MSSEVSRRLS FLDRYLTLWI FLAMFAGVGA GYLFPGVESF INSFQVGTTN IPIAAGLILM MYPPFAKVKY EEMPEVFRNK RVLGLSLVQN WLIGPVLMFI LAVAFLPDKP EYMVGLIMIG LARCIAMVIV WNDLAKGNTE YAAGLVAFNS IFQVLFYSVY AWFFITVLPP LVGLSGSIVE IGIGQIAKSV FIYLGVPFIA GAITRLVGVK VMGRERYHRE FVPRIGPITL IALLFTIVVM FSLKGNLIVQ LPLDVVRIAV PLLIYFVAMF FVSFWMGKKL GADYSKTTTL AFTAASNNFE LAIAVAVAVF GLNSGAAFAA VIGPLVEVPV MIALVNVAFR FQRRYFTTTT GQ
|
| |