Gene GM21_1873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1873 
Symbol 
ID8137204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2179142 
End bp2180200 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content58% 
IMG OID644869484 
Productarsenical-resistance protein 
Protein accessionYP_003021684 
Protein GI253700495 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones104 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCG AAGTTTCCAG GAGGCTGTCG TTTCTCGACC GCTATCTTAC CCTTTGGATC 
TTCCTCGCCA TGTTTGCAGG GGTGGGAGCG GGGTACTTGT TTCCTGGCGT CGAAAGTTTC
ATCAACAGCT TCCAGGTAGG GACCACCAAC ATCCCGATCG CCGCAGGTCT CATTCTCATG
ATGTACCCTC CCTTTGCCAA GGTGAAATAC GAGGAGATGC CTGAGGTCTT CCGCAACAAG
CGGGTTCTCG GGCTTTCGCT GGTTCAAAAC TGGCTGATAG GGCCGGTTCT GATGTTCATC
CTGGCTGTCG CCTTTCTTCC CGACAAGCCT GAGTACATGG TGGGACTCAT CATGATAGGG
CTCGCCAGGT GCATCGCCAT GGTCATCGTC TGGAACGATC TGGCAAAGGG AAACACCGAG
TACGCCGCGG GTTTGGTCGC TTTCAACAGC ATCTTCCAGG TCCTTTTCTA CAGCGTCTAT
GCCTGGTTCT TCATCACGGT CCTCCCGCCT CTGGTCGGTC TGTCGGGCAG CATCGTGGAA
ATCGGCATAG GGCAGATCGC CAAGAGCGTC TTCATCTACC TTGGCGTCCC GTTTATCGCC
GGCGCCATCA CGCGCCTGGT CGGCGTCAAG GTAATGGGTA GGGAGCGCTA CCATAGGGAG
TTCGTGCCCA GGATCGGCCC GATCACGCTG ATTGCGCTTC TGTTCACCAT CGTCGTCATG
TTCAGCCTGA AGGGGAACCT CATCGTTCAG CTCCCTCTCG ACGTCGTCCG GATCGCGGTA
CCGCTGCTCA TCTATTTCGT CGCCATGTTC TTCGTTTCCT TCTGGATGGG GAAAAAGCTC
GGGGCTGACT ACAGCAAGAC GACGACCCTT GCCTTCACCG CCGCGAGCAA CAACTTCGAA
TTGGCCATCG CCGTCGCGGT CGCCGTTTTC GGACTCAATT CCGGCGCCGC GTTCGCGGCC
GTGATCGGCC CCTTGGTGGA GGTGCCCGTG ATGATTGCCC TGGTCAACGT GGCCTTCCGG
TTCCAGCGCC GCTACTTCAC TACTACAACA GGCCAATAG
 
Protein sequence
MSSEVSRRLS FLDRYLTLWI FLAMFAGVGA GYLFPGVESF INSFQVGTTN IPIAAGLILM 
MYPPFAKVKY EEMPEVFRNK RVLGLSLVQN WLIGPVLMFI LAVAFLPDKP EYMVGLIMIG
LARCIAMVIV WNDLAKGNTE YAAGLVAFNS IFQVLFYSVY AWFFITVLPP LVGLSGSIVE
IGIGQIAKSV FIYLGVPFIA GAITRLVGVK VMGRERYHRE FVPRIGPITL IALLFTIVVM
FSLKGNLIVQ LPLDVVRIAV PLLIYFVAMF FVSFWMGKKL GADYSKTTTL AFTAASNNFE
LAIAVAVAVF GLNSGAAFAA VIGPLVEVPV MIALVNVAFR FQRRYFTTTT GQ