Gene Arth_0212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0212 
Symbol 
ID4447337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp221689 
End bp222786 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content65% 
IMG OID639688008 
Productarsenical-resistance protein 
Protein accessionYP_829713 
Protein GI116668780 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCACCC AGACCGTCTC TTCGCCCACT CGTGAAGGCG AGGCTGCCGT CGTTGGCAAA 
CTCTCCACCC TGGACCGGTT CCTGCCGGTG TGGATCATCG CTGCCATGGT CCTGGGCCTG
TTCCTCGGCA GTTTCGTTCC CGGCCTGAAC ACTGCACTTG AAGCAGTCAA GGTGGGCGAA
GTTTCGCTGC CGATCGCCAT CGGGCTGCTG GTGATGATGT ACCCGGTGCT CGCGAAAGTC
CGCTACGACC AGGCGCACCG CGTGGTCGGT GACCGGAAGC TGATGATCAC CTCGCTGGTG
CTGAACTGGC TTCTCGCCCC GGCGTTCATG TTTGCCCTGG CCTGGATCTT CATCCCGGAT
CTGCCCGACT ACCGTACCGG CCTGATCATC GTGGGCCTGG CCCGCTGCAT CGCCATGGTG
ATGATCTGGA ACGACCTCGC CTGCGGGGAC CGCGAAGCCG CCGCCGTGCT GGTGGCCATC
AATTCCGTCT TCCAGGTCAT CGCGTTCGGC GCGCTGGGCT GGTTCTACCT GCAGTTGCTG
CCGGGCTGGC TGGGCCTGCC CACCACCAGC GCGGACTTCT CCTTCTGGGC CATCACCGCT
TCCGTCCTGG TCTTCCTGGG GATCCCGCTG CTGGCCGGCT TCCTCACCCG CACAATCGGC
GAAAAGGCCA AAGGCCGCGC CTGGTACGAA GGAACCTTCC TGCCGAAGCT CGGACCGTGG
GCGCTGTACG GGCTGCTGTT CACCATCACG CTGCTCTTCG CCCTGCAGGG CGGGACCATC
ACCTCCCGCC CGCTGGACGT CGTCCGGATC GCCCTGCCCC TGCTGGTCTA CTTCCTGGTG
GTCTTCGGCG CCGGCATGCT GATCGGAAGG TGGCTGGACC TGGGCTACGC CAAAACCACC
ACACTGGCCT TCACTGCCGC GGGCAACAAC TTCGAGCTCG CCATCGCAGT GGCGATCGGC
ACTTTCGGTG TCACGTCGGG GCAGGCGCTG GCCGGCGTCG TCGGACCCTT GATCGAAGTC
CCCGTCCTTG TTGCACTGGT TTACGTGGCC CTCTGGGCCC GGAAACGCCA CTTCATCACC
AGCCCCCTTT CCATCTGA
 
Protein sequence
MSTQTVSSPT REGEAAVVGK LSTLDRFLPV WIIAAMVLGL FLGSFVPGLN TALEAVKVGE 
VSLPIAIGLL VMMYPVLAKV RYDQAHRVVG DRKLMITSLV LNWLLAPAFM FALAWIFIPD
LPDYRTGLII VGLARCIAMV MIWNDLACGD REAAAVLVAI NSVFQVIAFG ALGWFYLQLL
PGWLGLPTTS ADFSFWAITA SVLVFLGIPL LAGFLTRTIG EKAKGRAWYE GTFLPKLGPW
ALYGLLFTIT LLFALQGGTI TSRPLDVVRI ALPLLVYFLV VFGAGMLIGR WLDLGYAKTT
TLAFTAAGNN FELAIAVAIG TFGVTSGQAL AGVVGPLIEV PVLVALVYVA LWARKRHFIT
SPLSI