Gene Athe_1933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1933 
Symbol 
ID7407347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2041453 
End bp2042499 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content36% 
IMG OID643716305 
Productarsenical-resistance protein 
Protein accessionYP_002573793 
Protein GI222529911 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCAGA AAAAAGGTTT GTCATTTCTG GACAGATTTT TGACAGTTTG GATTTTGCTT 
GCTATGATTG TAGGTGTTCT GATAGGGTAC TTTTTCCCAA ACTTTGCAAA TGTGTTAAAT
AGACTTAGCA TTGGAACAAC GTCAATTCCA ATTGCTATTG GGTTAATTTT GATGATGTAT
CCTCCTCTTG CAAAAGTAAG ATATGAAGAG ATAGGAAAGA CAAAAGCAGG TAAAAAACCT
TTTGGAATAG CAATCTTATA TAACTGGTTT ATAGGACCTA TTGTCATGTT TTTGCTTGCC
ATCTTGCTTT TGAGAGATTA TCCACATTAT ATGATAGGAG TAATATTGGT AGGCTTGGCT
CGATGCATTG CAATGGTCCT TGTGTGGAAT GACCTTGCAG ATGGTGACAG GGATTTTGTT
GCAGGGCTTG TTGCTCTCAA TGCAATCTGG CAGGTTCTGA CCTATTCAGT ACTTGCATAT
GTGTTTATAA AGATACTTCC TCCACTTTTT GGAATAAGCA CATCTGCAAT TGCTTTGCAT
ATTTCAATGA AAGAAATAGC AATTTCGGTG TTTATTTATC TTGGTATTCC TTTTATAGCT
GGAGTATTGA CAAGAATTTT TTTGGTCAGA AAAAACGGCA GAGAATGGTA CGAAAAGAAT
TTTGTGCCCA AGATAAGTCC AATAACCTTG GTAGCACTGC TTTTTACAGT CATTGTGATG
TTCTCATTAA AAGGAAAGTA TATTGTTACA CTTCCACTTC ATGTATTGAG AATAGCAATA
CCACTTTCGC TGTATTTTGT TATAATGTTT TTGATAACAT TCTTTACATC ATATAAAAGA
AAATATCCTT ATCCTGAGAG TGCAACTGTT GGTCTGACAG CAGCAAGCAA TGACTTTGAA
CTTGCAATTG CAGTTGCTGT TGCAACCTTT GGTTTAGGGT CTGGTGAAGC CTTTGCAACA
GTTATTGGTC CTCTGATTGA AGTTCCTGTT ATGCTTCTTT TGGTAAATGT TGCTCTATTT
TTGAAAAAGA AACTTTATGC TAAATAA
 
Protein sequence
MEQKKGLSFL DRFLTVWILL AMIVGVLIGY FFPNFANVLN RLSIGTTSIP IAIGLILMMY 
PPLAKVRYEE IGKTKAGKKP FGIAILYNWF IGPIVMFLLA ILLLRDYPHY MIGVILVGLA
RCIAMVLVWN DLADGDRDFV AGLVALNAIW QVLTYSVLAY VFIKILPPLF GISTSAIALH
ISMKEIAISV FIYLGIPFIA GVLTRIFLVR KNGREWYEKN FVPKISPITL VALLFTVIVM
FSLKGKYIVT LPLHVLRIAI PLSLYFVIMF LITFFTSYKR KYPYPESATV GLTAASNDFE
LAIAVAVATF GLGSGEAFAT VIGPLIEVPV MLLLVNVALF LKKKLYAK