Gene Anae109_3643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3643 
Symbol 
ID5375099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4256220 
End bp4257365 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content68% 
IMG OID640845164 
Productarsenical-resistance protein 
Protein accessionYP_001380807 
Protein GI153006482 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0704607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.350418 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGC CCGCAGTCAC CCTCGCTGCG CCCTCCGGCG TCGCCCGCCG GCTCTCGTTC 
CTCGACCGCT ACCTGACCCT CTGGATCTTC CTCGCCATGG GCGCGGGCAT CGCGCTCGGC
TGGGCCGTGC CCGGGGTCGT CCCGGCGCTC GACCGGCTGA GCGTCGGCAC GACCTCGATT
CCCATCGCGA TCGGCCTGAT CCTCATGATG TACCCGCCGC TCGCCAAGGT CCGGTACGAG
GAGCTGCCGC GGATCTTTCG CGACGGGAAG GTGCTCGGGC TGTCGCTCGT GCAGAACTGG
GTGGTCGGGC CGCTGCTCAT GTTCGCGCTG GCCGTGATCT TCCTGCGAGA CAGGCCCGAG
TACATGGTCG GGCTCATCCT GATCGGCCTC GCCCGCTGCA TCGCGATGGT CATCGTCTGG
AACGACCTCG CGAGGGGCGA CACCGAGTAC TGCGCCGGCC TCGTCGCCTT CAACTCCATC
TTCCAGGTGC TCTTCTTCTC CGTCTACGCC TGGATCTTCA TCACCGTGCT GCCTGGGTGG
CTCGGCCTCC GGGGCGCCGA GGTCCACATC ACCATCGGCG AGATCGCCCG GAGCGTCTTC
GTCTATCTGG GCATCCCGTT CCTCGCCGGG ATGGCGAGCC GCTTCGGGCT TCGGGCGTGG
AAGGGCGAGG ACTGGTACCG CAGGGTGTTC ATCCCCCGGA TCTCGCCCGT CACGCTCGTC
GCCCTCCTCT TCACCATCGT CGTGATGTTC TCCCTGAAGG GCGAGACCAT CGTGCAGGTG
CCGCTCGACG TGGTGCGGAT CGCGATCCCG CTCCTCGTCT ACTTCCTGCT CATGTTCTTC
GTCTCCTTCT GGATGAGCCG GAAGGTCGGC GCGACCTACG GGCAGACCGC CACGCTGTCC
TTCACGGCCG CGTCGAACAA CTTCGAGCTC GCCATCGCGG TGGCGGTCGC GACGTTCGGC
ATGTCCCACG GGGCGGCCTT CGCCGCCGTG ATCGGTCCCC TGGTCGAGGT GCCGGTCCTC
ATCGGGCTCG TGAACGTGGC CCTGAAGCTG CGCGACCGCT GGTTCCCCGG CGAGACCGGC
GAGCTCGGGA AGGTGGCGAG CTGCGCGGTC ACCGTGGAGG GCCCTGCGGC CAGGGGGGCG
CCGTGA
 
Protein sequence
MSEPAVTLAA PSGVARRLSF LDRYLTLWIF LAMGAGIALG WAVPGVVPAL DRLSVGTTSI 
PIAIGLILMM YPPLAKVRYE ELPRIFRDGK VLGLSLVQNW VVGPLLMFAL AVIFLRDRPE
YMVGLILIGL ARCIAMVIVW NDLARGDTEY CAGLVAFNSI FQVLFFSVYA WIFITVLPGW
LGLRGAEVHI TIGEIARSVF VYLGIPFLAG MASRFGLRAW KGEDWYRRVF IPRISPVTLV
ALLFTIVVMF SLKGETIVQV PLDVVRIAIP LLVYFLLMFF VSFWMSRKVG ATYGQTATLS
FTAASNNFEL AIAVAVATFG MSHGAAFAAV IGPLVEVPVL IGLVNVALKL RDRWFPGETG
ELGKVASCAV TVEGPAARGA P