Gene AnaeK_4018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnaeK_4018 
Symbol 
ID6785488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. K 
KingdomBacteria 
Replicon accessionNC_011145 
Strand
Start bp4540357 
End bp4541502 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content68% 
IMG OID642765487 
Productarsenical-resistance protein 
Protein accessionYP_002136352 
Protein GI197124401 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGAAG CGAACCTCGT TGCACCGGCC CGCGCCGGCG TCGCCCGCCG CCTCTCCTTC 
CTCGACCGCT ACCTCACGCT CTGGATCTTC CTCGCCATGG GCGCGGGCGT GGCGCTCGGG
TTCCTCGTCC CCGGCGTCGT GCCGATGCTC GACCGGATGT CGGTCGGGAC GACCTCCATC
CCGATCGCGA TCGGCCTCAT CCTGATGATG TACCCGCCGC TCGCGAAGGT CCGCTACGAG
GAGCTCCCCC GGGTCTTCCG GAACGGCAAG GTGCTGGCGC TCTCGCTGGT GCAGAACTGG
ATCGTGGGCC CGATCCTGAT GTTCGCGCTC GCGGTGATCT TCCTCCGCGA CCGCCCGGAG
TACATGGTCG GGCTCATCCT CATCGGCCTC GCCCGCTGCA TCGCGATGGT CATCGTGTGG
AACGACCTCG CGAAGGGCGA CACCGAGTAC TGCGCCGGCC TGGTCGCGTT CAACTCGATC
TTCCAGGTGC TGTTCTTCTC GGTCTACGCC TGGATCTTCA TCACCCTGCT GCCCGGCTGG
CTCGGCGTGC GCGGCGCCGA GGTGCACATC ACCATCGGCG AGATCGCCCG GAGCGTCTTC
GTCTACCTCG GCGTCCCGTT CATCGCCGGG ATGGCGAGCC GCTTCGGGCT CCGGGCGCTG
AAGGGCGAGG AGTGGTACCG CAGGGTGTTC ATCCCCCGGA TCTCCCCGAT CACGCTCGTC
GCCCTGCTCT TCACCATCGT GGTGATGTTC TCGCTGAAGG GCGAGACCAT CGTGCAGGTG
CCGCTCGACG TGGTGCGGAT CGCCCTCCCG CTGCTCGTCT ACTTCCTGCT CATGTTCTTC
GTGTCCTTCT GGATGAGCCG GAAGGTGGGC GCGACCTACG GGCAGACCGC CACGCTCTCG
TTCACGGCCG CCTCGAACAA CTTCGAGCTC GCCATCGCGG TCGCGGTCGC CACCTTCGGC
ATGGCGCACG GCGCCGCCTT CGCCGCGGTG ATCGGCCCGC TGGTCGAGGT CCCGGTGCTC
ATCGGCCTCG TGAACGTCGC GCTCCGGCTG CGCGATCGGT GGTTCCCCGG CGAGACCGGC
GAGATCGCGA AGGTCGCGAA CTGCGCGGTG ACCGTCGAGC GGCCGACCGC CGGGCGGGGG
CGGTGA
 
Protein sequence
MSEANLVAPA RAGVARRLSF LDRYLTLWIF LAMGAGVALG FLVPGVVPML DRMSVGTTSI 
PIAIGLILMM YPPLAKVRYE ELPRVFRNGK VLALSLVQNW IVGPILMFAL AVIFLRDRPE
YMVGLILIGL ARCIAMVIVW NDLAKGDTEY CAGLVAFNSI FQVLFFSVYA WIFITLLPGW
LGVRGAEVHI TIGEIARSVF VYLGVPFIAG MASRFGLRAL KGEEWYRRVF IPRISPITLV
ALLFTIVVMF SLKGETIVQV PLDVVRIALP LLVYFLLMFF VSFWMSRKVG ATYGQTATLS
FTAASNNFEL AIAVAVATFG MAHGAAFAAV IGPLVEVPVL IGLVNVALRL RDRWFPGETG
EIAKVANCAV TVERPTAGRG R