Gene Paes_1119 details


Gene Information

Locus tag: Paes_1119
Symbol:
ID: 6458989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1225870
End bp: 1226916
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 51%
IMG OID: 642725114
Product: arsenical-resistance protein
Protein accession: YP_002015799
Protein GI: 194333939
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
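
The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1225870, 1226916)
print(length)                     # 1047
print(protein_length_aa(length))  # 348
```

Both results match the Gene Length and Protein Length values reported in the record.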


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.335014
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTTG CATCCAGACA ATTATCGTTT CTCGACCGCT ATCTGACGCT GTGGATTTTT 
CTCGCCATGG CGATCGGCGT CTTTTCCGGC TACCTCTTCC CTTCTGTCAC AGCATTCTGG
AGTCGTTTTC AGACAGGGAC CACCAATGTT CCGATCGCAA TCGGGTTAAT CGTCATGATG
TACCCGCCGC TGGCTAAAGT CAAATATGAG GAGCTTGGCG ATGTATTCCG CAACACCAGA
ATACTCGGGC TCTCTCTCCT CCTGAACTGG GTGATCGGCC CACTGCTCAT GTTCGGACTT
GCTGTTCTGT TCCTTTCCGA CATGCCGCAT TACATGGCCG GCCTGATTCT CATTGGCCTG
GCCCGTTGCA TAGCAATGGT CATCGTCTGG AACGATCTTG CAGGAGGAGA CAGGGAATAT
GCTGCCGGTC TGGTGGCCTT CAACTCGCTC TTTCAGGTCT TCTTCTTTTC CGTCTACGCC
TGGCTGTTTC TCTCGGTACT CCCTCCCCTG CTTGGCCTGG AGTCCTTCAA TGTCTCAATC
ACGATAGCAG AAATCGCAAG CTCAGTATTC ATCTACCTCG GCATTCCTTT TATCGCAGGG
TTTCTCACCC GCTTTTTCCT GATTCGTCTG AAAGGAGCAG AATGGTATGA GTCTGAATTT
ATTCCCCGTA TAAGTCCGCT GACACTGGTC GCGCTGCTCT TCACCATTGT CGTGATGTTC
TCGCTCAAAG GTGAATACAT CGTCACCATT CCTTTTGACG TCGTCAGAAT CGCTGTCCCG
CTCCTGATTT ACTTCGTCAT CATGTTTCTG GTATCATTTT ATCTTGGCAG AAAAGCAGGC
GCCGACTACC CTAAAACGGC AACTCTTTCG TTCACCGCTG CCAGCAACAA CTTCGAACTT
GCCATTGCGG TAGCTGTCGC TGTTTTCGGT ATCAATTCCG GCGAAGCGTT CGCCGCGGTC
ATTGGGCCGC TTGTCGAGGT TCCGGTACTT GTCAGTCTCG TCAACGTTGC GCTCTGGTTC
AAGATGAAGT TTTTTGCTGA AGCGTAA
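
The gene sequence is displayed in space-separated blocks of ten bases, so it needs cleaning before its length or GC content can be checked against the Gene Information fields. The following is a minimal sketch, assuming the sequence is pasted as plain text; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first displayed line is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Strip all whitespace (block separators, newlines) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence; paste every line to check the whole gene.
first_line = "ATGAGCGTTG CATCCAGACA ATTATCGTTT CTCGACCGCT ATCTGACGCT GTGGATTTTT"
print(len(clean_sequence(first_line)))   # 60
print(round(gc_percent(first_line), 1))  # GC percentage of this line only
```

Running the same functions on the full pasted sequence should give a length of 1047 and a GC percentage that can be compared with the reported 51%.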
 
Protein sequence
MSVASRQLSF LDRYLTLWIF LAMAIGVFSG YLFPSVTAFW SRFQTGTTNV PIAIGLIVMM 
YPPLAKVKYE ELGDVFRNTR ILGLSLLLNW VIGPLLMFGL AVLFLSDMPH YMAGLILIGL
ARCIAMVIVW NDLAGGDREY AAGLVAFNSL FQVFFFSVYA WLFLSVLPPL LGLESFNVSI
TIAEIASSVF IYLGIPFIAG FLTRFFLIRL KGAEWYESEF IPRISPLTLV ALLFTIVVMF
SLKGEYIVTI PFDVVRIAVP LLIYFVIMFL VSFYLGRKAG ADYPKTATLS FTAASNNFEL
AIAVAVAVFG INSGEAFAAV IGPLVEVPVL VSLVNVALWF KMKFFAEA
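
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, not the pipeline used by the database. It builds the codon table from the conventional TCAG ordering, and relies on the fact that translation table 11 assigns sense codons the same amino acids as the standard code (the tables differ in permitted start codons, which this sketch does not handle; this gene starts with ATG). The function name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the block separators
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and the final displayed line of the gene sequence.
print(translate("ATGAGCGTTG CATCCAGACA ATTATCGTTT"))  # MSVASRQLSF
print(translate("AAGATGAAGT TTTTTGCTGA AGCGTAA"))     # KMKFFAEA
```

The two outputs match the first ten and last eight residues of the protein sequence shown above. The final line can be translated on its own because the 1020 bases preceding it are a multiple of three, so it begins in frame.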