Gene Cphamn1_0565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0565 
Symbol 
ID6374229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp596667 
End bp597860 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content52% 
IMG OID642683080 
Productarsenical-resistance protein 
Protein accessionYP_001959007 
Protein GI189499537 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.998655 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.774064 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATG AAATTTGTCC GGCGAACGAC CGGCATATGA CCAGCATCTT TGAGAAGTAC 
CTGACCGTCT GGGTCGGGTT GTGCATTGTG TTTGGGATTT TCCTCGGTAA AATCGCCCCC
GGGCTGGCAA AAACACTGGA CGGCATGAGT ATCAATGTCA ATGGCGCACC TGTCGTGTCC
ATTCCCATTG CCATATGTCT GTTCTTTATG ATGTACCCGA TCATGGTCAA GATTGATTTC
GCCAGCGTGG TCAAGGCCGG TAAAAGCGGA AAGCCCGTTT TTCTGACGCT GTTCATCAAC
TGGTGCATCA AACCGTTCAC CATGTATGCC ATCTCCATGC TGTTTCTTGG TGTTTTCCTT
AAAGGATTGA TCGGAAACGA AGCGATGGAT CTGGTCAAAA TGCCATTTGG CCTCGACCTG
CCTGTCGGAG CGGAGCACGG CGCAGGAACC GTGATTTTGC AGGACGGTGT AAAAATGCTT
CAGATTCCTC TGTGGCGCAG CTATTTTGCC GGATGCATCC TGCTGGGAAT AGCTCCCTGC
ACAGCCATGG TTCTCGTCTG GGGATACCTG GCCCGGGGAA ATGATGGTCT GACTCTCGTG
ATGGTCGCGA TCAACTCCCT GACCATGCTC GTCCTCTACG GGGTTCTGGG CGGGTTTCTG
TTGGGTATCG GTAGGTTACC AGTTCCGTGG CAGGCTCTGC TGCTGTCCGT TGCCATCTAT
GTGGCTCTGC CGCTGATCGC CGGTTATTTC TCCCGCAAAT GGATTATCGG ACACAAGGGT
GAAAAATGGT TCAAAGAAAA GTTTCTGCAT GTTCTTACAC CGGTAACCAT CAGCGCACTG
CTGCTGACAT TGGTTTTACT GTTCAGCTTC AAGGGCGAGA CCATTCTGGC CAACCCGCTG
ACCATTCTGT GGATCTCAAT TCCGCTTTTC CTGCAAACCG TTCTCATTTT CGGACTGGGC
TACGGTGCGG CAAAAATCCT CAAACTGAAT TACGAAGATG CCGCGCCAGC TGCCATGATC
GGCGCTTCCA ACCATTTTGA AGTGGCTATC GCCACCGCAG TCATGCTGTT CGGCCTGTCA
TCCGGTGCTG CTCTTGCAAC AGTGGTCGGT GTACTGATCG AGGTGCCGGT GATGCTGATG
CTTGTTGGTT TCTGCAAGAG AACAGCGGGT TGGTTCAACA CAGAACAGTC GTAA
 
Protein sequence
MNNEICPAND RHMTSIFEKY LTVWVGLCIV FGIFLGKIAP GLAKTLDGMS INVNGAPVVS 
IPIAICLFFM MYPIMVKIDF ASVVKAGKSG KPVFLTLFIN WCIKPFTMYA ISMLFLGVFL
KGLIGNEAMD LVKMPFGLDL PVGAEHGAGT VILQDGVKML QIPLWRSYFA GCILLGIAPC
TAMVLVWGYL ARGNDGLTLV MVAINSLTML VLYGVLGGFL LGIGRLPVPW QALLLSVAIY
VALPLIAGYF SRKWIIGHKG EKWFKEKFLH VLTPVTISAL LLTLVLLFSF KGETILANPL
TILWISIPLF LQTVLIFGLG YGAAKILKLN YEDAAPAAMI GASNHFEVAI ATAVMLFGLS
SGAALATVVG VLIEVPVMLM LVGFCKRTAG WFNTEQS