Gene Csal_2005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2005 
Symbol 
ID4027089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2264396 
End bp2265337 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content69% 
IMG OID637967200 
Productarsenite-activated ATPase (arsA) 
Protein accessionYP_574055 
Protein GI92114127 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.41819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAGA GCCTTTTCTT CGGCGGCAAG GGCGGGGTCG GCAAGACCAG CTGCGCCACC 
GCCTACGCGC TTGGCTGCGC GGCGGCCGGC TGGCGCACCC TGCTGGTCTC CACGGACCCG
GCGCACAATC TGGCCGATCT GTTCGGACGC GCGCCGGGGC CGACCCCCAC ACGCATGCAG
GCCGGCCTCG ATGTCGTCGA ACTCGATCCC GACCACGAAA CGCAACGCTA CCTGGAGCAG
GTCAAGGCCA CGCTGCGTCC TCTGGTCAGC GGCGAGCGCA GCGCGACGGT GTTTCGCCAG
CTCGATCTGG CGCGGCACGC GCCCGGCACC GAGGAAGCCG CCCTCTTCGA TGCCCTGGTC
GGCCTGTTGC TGGATACCGG CGAGAAGTAC GATCGGCTGA TCTTCGACAC CGCCCCCGGC
GGGCACACGG TTCGCCTGCT GGCGCTCCCC GAGATCATGG GCGCCTGGGT GGAAGGCTTG
ATGCAACGCC GCCGCAAGGT GCGCAGCGAC TACAAGGCGT GGCTGGGCGA CGGCGAGGTG
GTCGACGATC CCATTCAGGA AACGCTGATG CGTCGTCGCG GACGCCTCGC CGCCGCCCGC
GAGCACTTGA CCTGCCCTGC ACACTCGGCG GTGATCCTGG TCGCCAACCC GGAACGCCTG
CCGGCACTGG AAACCGCGCG CACCCGCGAG CTGCTCGAAA GCCATGGCCT GCACGTCGGC
GCCGTGGTGA TCAACAAGTG CCTGCCCGCC GAGGTCGATA GCCAATGGCT CGCCAACTGG
CGCGAGGAAC AACGCCCCTG GATCGAACAT CTCGAGGCAT CTTTCCCCGA CCGCGAGCGC
ATTCGCATCG ACCACCAGCC CCATGCGCCC GCGTCCTGCA ACGACCTAGC CCCTCTCCAG
GAGGCACTGG GCCGACTCGC TCCCTGGCAT GACCACGCCT AG
 
Protein sequence
MAKSLFFGGK GGVGKTSCAT AYALGCAAAG WRTLLVSTDP AHNLADLFGR APGPTPTRMQ 
AGLDVVELDP DHETQRYLEQ VKATLRPLVS GERSATVFRQ LDLARHAPGT EEAALFDALV
GLLLDTGEKY DRLIFDTAPG GHTVRLLALP EIMGAWVEGL MQRRRKVRSD YKAWLGDGEV
VDDPIQETLM RRRGRLAAAR EHLTCPAHSA VILVANPERL PALETARTRE LLESHGLHVG
AVVINKCLPA EVDSQWLANW REEQRPWIEH LEASFPDRER IRIDHQPHAP ASCNDLAPLQ
EALGRLAPWH DHA