Gene Cpha266_1088 details

Gene Information

Locus tag: Cpha266_1088
Symbol:
ID: 4570032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1231774
End bp: 1232961
Gene length: 1188 bp
Protein length: 395 aa
Translation table: 11
GC content: 45%
IMG OID: 639765685
Product: arsenite-activated ATPase ArsA
Protein accession: YP_911553
Protein GI: 119356909
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
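
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1231774
end_bp = 1232961

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is assumed to be the stop.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1188
print(protein_length_aa)   # 395
```

Both results agree with the "Gene length" and "Protein length" fields of the record.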


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.782242
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTAATA TCATTTTTAC CGGAAAGGGA GGCGTCGGCA AAACCTCTGT AGCTGCCGCA 
ACAGCACTGA GGGCGGCAGA AATGGGTTAT AAGACCCTTA TAATGTCTAC TGATCCGGCG
CACAGTCTGG GTGACTCGCT TGATGTTCAG TTAGGCCCAT CTCCAGTCAA GGTTGCTGAA
AATCTCTGGG GCCAGGAGGT CAGTGTTTTC GGTGATCTTA ATCTTAACTG GGATGTAGTC
AGGGAGCATT TTGCGCATTT GATGGCATCT CGCGGTATCG AGGGTGTGTA TGCAGAGGAA
ATGGGCGTTC TTCCTGGTAT GGAAGAGCTC TTTTCGCTCT CCTATATCAA ACGGTACAAT
GAAGGAAATC AGGATTACGA TCTTCTTGTC GTCGATTGTG CTCCTACCGG CGAAACGCTT
CGTCTGCTTT CGCTTCCGGA AACCTTCGGA TGGTTTATCA AGTTTATCCG TAATGTCGAA
AAGTATATGG TGAAACCGGT TATCAGACCG CTTTCAAAGA AAATCAAGAA AATTGATGAT
TTTGTCGCTC CTGAAGAGGT CTATGAAAAG GTTGATAATC TTTTCTCTTC CACGGAAGGC
ATTATTGATC TTCTTGCAGA TGGCACGAAA TCCACGGTGC GTCTTGTCAT GAACCCCGAG
AAGATGGTTA TCAAAGAGTC CATGCGCGCG TTAACCTATC TCAATCTCTA TGGAATAACC
GTTGACAGTA TTACCATCAA CAGGATTATG CCCGATCATA CCGAGGATCC TTACTTTAAA
AAATGGAGAG CTATTCAGCA GAAGTATATT GAGCAGATTA AAGGAGCATT TTCTCCGATT
CCGATTGCTG AAGTGCCTTT GTTTGATGAA GAGGTTGTTG GTCTCGATAT GCTTCGCAAG
GTTGGAGAAA AGGTTTATGC GGGTAAAAAT CCGCTTGACA TTTTCTTCAA GGAAGATCCT
ATTGATATCA AGAAGGTTGC TGATGGACAC TATAAGGTAC GCGTAAGGCT TCCATTCATG
GAAACAATGG GTATGGAACC AAAGATTCTT AAACTGGGTG ATGATTTGAC CATTCGCATC
GGCGATTATC AGAAAATCGT TGCCTTGCCG ATTTTCCTTG CCGGTCTTGA ATCAACGGGC
GCCACGTTTG AAGAAAAATG GCTGAGCATT GACTTTACAA AGCCATGA
 
Protein sequence
MRNIIFTGKG GVGKTSVAAA TALRAAEMGY KTLIMSTDPA HSLGDSLDVQ LGPSPVKVAE 
NLWGQEVSVF GDLNLNWDVV REHFAHLMAS RGIEGVYAEE MGVLPGMEEL FSLSYIKRYN
EGNQDYDLLV VDCAPTGETL RLLSLPETFG WFIKFIRNVE KYMVKPVIRP LSKKIKKIDD
FVAPEEVYEK VDNLFSSTEG IIDLLADGTK STVRLVMNPE KMVIKESMRA LTYLNLYGIT
VDSITINRIM PDHTEDPYFK KWRAIQQKYI EQIKGAFSPI PIAEVPLFDE EVVGLDMLRK
VGEKVYAGKN PLDIFFKEDP IDIKKVADGH YKVRVRLPFM ETMGMEPKIL KLGDDLTIRI
GDYQKIVALP IFLAGLESTG ATFEEKWLSI DFTKP
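
The gene and protein sequences above are related by translation, and the GC content field is a simple base count. The following is a minimal sketch of both calculations in plain Python, not the method used to produce this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard table; the two differ only in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
# Table 11 shares these codon-to-amino-acid assignments with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()

def gc_content(seq):
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate a coding sequence from its first base up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence listed above.
print(translate("ATGCGTAATA TCATTTTTAC CGGAAAGGGA"))  # MRNIIFTGKG
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content field of the record.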