Gene Cpha266_2301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2301 
Symbol 
ID4569405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2633956 
End bp2635173 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content46% 
IMG OID639766863 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_912717 
Protein GI119358073 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0024147 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTATTT TAACTTTTAC AGGTAAAGGC GGAGTAGGAA AAACCAGTGT GTCGGCTGCA 
ACAGCTGTTC GTTTATCCCA GTTGGGGTAT CGTACTCTTG TGCTTTCAAC TGATCCCGCG
CACAGTTTAT CGGATTCATT CAACCTTCCG CTTGGTGCTG AACCAACCAA AATCAAGGAG
AACCTGCACG CAATTGAGGT CAATCCCTAT GTTGATTTGA AGCAGAACTG GCAGTCAGTG
CAGAAATACT ATACAAGAAT TTTTATGGCC CAGGGGGTTT CAGGGGTTAT GGCCGATGAG
ATGACCATTC TTCCCGGTAT GGAAGAGCTT TTTTCCCTCC TCAGAATCAA ACGATATAAA
ACCGCCGGAC TTTATGATGT CCTTGTGCTC GATACTGCGC CGACCGGTGA AACGCTCAGG
TTGCTTTCCC TTCCCGATAC CCTTGCATGG GGAATGAAGG CGGTAAAAAA TATCAACAAA
TATATTGTCA GACCGCTCAG CAAGCCACTG TCAAAAATGT CTGACAGAAT TGCCTTCTAT
ATTCCGCCTG AAGATGCTGT TGAGTCTGTT GATCAGGTGT TCGATGAGCT TGAAGATATT
CGGGAGATTC TGACCGATAA TGTCAAATCT ACCGTACGTC TTGTCATGAA TGCCGAGAAG
ATGTCGATCA AGGAAACCAT GCGTGCACTT ACTTACCTTA ACCTTTACGG CTTCAAGGTG
GATATGGTAC TCGTCAACAG GTTGCTTGAT ACAAAAGAGG ACAGCGGGTA TCTGGAGAAA
TGGAAAGGCA TTCAGCAGAA ATACCTTGGC GAGATTGAAG AGGGTTTTTC TCCGCTTCCG
GTTAAAAAAC TCAGGATGTA TGAACAGGAA ATCGTCGGCC TTGATGCGCT CGAGCTTTTT
GCAAAAGACA TGTATGGCGA TTCCGATCCT TCTGATCTTA TGTACGACGA ACCTCCGATC
AAGTTTGTAA GAAACGGGGA TGTGTATGAG GTACAACTGA AGCTTATGTT TGCCAATCCT
GTCGATATTG ATGTCTGGGT CACAGGCGAT GAACTCTATG TACAGATCGG TAATCAGAGA
AAGATCATCA CGCTTCCCAT AAGTTTGACC GGACTTGAGC CCGGCGATGC CGTTTTTAAG
GATAAATGGC TGCATATTCC CTTTGACCTT AACCATCAGG GAAAGCACCA GCGGCAGCGG
CAGGGAGAGG CCGATTAA
 
Protein sequence
MRILTFTGKG GVGKTSVSAA TAVRLSQLGY RTLVLSTDPA HSLSDSFNLP LGAEPTKIKE 
NLHAIEVNPY VDLKQNWQSV QKYYTRIFMA QGVSGVMADE MTILPGMEEL FSLLRIKRYK
TAGLYDVLVL DTAPTGETLR LLSLPDTLAW GMKAVKNINK YIVRPLSKPL SKMSDRIAFY
IPPEDAVESV DQVFDELEDI REILTDNVKS TVRLVMNAEK MSIKETMRAL TYLNLYGFKV
DMVLVNRLLD TKEDSGYLEK WKGIQQKYLG EIEEGFSPLP VKKLRMYEQE IVGLDALELF
AKDMYGDSDP SDLMYDEPPI KFVRNGDVYE VQLKLMFANP VDIDVWVTGD ELYVQIGNQR
KIITLPISLT GLEPGDAVFK DKWLHIPFDL NHQGKHQRQR QGEAD