Gene Cphamn1_2198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2198 
Symbol 
ID6375892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2376650 
End bp2377867 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content46% 
IMG OID642684685 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_001960584 
Protein GI189501114 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.416717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCT TAACATTTAC CGGAAAAGGT GGAGTTGGCA AGACAAGCGT TTCCGCTGCG 
ACAGCCGTCC GTTTGTCGCA AATGGGATAT CGTACGCTGG TATTATCCAC TGATCCTGCT
CACAGTCTAT CGGACTCTTT TAATATCTCA TTAGGGCCTG AACCAACCAA GATCAAGGAG
AACCTGCATG CCATCGAAGT GAATCCATAT GTTGATTTAA AGGAGAACTG GCAGGCTGTT
CAGAAGTATT ATACAAGGGT ATTCGCCGCA CAAGGTGTTT CAGGAGTGGT CGCTGATGAG
ATGACGATCC TGCCAGGCAT GGAAGAACTG TTTTCGCTTT TGAGAATAAA ACGCTACAAG
TCTTCGGGGC TATACGATGT ACTTGTGCTC GATACCGCTC CGACCGGTGA AACGCTTCGG
CTTCTTTCTC TTCCCGATAC CCTTTCATGG GGTATGAAGG CGGTAAAGAA TGTCAATAAA
TATATCATGA AGCCGCTCAG CAAGCCGCTT GCAAAGATGT CTGACAAGAT AGCCTACTAT
ATTCCTCCTG AAGATGCGAT TGATTCTGTC GATCAGGTTT TTGACGAGCT TGAAGATATC
AGAGAGATTC TTACCAACAA CAAGAACTCT ACCGTGAGAC TTGTTATGAA CGCGGAAAAG
ATGTCTATCA AGGAGACCAT GCGGGCACTT ACCTATCTGA ATCTCTATGG ATTCAATGTG
GATATGGTTC TTGTGAACAG ACTGCTGGAT GTCAAGGAAG ACAGCGGATA TCTTGAGAAA
TGGAAATCTA TTCAGCAGAA ATATCTTCTT GAGATCGAGA GCGGATTTAC ACCTCTGCCT
GTAAAACGTC TCAAGATGTA CGATCAGGAA ATTGTCGGGT TGCCGGCCCT CGATGTTTTT
GCCAAAGACA TGTATGGGGA TTCAGACCCC TCTCAGCTTA TGTTCGATGA GCCTCCGATC
AAGTTCGAAA GGAGTGGTGA CACCTATGAG GTTCAATTGA AGCTTATGTT CGCCAATCCG
GTTGATATCG ATGTCTGGGT TACCGGAGAT GAACTTTTCG TTCAGATAGG AAATCAGAGG
AAAATAATCA CGCTTCCGAT CAGTCTTACC GGGCTTGAGC CGGGAGATGC CGTATTCAAG
GACAAGTGGC TGCATATACC GTTTGACCTC AACAAGCAGA AAGAACATCA GAGAGAAAAG
GAATACAACA GGGCTTGA
 
Protein sequence
MRILTFTGKG GVGKTSVSAA TAVRLSQMGY RTLVLSTDPA HSLSDSFNIS LGPEPTKIKE 
NLHAIEVNPY VDLKENWQAV QKYYTRVFAA QGVSGVVADE MTILPGMEEL FSLLRIKRYK
SSGLYDVLVL DTAPTGETLR LLSLPDTLSW GMKAVKNVNK YIMKPLSKPL AKMSDKIAYY
IPPEDAIDSV DQVFDELEDI REILTNNKNS TVRLVMNAEK MSIKETMRAL TYLNLYGFNV
DMVLVNRLLD VKEDSGYLEK WKSIQQKYLL EIESGFTPLP VKRLKMYDQE IVGLPALDVF
AKDMYGDSDP SQLMFDEPPI KFERSGDTYE VQLKLMFANP VDIDVWVTGD ELFVQIGNQR
KIITLPISLT GLEPGDAVFK DKWLHIPFDL NKQKEHQREK EYNRA