Gene Cphamn1_0091 details

Gene Information

Locus tag: Cphamn1_0091
Symbol:
ID: 6373734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 85202
End bp: 86410
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 49%
IMG OID: 642682607
Product: arsenite-activated ATPase ArsA
Protein accession: YP_001958555
Protein GI: 189499085
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
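
The coordinates and lengths listed above agree with each other if the start and end positions are 1-based and inclusive and the gene length counts one stop codon. That convention is an assumption, not something the record states. A minimal Python sketch of the check, with illustrative function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(85202, 86410)
print(length, protein_length_aa(length))  # 1209 402
```

The subtraction of one codon reflects the stop codon, which is part of the gene but not of the protein.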


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.304953
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAACA TAGTCTATAC CGGAAAAGGC GGCGTCGGCA AAACCACAAT CGCAGCAGCT 
ACGGCGTTGA AAGCCGCCAC AATGGGCTAC AAAACGCTTG TTATCTCTAC AGACCCGGCG
CACAGCCTGG GTGACTCATT CGACAGAGAG CTCGGATCGT CACCTGTAGC GATCGCAGAC
AATCTTTACG GTCAGGAGGT CAGTGTCTAT GGCGACCTGT CGCTTAACTG GGAAATAGTA
CGTGAGCATT TCGCCCACCT GATGGAAGTC CAGGGGATCA AGGGCATCTA CGTCGAAGAG
ATGGGGGTTC TGCCCGGCAT GGAAGAACTT TTTTCGCTTT CCTACATCAA GAAGTACAAC
GAATCAGATG ATTATGACCT TCTGGTGGTA GATTGTGCTC CCACAGGGGA AACCCTGCGC
CTGCTCTCTA TCCCTGAAAC CTTCGGCTGG ATGCTCAAGC TCATGCGGAA CATGGAAAAG
TACGTTGTAA AACCGCTTAT CCGCCCTATA TCCAAGCGTG TCGGCAAACT GCACGATGTC
GTCCCTGAAG AGGATGTCTA TAATCAGGTT GATCATCTCT TTTCCTCTGT CGAGGGAATC
ATCGATCTTC TTTCAGACGG CAGCAAAACA ACTGTCCGTC TGGTTATGAA TCCGGAGAAA
ATGGTCTTAA AGGAAACCAT GCGTGCCCTG ACCTACCTCA ACCTCTACGG GATAACGGTT
GACCAGATAG TGGTAAACCG CGTTCTTCTC GATGAGGTTG ACGGGAAGTT CCTGAGTGAA
TGGAAAGAGA TACAGAAAAA ATATCTGGAT CAGATCGACA GGACTTTTTC GCCGATACCG
ATCATACAGG TACCCTTTTT CAGACAGGAA GTCGTTGGCC TCGACATGCT GGAAAAAGTG
GGGGAAATAG TCTACAGAGA TTCCGACCCG CTTGATATCC TCTACCGTGA AGAGCATGTC
AACATCAAAA AACAGGATGA AGGTCACTAC ATCATGAAAC TGCGCGCCCC GTTTATCTTC
GATAACAACA TGGAAGCCAA TATCGTGCAG GTAGGGGAAT TGATGACCGT ACGCATCGGG
AACTACCAGA AAGGCGTTAT ACTCCCCGCC TTTCTTGCCG GACTCCGTGT CAGCAGCGCA
AACTATAAAG AGAAATGGCT TGTCGTTGAA TTCAAAAAGA AGGAAAAAGA CGCAACGAAG
TCTGAATGA
 
Protein sequence
MRNIVYTGKG GVGKTTIAAA TALKAATMGY KTLVISTDPA HSLGDSFDRE LGSSPVAIAD 
NLYGQEVSVY GDLSLNWEIV REHFAHLMEV QGIKGIYVEE MGVLPGMEEL FSLSYIKKYN
ESDDYDLLVV DCAPTGETLR LLSIPETFGW MLKLMRNMEK YVVKPLIRPI SKRVGKLHDV
VPEEDVYNQV DHLFSSVEGI IDLLSDGSKT TVRLVMNPEK MVLKETMRAL TYLNLYGITV
DQIVVNRVLL DEVDGKFLSE WKEIQKKYLD QIDRTFSPIP IIQVPFFRQE VVGLDMLEKV
GEIVYRDSDP LDILYREEHV NIKKQDEGHY IMKLRAPFIF DNNMEANIVQ VGELMTVRIG
NYQKGVILPA FLAGLRVSSA NYKEKWLVVE FKKKEKDATK SE
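
Both sequences are printed in blocks of ten characters, so the blocks have to be joined before the sequence can be used elsewhere. A minimal Python sketch, assuming the blocks are separated only by spaces and line breaks. The helper names are hypothetical, and the two fragments are the first and last lines of the gene sequence above:

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_line = "ATGAGAAACA TAGTCTATAC CGGAAAAGGC GGCGTCGGCA AAACCACAAT CGCAGCAGCT"
last_line = "TCTGAATGA"

fragment = clean_sequence(first_line + "\n" + last_line)
print(len(fragment))                # 69
print(fragment[:3], fragment[-3:])  # ATG TGA
```

Applied to the full gene sequence, `clean_sequence` should return 1209 characters, and `gc_fraction` can be compared against the GC content reported in the Gene Information section.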