Gene Cphamn1_0797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0797 
Symbol 
ID6374464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp853310 
End bp854611 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content46% 
IMG OID642683305 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_001959229 
Protein GI189499759 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.541359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTCGA GGGAGCTCGA AGAAGGTCAG TCAAATCCAC GGGTGATTAT CTATTCAGGT 
AAGGGAGGGA CGGGTAAAAC CACGATCTCT TCATCCACTG CCGTTGCGCT CGCGAGAAAG
AACAAGCGCG TGCTTATTAT GTCATCCGAC CCGGCTCATT CACTTTCGGA TGTATTCAAT
ACCTCTATAA GTCGGAACGA ACCGCAGAAG ATCGAAAAAA ACCTCTACGG CCTTGAGGTT
GACACGATCC ATGAGTTGAA GAAAAACATG TCCGGATTTC AGAAGTTTGT CTCTTCGTCC
TACCAGAATC GTGGTATAGA CAGCGGCATG GCTTCTGAGC TGACGACTCA GCCGGGGCTT
GATGAGATCT TCGCGTTGAG CAGGCTGGTA GATGAGGCAC AGTCAGGGAA ATGGGACGTC
GTTGTTCTCG ATACTTCACC GACCGGTAAT ACGCTGAGGC TGCTTGCCTA CCCGGAGATT
ATCATCGGGG GCAATATGGG CAAACAGTTT TTCAAGCTCT ACAAGAGCAT GTCCTCTCTG
GCTCGCCCTA TGGGCAAGAA CTCAATTCCG GATGAAGAGT TTTTCAACGA GGTAAATGTC
CTCTTGAAGC AAATGGAGGA TATCAACAAA TTTATTCTCA GTCCTGAAGT CACGTTCAGG
CTGGTTCTGA ACCCTGAAAA ACTTTCGATT CTCGAGACAA AGCGTGCGTA TACCTTTGTT
CATCTGTATG GAATCAATAT TGACGGTATC GTTATCAACA AGATTTTGCC GACATCAAAG
ACGGTAGGAG AGTATTTCGA ATTCTGGGCT GATCTGCACA GCAAATATCT CATGGAGATC
GATAACTCAT TTTATCCTAC ACCGGTTTTT CGATGTCAAC TGCAGCGGAC GGAGCCTATC
GGCCCTGACG CGCTGCATGA GGTCAGTCAT CTGGTGTTTG GTGATCAGTC TCCGGACAAG
ATTTATTATT CGGGCAAGAA TTTCTGGATA GAATCAAAAA AAAGTTCGCA CGACCAGGAG
CATCTTGAAA TTCTTTGTAT CCGGATTCCA TTCCTCAAGG AGGCTGAAAC TGTTGAAGTG
AATCGTATGG GGACCGATAT CGTTGTCACG GTTGATCGTG CTCAGCGTAT CATAACCCTC
CCAAGGGCGC TGTACAGCCT TGAAATGGAA AAATATGTCA GGGAGGATGA TCAGTTGAAA
ATATTGTTCA AAGAGGTCCC TGTCGAAAAA GAAGAAATGG AATTGAACGT CAACAAGAAC
GTGCTGAACA AGCTTCGTTC TTTGAGGAAA ATGAAATTCT AA
 
Protein sequence
MLSRELEEGQ SNPRVIIYSG KGGTGKTTIS SSTAVALARK NKRVLIMSSD PAHSLSDVFN 
TSISRNEPQK IEKNLYGLEV DTIHELKKNM SGFQKFVSSS YQNRGIDSGM ASELTTQPGL
DEIFALSRLV DEAQSGKWDV VVLDTSPTGN TLRLLAYPEI IIGGNMGKQF FKLYKSMSSL
ARPMGKNSIP DEEFFNEVNV LLKQMEDINK FILSPEVTFR LVLNPEKLSI LETKRAYTFV
HLYGINIDGI VINKILPTSK TVGEYFEFWA DLHSKYLMEI DNSFYPTPVF RCQLQRTEPI
GPDALHEVSH LVFGDQSPDK IYYSGKNFWI ESKKSSHDQE HLEILCIRIP FLKEAETVEV
NRMGTDIVVT VDRAQRIITL PRALYSLEME KYVREDDQLK ILFKEVPVEK EEMELNVNKN
VLNKLRSLRK MKF