Gene Cpha266_0213 details


Gene Information

Locus tag: Cpha266_0213
Symbol:
ID: 4570594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 236076
End bp: 237254
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 48%
IMG OID: 639764813
Product: arsenite-activated ATPase ArsA
Protein accession: YP_910704
Protein GI: 119356060
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
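
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon, and the function names are hypothetical, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(236076, 237254)
print(length_bp)                  # 1179
print(protein_length(length_bp))  # 392
```

Under these assumptions the computed values agree with the reported Gene Length (1179 bp) and Protein Length (392 aa).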


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.410604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACTAA TTCTTATGAC CGGAAAAGGT GGTGTTGGAA AAACATCCAT GGCTGCAGCT 
ACCGGACTTC GGTGTGCCGA ACTTGGCTAT AAAACTCTCG TTCTCAGTAC TGATCCTGCA
CATTCGCTGG CCGACAGTTT TGATATGGCG CTCGGTCACA ACCCCAACAG AGTCTCGAAT
AATCTTTGGG GCGCAGAGCT CGATGTTCTC AAGGAACTCG AACAGAACTG GGGCACCGTG
AAACGATATA TAACCGGAGT TCTTCAGGCA AGGGGTCTTG AAGGTATTCA GGCAGAAGAA
CTTGCCATCC TTCCGGGAAT GGATGAAATT TTCGGACTGG TGAGAGTATT CCGCCACCAC
AAGGAGGGCG ACTACGATGT GCTTATCATC GACTCAGCTC CTACCGGAAC AGCATTGCGA
CTTTTAAGCA TTCCTGAGGT AGCCGGCTGG TATATGAGAA GACTTTACAA ACCTTTTGAA
AAAGTCGCGC TCTATCTCAG ACCTCTTGTC GAACCGATCT TCAGACCTCT TGCCGGCTTT
TCCTTACCAG ACAAAGAGAT GATGGATGTG CCATACGAAT TTTATGAACA AATCGACGCT
CTCGGCAAAA TCCTTACCGA CCACGCCGTC ACATCCGTCA GACTTGTCAC CAACCCAGAA
AAGATGGTTA TCAAGGAGTC CCTTCGCGCT CACGCCTATC TTGGTCTTTA TAACATCGCT
GTTGATCTGG TCATTGCCAA TCGGATCATA CCGCCAGAGG TCACCGATCC CTATTTCACA
TTCTGGAAAG AGAATCAAAC GCTCTATCGA CAAGAAATCC AGGATAACTT CGCGCCCCTT
CCCGTCAAGG AAGTCCCACT CTATTCTCGT GAGATATGCG GCATGCAGAC CCTCGAAAAA
CTCAAGGAGA TGCTTTACGG TAACGAAGAC CCTGCACAAG TCTATTATAA AGAGCAAACA
TTTCAGATAA AACAGACAAC CCAAGGATTT ACTCTGGAGC TCTATATCCC CGGAATTCCA
AAGGATCAGA TTCAGTTGGG AAAAAATGGT GACGAACTGC ACGTCCGCAT AGGTAATCAC
CGCCGCAATA TGGTGCTTCC TCAGGCACTC GCCTCACTGA AAACTACCGG AGCGGAAATG
GATGGAGATC ACCTCATCAT CCATTTTGTT GAACCATAA
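
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and how a GC fraction like the one reported in this record might be computed; the helper names are hypothetical, and the record does not state how its own GC content figure was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First printed line of the gene sequence, copied from this record.
first_line = "ATGCGACTAA TTCTTATGAC CGGAAAAGGT GGTGTTGGAA AAACATCCAT GGCTGCAGCT"
print(len(clean_sequence(first_line)))  # 60 bases per full line
print(f"{gc_content(first_line):.0%}")  # GC fraction of this line only
```

Passing the full 1179 bp sequence to `gc_content` would give the value to compare against the reported GC content.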
 
Protein sequence
MRLILMTGKG GVGKTSMAAA TGLRCAELGY KTLVLSTDPA HSLADSFDMA LGHNPNRVSN 
NLWGAELDVL KELEQNWGTV KRYITGVLQA RGLEGIQAEE LAILPGMDEI FGLVRVFRHH
KEGDYDVLII DSAPTGTALR LLSIPEVAGW YMRRLYKPFE KVALYLRPLV EPIFRPLAGF
SLPDKEMMDV PYEFYEQIDA LGKILTDHAV TSVRLVTNPE KMVIKESLRA HAYLGLYNIA
VDLVIANRII PPEVTDPYFT FWKENQTLYR QEIQDNFAPL PVKEVPLYSR EICGMQTLEK
LKEMLYGNED PAQVYYKEQT FQIKQTTQGF TLELYIPGIP KDQIQLGKNG DELHVRIGNH
RRNMVLPQAL ASLKTTGAEM DGDHLIIHFV EP
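
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a simplified illustration: it uses the standard codon assignments, which translation table 11 shares for all 64 codons, and it does not model the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final nine bases of the gene sequence in this record.
print(translate("ATGCGACTAA TTCTTATGAC CGGAAAAGGT"))  # MRLILMTGKG
print(translate("GAACCATAA"))                          # EP
```

Both fragments match the start (MRLILMTGKG) and the end (EP) of the protein sequence listed above; translating the full gene sequence the same way would allow a complete comparison.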