Gene Cpha266_0678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0678 
Symbol 
ID4569832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp773777 
End bp775078 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content45% 
IMG OID639765276 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_911157 
Protein GI119356513 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.689675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTATCGA GGGACTTAAC GGAAAATCAG TCTCAGCCGA GAGTTATCAT TTATTCCGGA 
AAGGGCGGAA CGGGAAAAAC CACGATATCT TCGTCAACAG CCGTAGCGCT TGCAAGGCAG
AACAAGAAAG TGCTTATCAT GTCGTCCGAT CCGGCACACT CCTTGTCGGA TGTCTTTGAT
ACGCAAATAA GTCGTAATGA TCCGCAGAGA ATTGAGAAAA ATCTTTACGG GCTCGAAATT
GACACGATAT ACGAGCTGAA AAAAAACATG TCGGGGTTCC AGAAATTTGT CTCTTCTTCC
TATAAAAACC AGGGGATTGA CAGCGGCATG GCCTCTGAAT TGACAACGCA GCCTGGTCTT
GACGAGATTT TTGCTCTGAA TCGTCTGGTT GATGAAGCCC AGTCCGGAAA ATGGGATGCC
GTGGTGCTCG ATACTTCCCC GACAGGCAAT ACCCTTCGCC TGCTTGCCTA TCCTGAAATT
ATTATTGGCG GCAATATGGG CAAGCAGTTT TTCAAGTTGT ACAAAAGCAT GTCATCTCTT
GCCCGTCCAC TGAGTGGTAA CTCGATTCCT GATGGAGAGT TTTTTAACGA GGTCAATGTA
CTGCTCAAGC AGATGGAGGA TATCAACAAA TTTATTCTCA GTCCTGAGGT TACCTTCCGT
CTGGTATTGA ATCCTGAGAA ACTGTCGATT CTTGAGACGA AACGAGCATA TACCTTTATC
CATCTGTATG GGATCAATAT TGATGCTATT GTTATTAACA AGATTCTTCC TACTTCGAAG
ACCGTAGGTG AGTATTTTGA GTTCTGGGCT GATCTGCATA CCAAGTATCT GATGGAGATT
GATAACTCTT TTTATCCGAC GCCTGTATTT CGATGCAATC TTCAGCGGAC CGAGCCTATC
GGATCCGATG CACTTCATGA GATCAGCAAA CTGGTGTTTG GAGAGCAGAT TCCCGACAAG
ACCTTCTATG AAGGGAAAAA TTTCTGGATC GAGAGCCGTA AAAATGCCGT CACCGAAGAT
CATCGTGAGA TTCTTTGCAT CAGGATTCCC TTTCTCAAGG ATGCCGAAGA TGTGAAGGTC
GAGCGAATGG GAACCGATAT TGTGGTAACC GTTGATCGGG CACAGCGGAT AATTACCCTT
CCAAGAGCGC TGTACAGTCT GGATCTGGAA GAGTATCTTA TCGAGGATAA CCTTCTTCGC
GTAGTATTCA AGGAGACTCC TGTCGAAAAG GATGAGGTGG AGTTGAGCGT CAACAAAAAT
ATGCTTGACA AGCTTCGTTC TATGAGAAGG ATGAAGATTT AG
 
Protein sequence
MLSRDLTENQ SQPRVIIYSG KGGTGKTTIS SSTAVALARQ NKKVLIMSSD PAHSLSDVFD 
TQISRNDPQR IEKNLYGLEI DTIYELKKNM SGFQKFVSSS YKNQGIDSGM ASELTTQPGL
DEIFALNRLV DEAQSGKWDA VVLDTSPTGN TLRLLAYPEI IIGGNMGKQF FKLYKSMSSL
ARPLSGNSIP DGEFFNEVNV LLKQMEDINK FILSPEVTFR LVLNPEKLSI LETKRAYTFI
HLYGINIDAI VINKILPTSK TVGEYFEFWA DLHTKYLMEI DNSFYPTPVF RCNLQRTEPI
GSDALHEISK LVFGEQIPDK TFYEGKNFWI ESRKNAVTED HREILCIRIP FLKDAEDVKV
ERMGTDIVVT VDRAQRIITL PRALYSLDLE EYLIEDNLLR VVFKETPVEK DEVELSVNKN
MLDKLRSMRR MKI