Gene Information |
Locus tag | Cpha266_0678 |
Symbol | |
ID | 4569832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 773777 |
End bp | 775078 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639765276 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_911157 |
Protein GI | 119356513 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
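The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(773777, 775078)
print(length, protein_length_aa(length))  # prints "1302 433"
```

Under these assumptions the result agrees with the Gene Length (1302 bp) and Protein Length (433 aa) fields.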
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.689675 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTTATCGA GGGACTTAAC GGAAAATCAG TCTCAGCCGA GAGTTATCAT TTATTCCGGA AAGGGCGGAA CGGGAAAAAC CACGATATCT TCGTCAACAG CCGTAGCGCT TGCAAGGCAG AACAAGAAAG TGCTTATCAT GTCGTCCGAT CCGGCACACT CCTTGTCGGA TGTCTTTGAT ACGCAAATAA GTCGTAATGA TCCGCAGAGA ATTGAGAAAA ATCTTTACGG GCTCGAAATT GACACGATAT ACGAGCTGAA AAAAAACATG TCGGGGTTCC AGAAATTTGT CTCTTCTTCC TATAAAAACC AGGGGATTGA CAGCGGCATG GCCTCTGAAT TGACAACGCA GCCTGGTCTT GACGAGATTT TTGCTCTGAA TCGTCTGGTT GATGAAGCCC AGTCCGGAAA ATGGGATGCC GTGGTGCTCG ATACTTCCCC GACAGGCAAT ACCCTTCGCC TGCTTGCCTA TCCTGAAATT ATTATTGGCG GCAATATGGG CAAGCAGTTT TTCAAGTTGT ACAAAAGCAT GTCATCTCTT GCCCGTCCAC TGAGTGGTAA CTCGATTCCT GATGGAGAGT TTTTTAACGA GGTCAATGTA CTGCTCAAGC AGATGGAGGA TATCAACAAA TTTATTCTCA GTCCTGAGGT TACCTTCCGT CTGGTATTGA ATCCTGAGAA ACTGTCGATT CTTGAGACGA AACGAGCATA TACCTTTATC CATCTGTATG GGATCAATAT TGATGCTATT GTTATTAACA AGATTCTTCC TACTTCGAAG ACCGTAGGTG AGTATTTTGA GTTCTGGGCT GATCTGCATA CCAAGTATCT GATGGAGATT GATAACTCTT TTTATCCGAC GCCTGTATTT CGATGCAATC TTCAGCGGAC CGAGCCTATC GGATCCGATG CACTTCATGA GATCAGCAAA CTGGTGTTTG GAGAGCAGAT TCCCGACAAG ACCTTCTATG AAGGGAAAAA TTTCTGGATC GAGAGCCGTA AAAATGCCGT CACCGAAGAT CATCGTGAGA TTCTTTGCAT CAGGATTCCC TTTCTCAAGG ATGCCGAAGA TGTGAAGGTC GAGCGAATGG GAACCGATAT TGTGGTAACC GTTGATCGGG CACAGCGGAT AATTACCCTT CCAAGAGCGC TGTACAGTCT GGATCTGGAA GAGTATCTTA TCGAGGATAA CCTTCTTCGC GTAGTATTCA AGGAGACTCC TGTCGAAAAG GATGAGGTGG AGTTGAGCGT CAACAAAAAT ATGCTTGACA AGCTTCGTTC TATGAGAAGG ATGAAGATTT AG
Protein sequence | MLSRDLTENQ SQPRVIIYSG KGGTGKTTIS SSTAVALARQ NKKVLIMSSD PAHSLSDVFD TQISRNDPQR IEKNLYGLEI DTIYELKKNM SGFQKFVSSS YKNQGIDSGM ASELTTQPGL DEIFALNRLV DEAQSGKWDA VVLDTSPTGN TLRLLAYPEI IIGGNMGKQF FKLYKSMSSL ARPLSGNSIP DGEFFNEVNV LLKQMEDINK FILSPEVTFR LVLNPEKLSI LETKRAYTFI HLYGINIDAI VINKILPTSK TVGEYFEFWA DLHTKYLMEI DNSFYPTPVF RCNLQRTEPI GSDALHEISK LVFGEQIPDK TFYEGKNFWI ESRKNAVTED HREILCIRIP FLKDAEDVKV ERMGTDIVVT VDRAQRIITL PRALYSLDLE EYLIEDNLLR VVFKETPVEK DEVELSVNKN MLDKLRSMRR MKI
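The sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before any computation. The sketch below is a minimal illustration under that assumption. It shows how the blocks could be joined, how a GC fraction could be computed, and how to check that a CDS ends in a stop codon. The helper names are hypothetical, and the stop codon set is the standard TAA/TAG/TGA set used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Join space-separated display blocks into one uppercase sequence string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if seq is a whole number of codons and its last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# The first two and last two display blocks of the gene sequence in this record.
head = clean_sequence("ATGTTATCGA GGGACTTAAC")
tail = clean_sequence("ATGAAGATTT AG")
print(len(head), gc_fraction(head), ends_with_stop(tail))
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value comparable to the GC content field. The excerpt here only demonstrates the mechanics.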