Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1489 |
Symbol | |
ID | 4570278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1687238 |
End bp | 1688254 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639766072 |
Product | regulatory protein, ArsR |
Protein accession | YP_911937 |
Protein GI | 119357293 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.115914 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGAAC GAACAACTGA AATAAGAAAC TACATGTTGA CAGCTTGCAG GGACAGGTCG CTCAATGTCG CCGCGGAGAC TGCAAGGCAT TTCAAAATCT CACGACAGGC GGCTTCACGG CACCTGCATG CACTCGAAGA AGCCGGACTC GTCAGATCGG AGGGGGCGAA AAACGTCAAA AAATCGACAC TTATCCCTCT TAAGCAAACA TCCTGGACAT TCCTGTTGGC TGGCCTGAAA GAGGATGTGG TATGGCTTGA ATCGATAGCT CCCCTGATTG CTGATTTGCC GGGTAATGTC CGCGAAATGT GGCAATACGG TGCGACAGAA ATGATCAACA ACGCTCTTGA TCACTCAGAG GGACTTAACC TGTCAATTTA CTTTTCAAGA AACGCAATAG ATTGCGAACT GACGATTACT GACGATGGCG AGGGCATCTT CCACCGTATT CAACGATTAA CCGGCCTGTA TGATGCACGG GAATCAATCC TTGAGCTTGC CAAGGGCAAG CTGACAACCG ACCCGCAGAA CCATTCCGGC GAAGGTATAT TTTTTACATC GAGGGCATTC GACAGCTTCG TAATCATCTC AAGAAATCTG TACTTCACCC ACCGGGCAGA AAAAGATGAC TGGCTGATCG ATATCGACTC TGATACACCG GGCACGAGCA TTTATCTCAG ATTGAGCAAC ACCTGCCCTC GAACCATGAA GGAAATCTAT GACGAGTATG CCGAGCCGGA CGAGTATGCC TTTAACAAAA CAAGAGTTCC GGTAAAGCTG GCCCGATATG AGGAAGAAAA GCTCGTATCC CGATCGCAGG CAAAGAGGCT CGTATCGCGC TTCGAGAAGT TCTCAACCGT AATCCTTGAT TTTGAAGACG TTGAAGAAAT CGGACAAGCA TTTGCAGACG AAATCTTCAG AGTATTTGCA TCGAACCATC CCGAAGTGAA ACTCATTACC GCTCATGCAA CGGACGCCGT AAACAAAATG ATTCTGAGAG CTCTGGCTGT CAGGTAA
|
Protein sequence | MRERTTEIRN YMLTACRDRS LNVAAETARH FKISRQAASR HLHALEEAGL VRSEGAKNVK KSTLIPLKQT SWTFLLAGLK EDVVWLESIA PLIADLPGNV REMWQYGATE MINNALDHSE GLNLSIYFSR NAIDCELTIT DDGEGIFHRI QRLTGLYDAR ESILELAKGK LTTDPQNHSG EGIFFTSRAF DSFVIISRNL YFTHRAEKDD WLIDIDSDTP GTSIYLRLSN TCPRTMKEIY DEYAEPDEYA FNKTRVPVKL ARYEEEKLVS RSQAKRLVSR FEKFSTVILD FEDVEEIGQA FADEIFRVFA SNHPEVKLIT AHATDAVNKM ILRALAVR
|
| |