Gene Information |
Locus tag | Cpha266_2043 |
Symbol | |
ID | 4568727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2363044 |
End bp | 2364114 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 639766624 |
Product | hypothetical protein |
Protein accession | YP_912479 |
Protein GI | 119357835 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
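The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(2363044, 2364114)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Under these assumptions the computed values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields of the record.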
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000980864 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCTCG GACTATTCAA AACAGTAGTC ACAGAATCAG GATTGGATGG AAAGCCTATT TTTCTGCTCT GCGATCTTGA GTTCATCAAT GAAATTTCAT ACAGTTACAA AAAAACACTT ACAAATTTTA TGTATAGCTG TAATCTGAAA TTTCGAATGA TTGTTTTCTG CAACATCACA CCAAACTTCC GGACGATGGT CGAATCGTTT CAGGCAGTAA TGCCGGATGG GCTTGAAACA ATAATTGTTA ATAATTATCA GGAAGCCATC GAAAATATAA TTACTTTTAA AGCAGGAACA TACAGGCATC CCGAACCGGA ATCAGAAGCC GAGCACCATG AAAAAGCAAT TAAAAAGCAT TTTCTTGCAA CTATAGCGAG AATATCGTGG TTTAATATGC TTGACCAGCA TATTGCTTTA CCTTCTGCAG ATGATAAATA CTATACATTT ATTAAGGCGA TAGAGGCAAT GCAGGCCGAC ATTCGCGAAA AAGAAAAGGA AAAAAACATG GAACTGGAAC ACATGAAACA CGACGAAGAG CAAAAACAGA CCGAAATGGT CGTGAAGCTG AACGCACAGA TTGAGCTCAA CAAAAAAGCC GCTCGCGAAC ACGAAAAGGA GATCGCGGCC CTCAAAACAA GAATCGCCAC ACAGGATATG GAGCTGACAA GAGTGTCGAC AGCGATAGCT GAAAAAACCA TGTCGCTGCG CAACCTGCTC GACAAAATCT ACGCGCTCGA TATTGATACC GATGTAAAAA GACAGATGAC CGACTCCTGC CTCAGCCTGA TCGAAACCGA AACCATCGAA AAACGGCTCA ACATTGAGCT CACCGAATCC GACTCGGTAT TTCTGTCAAG ACTGCAGAAA AAACATCCGC ATCTCAACCA GAGAGAGCTT CGCATCAGCC TGCTCGTCAA ACTGAACTAC GACACCAAAG AAATCGCCCG TTCAGTCGGC ATCTCCACCA GAGGCATGGA AAGCATTCGC TACCGGATGC ACAAAAAACT GGGACTCGGA AAACACCAGT CAATAAAAAC CTACCTCTCC GACCTGGCCG CCTCATTCTG A
|
Protein sequence | MDLGLFKTVV TESGLDGKPI FLLCDLEFIN EISYSYKKTL TNFMYSCNLK FRMIVFCNIT PNFRTMVESF QAVMPDGLET IIVNNYQEAI ENIITFKAGT YRHPEPESEA EHHEKAIKKH FLATIARISW FNMLDQHIAL PSADDKYYTF IKAIEAMQAD IREKEKEKNM ELEHMKHDEE QKQTEMVVKL NAQIELNKKA AREHEKEIAA LKTRIATQDM ELTRVSTAIA EKTMSLRNLL DKIYALDIDT DVKRQMTDSC LSLIETETIE KRLNIELTES DSVFLSRLQK KHPHLNQREL RISLLVKLNY DTKEIARSVG ISTRGMESIR YRMHKKLGLG KHQSIKTYLS DLAASF
|
| |
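The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The sketch below is an illustration only, not the pipeline that produced this record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `translate`, `gc_fraction`, and `clean` are hypothetical.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G
# (third position varying fastest). Assumed to apply to table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGATCTCG GACTATTCAA AACAGTAGTC"))  # MDLGLFKTVV
```

Passing the full gene sequence to `translate` and `gc_fraction` lets a reader compare the results against the Protein sequence and GC content fields of the record.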