Gene Information |
Locus tag | Cpha266_0397 |
Symbol | |
ID | 4568652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 439729 |
End bp | 442221 |
Gene Length | 2493 bp |
Protein Length | 830 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639764997 |
Product | surface antigen (D15) |
Protein accession | YP_910880 |
Protein GI | 119356236 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4775] Outer membrane protein/protective antigen OMA87 |
TIGRFAM ID | [TIGR03303] outer membrane protein assembly complex, YaeT protein |
|
|
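The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 439729
end_bp = 442221

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is an unspliced open reading frame with one terminal stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under those assumptions the computed values agree with the reported Gene Length (2493 bp) and Protein Length (830 aa).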
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00277353 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TGCAAAAACT GATAACCCTG GTACTGGTAG CCCTTGCCGT AAACGCTACA GGACAAACTG CCGAAGCTAA AAGCAAGCCC GTAAAACAAT CGGTAACAGC AACACAGGCA AAAACACCCG CGGAGAAAAA AGATCTCTAT ACGATAAGCG CAATTTCCTT CACCGGTCTT CAATCGCTTA ACGAGCAGGA ACTTATTGCA AGTCTTCCCA TAAAAATCGG CAACAACATC GCGGTACCCG GGGCTGAACT GTCAGCAACC ATGCAATACC TCTGGAATCT GCAGGTTTTC AAAAATATCA CCCTGGAAAA ATCCAATCCC GGCACCAAAA ACGTAGCGCT GCACTTTATT GTCGAAGAAC AGCCGTTACT TGAAGAGGTC GTTTTTAAAG GCAACGAAAA ATTCGACCTT GACAAACTGC AGAAAACCGC CGACATTCAA ACAGGTAAAA AGCTCAGTGA GCAGGAACTG CTCACGGCCG CAAACAAAAT AGAAAAGCTC TATGCCGACA AGGGGTATCT GACTGCAGGA GCGGAATACA AACTGGAGGC TATCGGAAAA AACAAGGTCA AAGCAGTTTT CACTATCACT GAAGGCCGGA AGGTAGTTAT TGAAAAAATC CGTTTCCACG GCAACAATGC CTTCAGCCAG GGCAAGCTCA GAGGGGTGTT CAAGGAGACT ACCCAAAACT CATGGTGGAG AAAAATCTTT GGAGCCCCCA AACTCGACAA GGATAAATTC GCCACCGACA AGGATCTCCT CGTCGATTTT TATCGGGAAA ACGGTTATCG GGACGCAAAA GTTGTCAGCG ACTCCATCAG CTATACCCCC GATAATAAAG GTCTCTTTCT TGATATCACG ATAGAGGAGG GACCGAAATA CCATATCGGC ACCATAACAT GGACAGGAAA CTCCAAAGAT TTCGCCACCA CAGAAATCCT TGAAAAAACA TTCGGAATAA AAACCGGCGA TCTCTATAAC GCAAAACTCA TCCAGGAAAG ACTGAACTTC TCACAGGACA ACAGCGATGT AAGCTCGATC TACCTTGACA GGGGGTACCT CTCCTACAGG GCCAACCTTG ACGAGGTGGT CGTGAATCCT GACACGGTCA ATCTTCTCAT CAGCATTCGT GAAGGCGAAC AGTATCAGTT GAATCTGGTG AACATCACCG GCAACACCAA AACCAAAGAC CATGTCATCC GCCGTGAATT GTACACCATC CCCGGTGAAA TGTTCAGCCG CAAAAACGTC ATCCGCAGTA TCAGGGAGCT TAACATGCTC AACTACTTTG ATGCTGAAAA ACTTGCTCCC GAAATTCAGC CCAACGAAGA GAATAACACA GTCGACCTGA CCTATTCGGT ATCTGAAAAA CAGAGCGACA CCTTCAACGC ATCGATCGGA TATGGAGGAT CGAGCGGCTT TACCGGCACA CTCGGCGTCA CCTTCAACAA CTTCTCGCTG CAGGATATTT TTGATGCCGA TGCCTACAGA CCTCTTCCGC ATGGTGACGG TCAGAAACTC TCCTTCCAAT GGCAGTTCGG CAGCGACAAC TACCGCACGC TTGCCCTGTC GTTTACGGAA CCATGGGCAT TCGGAGGCCC GACAACCGTT GGCTTTTCAG CCTTTAAAAC ACACCGCACC TATGACTATA CAGGCACAGA CTCATCGATT GAGAACAAAA CGATCGATCA GTACGGAACC ATTCTCACTA TCGGCAGACG TCTGACCTGG CCCGACGACT ATTTCGCCGT CGGACTTAAA CTGAAATACC TGCACAACAA AGGCGGATTT 
GTGAGCTTTA TCAACGAAAC AGGTATTAAC GTTCCGGATG AAGCCGACGA GTATTCAATC ACCGGAACAA TATCACGCAA CAGCATCGAC AGCCCGATCT ACCCGCGCCG AGGCAGCAAA AACACACTCA CCGCCCAGCT TGCAGGGGGC CCTCTTCCCG GCACTATCGA CTTCTATAAA TTCACAGGAA ACTCAACCTG GTTTTTCCCG CTGTCGAGAA AACTGGTGCT GAACATGTCT GCCCAGGCTG GATACCTCTC AACCTTTAAT AAAAGCGACT ACATCCCCTA TACAGAGTAT TTCTATATGG GAGGAAGCGG CATGTCTTCT CTGCCCACCG TTCCGATGCG CGGTTACGAC GACCGCAGTT TCGGCGCACT GCTTGAAACC GATTCCGACC TGTATGGCGG CACCATTTAC ACAAAGTTCA CAACAGAACT TCGCTATCCT ATAACGCTCT CGCCATCCGT AAGCGTCTAT GGACTCGCTT TCTTCGATGC AGGAAACCTC TGGCAAGACA GTGAATCCGT TGATTTCAGC GACCTCAAAA AGTCCGTCGG CGTAGGACTC AGGGTTTATC TTCCGATTAT CGGAATGGTT GGCCTTGATT ACGGATACGG TATGGACACC GTGCCCGGAG ACATGGAAAA AGGATGGGGA TTCATGTTCA CTTTTGGCAC ATCGACAGAG TGA
|
Protein sequence | MKKMQKLITL VLVALAVNAT GQTAEAKSKP VKQSVTATQA KTPAEKKDLY TISAISFTGL QSLNEQELIA SLPIKIGNNI AVPGAELSAT MQYLWNLQVF KNITLEKSNP GTKNVALHFI VEEQPLLEEV VFKGNEKFDL DKLQKTADIQ TGKKLSEQEL LTAANKIEKL YADKGYLTAG AEYKLEAIGK NKVKAVFTIT EGRKVVIEKI RFHGNNAFSQ GKLRGVFKET TQNSWWRKIF GAPKLDKDKF ATDKDLLVDF YRENGYRDAK VVSDSISYTP DNKGLFLDIT IEEGPKYHIG TITWTGNSKD FATTEILEKT FGIKTGDLYN AKLIQERLNF SQDNSDVSSI YLDRGYLSYR ANLDEVVVNP DTVNLLISIR EGEQYQLNLV NITGNTKTKD HVIRRELYTI PGEMFSRKNV IRSIRELNML NYFDAEKLAP EIQPNEENNT VDLTYSVSEK QSDTFNASIG YGGSSGFTGT LGVTFNNFSL QDIFDADAYR PLPHGDGQKL SFQWQFGSDN YRTLALSFTE PWAFGGPTTV GFSAFKTHRT YDYTGTDSSI ENKTIDQYGT ILTIGRRLTW PDDYFAVGLK LKYLHNKGGF VSFINETGIN VPDEADEYSI TGTISRNSID SPIYPRRGSK NTLTAQLAGG PLPGTIDFYK FTGNSTWFFP LSRKLVLNMS AQAGYLSTFN KSDYIPYTEY FYMGGSGMSS LPTVPMRGYD DRSFGALLET DSDLYGGTIY TKFTTELRYP ITLSPSVSVY GLAFFDAGNL WQDSESVDFS DLKKSVGVGL RVYLPIIGMV GLDYGYGMDT VPGDMEKGWG FMFTFGTSTE
|
| |
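The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly even though the gene lies on the minus strand. The following is a minimal sketch of such a translation, not the method used to produce this record. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code, and it ignores alternative start codons. The helper name `translate` is hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases in T, C, A, G order at each of the three positions.
BASES = "TCAG"
# Amino acid assignments for translation table 11 (same as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is removed first.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the Gene sequence field, grouped as displayed above.
print(translate("ATGAAAAAAA TGCAAAAACT GATAACCCTG"))
```

Applied to the first 30 bases of the Gene sequence field, this yields the first ten residues of the Protein sequence field (MKKMQKLITL).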