Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1010 |
Symbol | |
ID | 4570164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1148313 |
End bp | 1150511 |
Gene Length | 2199 bp |
Protein Length | 732 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639765612 |
Product | oligopeptidase B |
Protein accession | YP_911481 |
Protein GI | 119356837 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1770] Protease II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.371768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAATAC CACGTACCGC GCTCTTTTTG TTATGCTTTC TTGCCCTCAC TTCTGCGGCT TTCGCTGTAG AAAAACCTGA ACCTGTTGTT ATAAGAAAAT CCTTTCCGGA AGCTCAAAAA CCTGTTGTGG AAACATGGTG CGGCGAAACA ATAGCTGATC CTTTCCGCGC GATGGAGAAC ATCAGAAACC CGCAAGTCAT CAACTGGCTC AAGCAGGAGA GCGACTATGC CCGCAATACC CTGCAACGAA TTCCCGGAAG AGATAGTCTC ATCTCCAAAA TGCAGGAATT TGACAAAAGA AAGGTCTCGA AAGTCTATAA TCTCTCGATT ACCGAAAACG ACGTCTATTT TTATCTCAAA CAGACGCCGT CGGACGAAAC AGGCCAGCTC TACTACAGAA AAGGGTTCCA TGGCAAAGAA ACCCTGCTGT TCAACCCCTC TTCATCACCT GATAAAGAGA GCCACACGGT TATAGGTACC ATTTCTCCCT CATTTGACGG ATCAAAGGTC GCATTTACCA TTTCAAAAAA CGGCTCAGAG GATGCGGTTC TCCTGATTAT GGATGTTGCG ACAAAAAAGC TGTTTCCGGA AAAAATCACA AGGTGCCGGT TTGCATCGCC ATCATGGCTC AAAACTGGCG ATGCATTTCT TTACAATCGC CTGCGGGCTC TTGAAAAGCC TGGCGAAAAT CCACAGCACA ACAGCAAAAC ATTTCTCCAT ATACCCGGAA CGGACCCGTC AAGAGACCTT GAAATTTTTT CAAGCGCAAC AAACCCTGAA CTTGTTCTTA ATCCGGAAGA TATTCCCGAA GTGAGTTATG AGAAGGAGAG CGGGCAACTG TTTGCATTTC TCTCAAATGT AGACCGCCGC CTGACAGTTT ACTATGCGCC CCTCGATCAG CCTGGAAAAA AGAAAACAGC CTGGAAAAAG CTCTTTATCC CGAAAGATGA TGTCCACGAT TTTGCTGTCA CGGAGAAGGA TATCTACTTT TCATCGCCAA AAAAAGCATC TGGTTTCCGG TTATTGAAAA CATCCCTTGA GCATCCCGAT CTTGAGCATG CCGAATTGAT CGTTCCGGAA ACTCCTGGAG CCACGCTAAC CGGATTCACG CTCACCAGCA GGGGCATCTT CTACACACAA TCGAAAAACG GCGTCGAAGA AAAACTCTAT CATCAGGAGT ACGGAAAAAC AGAATCAAAA GAGATCATCC TGCCTTCCGC TGCCGGAACC ATTGCGTTAA GCAGCAAAGG ATTCCGCTAC CCCGACCTCT GGGTGGTCAT CGCAGGCTGG AGCAAAGACT ACCGACGCTA CCGATATGAT AGCCGAAACG AAGGAACCTT CAGCCTTGAA ACACTCTCAT CCCCCGCCTT GTACCCGGAA TATGAGCGAC TGAAAGTCGA AGAACTCATG ATACCCTCAC ACGATGGGGT TATGGTCCCG CTGTCGATTG TGTACCGGAA TGATCTTGAT AAAAAAGGAA CCAATCCCGT TCTGCTCTAC GGCTATGGGG CCTATGGCAA CGCTCTGACA CCTTTTTTCA GCCCATCACT ACTGCTCTGG ACTCATAAAG GCGGCATTCT TGCCATCGTG CACGCAAGAG GCGGCGGAGA GCTTGGTGAT GCATGGCATA CATCGGGAAT GAAAACCACG AAGCCGAACA CATGGAAAGA CCTCATAAGT TCCGCCGAGT ACCTGATCAA GGAGAAATAC TCTTCATCTC GACATATCGC AATCAATGGC GCCAGTGCCG GCGGCATTCT TGTAGGAAAA GCCATGACCG AACGTCCGGA TCTTTTTGCC GCAGCGATTC CGCAGGTTGG ACTGATGAAC CCCCTGCGGG GTGAGGAAAC GCCGAATGGT CCGGTCAACG TACCAGAGTT CGGCACGGTC AAGAAGCCCG ATGAATGCAA AGCGCTCATT GCCATGGATC CCTATCTTTC CATTATTGAT GGAGTCCGAT ACCCGGCTGC ACTGGTTACA GCAGGAATCA ACGATCCAAG AGTCAGTGCC TGGCAGCCTG CGAAATTCGC CGCCCGTCTT CAGGCTGCGA CAGCCTCAAA CAAACCGGTT CTTCTGTTCA CCGATTTCAA GGCCGGACAC GGCATGGGAA ACACGAAACA GATGGAGTTT GAATCCCTTG CCGATGTATT GAGTTTCGGA CTGTGGCAAA CCGGCCATCC TGAATTTCAA ATCAGGTAA
|
Protein sequence | MTIPRTALFL LCFLALTSAA FAVEKPEPVV IRKSFPEAQK PVVETWCGET IADPFRAMEN IRNPQVINWL KQESDYARNT LQRIPGRDSL ISKMQEFDKR KVSKVYNLSI TENDVYFYLK QTPSDETGQL YYRKGFHGKE TLLFNPSSSP DKESHTVIGT ISPSFDGSKV AFTISKNGSE DAVLLIMDVA TKKLFPEKIT RCRFASPSWL KTGDAFLYNR LRALEKPGEN PQHNSKTFLH IPGTDPSRDL EIFSSATNPE LVLNPEDIPE VSYEKESGQL FAFLSNVDRR LTVYYAPLDQ PGKKKTAWKK LFIPKDDVHD FAVTEKDIYF SSPKKASGFR LLKTSLEHPD LEHAELIVPE TPGATLTGFT LTSRGIFYTQ SKNGVEEKLY HQEYGKTESK EIILPSAAGT IALSSKGFRY PDLWVVIAGW SKDYRRYRYD SRNEGTFSLE TLSSPALYPE YERLKVEELM IPSHDGVMVP LSIVYRNDLD KKGTNPVLLY GYGAYGNALT PFFSPSLLLW THKGGILAIV HARGGGELGD AWHTSGMKTT KPNTWKDLIS SAEYLIKEKY SSSRHIAING ASAGGILVGK AMTERPDLFA AAIPQVGLMN PLRGEETPNG PVNVPEFGTV KKPDECKALI AMDPYLSIID GVRYPAALVT AGINDPRVSA WQPAKFAARL QAATASNKPV LLFTDFKAGH GMGNTKQMEF ESLADVLSFG LWQTGHPEFQ IR
|
| |