Gene Information |
Locus tag | Cphamn1_2040 |
Symbol | |
ID | 6375733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2199083 |
End bp | 2201812 |
Gene Length | 2730 bp |
Protein Length | 909 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642684531 |
Product | surface antigen (D15) |
Protein accession | YP_001960431 |
Protein GI | 189500961 |
COG category | [R] General function prediction only |
COG ID | [COG1752] Predicted esterase of the alpha-beta hydrolase superfamily |
TIGRFAM ID | |
|
|
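The coordinate, gene length and protein length rows above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2199083
END_BP = 2201812


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 2730 909
```

Under these assumptions the result agrees with the listed gene length of 2730 bp and protein length of 909 aa.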
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.813925 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTTCT GGAGCTGTCT TCTGGTGTTT CACGCTCCTG CACCGGTTCA GGCTGAAAAC AGGAGCGTTG TGTATCCCGA TACTCTCGAT CTGCCGGTAC GACGCGTTGC CATAGCACCC TACATGAGAG CCGCGAGAAA AACGGTGGGA CTCGCTCTTT CGGGAGGGGG GGCCAACGGT CTGGCCCAGA TCGGCGTGCT CAAGGCTCTT GAAGAAGAAA ACATCCCTGT CGACGCCATA GCAGGCACCA GCATAGGCGC TGTTATAGGA GGACTCTACA GCACCGGATA TAACGCCGAA GAACTTGAAA GGATAGCGGT CAGCATACCC TGGAAAGAAC TCCTTTCCCT TGAAAATGAC GCGCCAAGAG CTAAAACATT TCTTGAACAG CAGAAAATCC GCGACAGGGC AACCATAGCC ATACGTTTCG AGAACTTTAA ACTGGTTATC CCCAGGTCAC TGAGCTCGGC ACAAAAGCTG ACGAGAACCC TCGACCTGCT CACCATGAAC GCCCTCTACC ATCCTGAAGG CTCGTTCAGC TCCCTTCCGG TCTCATTCAA GGCCGTCACA ACCGACCTGG TATCGGGAAA ACGCGTCACG CTTTCAGAGG GGACGCTTTC AGAAGCGATG CGAGCAAGCA GTACCATACC GATTATCTTT GAGCCGATCG GGCTTGGCAA ACAGAAGCTC GCCGACGGCG GACTTGTAGC AAATCTCGCT GTAGACGAGC TGGAAAACGC TTCACTTGAT TACAAAATCG GGGTTGACAC CAGAGGCAGC ATGTATACCC TCGCGGAAGA TATCGACATC CCCTGGCAGG CGGCAGACCA GACCATGACG ATACTCATCG AACTGCAGTA CCCTCGTCAG CTTGAAAAAG CCGACCTTAT CATCACGCCG GATGTCGGCA ACAACCCTGC CATTGATTTC TCACAACTTG ACAAACTGAT GAGCGCGGGA TACGTCAGCG GAAAAGCTCT GGCCGGTAAA ATCCGCCAGG ATATCCAACT GCCCGCTCAA AGAAAAACCG GCATATCTTC GTACAAAAAA ACACTCAGGG TCTCGGGCAG CGGCAGAAAA ATAGTGAGCC GGTACCTCGC CGCGAGCGAA ATAATACGTA ACGCCTCCTA TGCGGAGGAA GCGTTGAAAG CGCTGCTTGA AACAGACCGT TTCACGAAGG TATATGCGGA GGTCAATCCT CGCCTCGGAG AGGCTGTGTT TTTTTGTGAA ACCCCGCCTT CATTTGATAC TGTTGAGCTG ATCGACCCGG TAACCGGGAT AGAACAAAAA GAACTTGACG CCTGTTTCAG GACGCTGACA GGAAAAAGCT ACACCAATGC CCAGGGAACA GAAGCGCTGG AAAAACTGGT AAAGATCTTC AGGAACAAAG GCTACCCGCT TGTCGACATC GCATGGATAC AGGTATCCGA CGACACACTT TCCATAAAGC TTTCCGAAGG AAAAATAGCC TCCCTTGACG TCTCAAAAGA CAAGAACATC ACCGGAACCA CTCCTGTCAT GAGGGAGCTG TCCATTGATA CATCAAAAAC GCTCAGGCTG AAAGATATCG AACGTTCAAT TGACAACCTG TTTGGAACAG GAGCTTTCAA CCGCGTCTCC ATCGGAATAT CAGGGAACGA CTCTCTTGCC GCCCCGACAG GGAAAAGAAC GCTCAGGATA AGGCTCAATG AAAAGCCCTC CAATGTCCTG AGACTGGGGG TTCATTACGA TAACACCTAT AATGCCCAGG CCCTCATCGA TTTCAGAAAT GAAAACCTTG GCGGAACAGC CAACTCCATC 
GGGGGATGGA TGAAAATCGG AGAAAAAAAC AACCGGGCAA ACCTTGAATT CAATATGCCG CGGATCGGCA GCACAAGCCT GACCTTCACG ACAAAACTGT TTTATGACCA GCGTAACCTG GACATACGCC ACCCTGAATT CTCCGGCAGG TTTCTCTTCC CAGGTTCTGA ACAGATCGGA CAGTTCGGCA TTCAGCGCTA CGGCATCACC TCTTCTATTG GAGCCAGAAT TGAAAAAAAC AGTCAGTTCC TGCTTGATCT GACCATGAAA AACACGCAGA CCTACAACAG GGGAGGCGGC AATTTCACCG CTGAAAACAA TGACATCGCG TCGGCGGCCG TAGAGTTCGC TCTCGACACA AGAGACAACT CCTTTCTCCC GAAAGAAGGA CGGCACACCA ATCTGCGCTA CTCGTTCACG CCTGAGCTGC TTAATGATAT CTCCTTCTGG AAACTGCTTC TTGCACATGA GGAGAACCTC TCTTTCAGCG AAAGGGTTTC AGGCCAGCTC AGACTCGCTC TCGGCCTGAG CAGCCCTGGA ATTCCTTTTT CGGAAAGGTT TTTCCTCGGC GGTGCAGGCA GTGCATACAG CAGCAGATTC ATCGGCTTGA GAGAAAATGA TCTCATTGGC GGCAACATGG TTTCCCTTGG AACGTCGTTC CGGTACAGTC CCCCTTTTGA CCTGATATTT CCCACCTCGT TTCTGTTGTA CTATACTGCA GGTAACGTCT GGGAAACCAG AGCGGCAATT TCAGCTGCGG ATCTTGTTCA CGGCATCGGA GCCGGCGTGC AATGGGATAC ACCTATCGGC CCTGCGCGCT TCACTGCCGG AAAAGCGTTC ATCACCAGTG ACGGCCAAAG CTACAGAGAC TATTCAGATT TGCGGTTTGC CGAAACGCTG CTCTATTTCA GCCTCGGGCA CGAGTTTTAA
|
Protein sequence | MLFWSCLLVF HAPAPVQAEN RSVVYPDTLD LPVRRVAIAP YMRAARKTVG LALSGGGANG LAQIGVLKAL EEENIPVDAI AGTSIGAVIG GLYSTGYNAE ELERIAVSIP WKELLSLEND APRAKTFLEQ QKIRDRATIA IRFENFKLVI PRSLSSAQKL TRTLDLLTMN ALYHPEGSFS SLPVSFKAVT TDLVSGKRVT LSEGTLSEAM RASSTIPIIF EPIGLGKQKL ADGGLVANLA VDELENASLD YKIGVDTRGS MYTLAEDIDI PWQAADQTMT ILIELQYPRQ LEKADLIITP DVGNNPAIDF SQLDKLMSAG YVSGKALAGK IRQDIQLPAQ RKTGISSYKK TLRVSGSGRK IVSRYLAASE IIRNASYAEE ALKALLETDR FTKVYAEVNP RLGEAVFFCE TPPSFDTVEL IDPVTGIEQK ELDACFRTLT GKSYTNAQGT EALEKLVKIF RNKGYPLVDI AWIQVSDDTL SIKLSEGKIA SLDVSKDKNI TGTTPVMREL SIDTSKTLRL KDIERSIDNL FGTGAFNRVS IGISGNDSLA APTGKRTLRI RLNEKPSNVL RLGVHYDNTY NAQALIDFRN ENLGGTANSI GGWMKIGEKN NRANLEFNMP RIGSTSLTFT TKLFYDQRNL DIRHPEFSGR FLFPGSEQIG QFGIQRYGIT SSIGARIEKN SQFLLDLTMK NTQTYNRGGG NFTAENNDIA SAAVEFALDT RDNSFLPKEG RHTNLRYSFT PELLNDISFW KLLLAHEENL SFSERVSGQL RLALGLSSPG IPFSERFFLG GAGSAYSSRF IGLRENDLIG GNMVSLGTSF RYSPPFDLIF PTSFLLYYTA GNVWETRAAI SAADLVHGIG AGVQWDTPIG PARFTAGKAF ITSDGQSYRD YSDLRFAETL LYFSLGHEF
|
| |
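The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 30 nucleotides of the record. This is a minimal sketch, not the pipeline used to produce the record: it assumes the sequence is read in frame from the first base, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, ignoring the spaces used for layout."""
    dna = dna.replace(" ", "").upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# First three 10-base blocks of the gene sequence shown above.
print(translate("ATGCTCTTCT GGAGCTGTCT TCTGGTGTTT"))  # MLFWSCLLVF
```

The output matches the first ten residues of the protein sequence listed in the record.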