Gene Information |
Locus tag | Cphamn1_1977 |
Symbol | |
ID | 6375669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2122444 |
End bp | 2124855 |
Gene Length | 2412 bp |
Protein Length | 803 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642684468 |
Product | capsular exopolysaccharide family |
Protein accession | YP_001960369 |
Protein GI | 189500899 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0489] ATPases involved in chromosome partitioning [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
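The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon; both assumptions are inferred from the numbers in this record rather than stated by it. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(2122444, 2124855)
    print(length)                  # 2412
    print(protein_length(length))  # 803
```

Under these assumptions, 2124855 - 2122444 + 1 gives 2412 bp, and 2412 / 3 - 1 gives 803 aa, matching the Gene Length and Protein Length rows.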
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0664452 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAATA ACGATCCGAA TCAATTCGAA CAGGAAATAA ACATTCAGGA GCTGCTCCAG GTTCTCTGGC AGAACCGGCT GATCATCGGA GCGGTGACCG GGGTTGTTCT TGTACTGGTG ATGCTGTACC ATTTCTCGGC GACTCCCGAG TATCGATCGA CGTCAGTCGT GCTCATCAAA TCCGACAAGG GCGGGATGAG CGAGATGATC AACCCTTTCG AGTCCATGAC CGGTTTCGAG CTGCAGAACG ACATCGAACT TGTCAAGTCG TTCCCTCTGG CCGAAGAGGT TGTCAGAGAG CTCTATGCCC GTAAAGATCG CGATTCGCTT CAGCTCTTTG GTGAAAGACG GTACATTTCC CCCATCGGTC ATCTTTTCAG CTGGATGCGC TTCGGCACAA GAGAGAAGGA AGAGAAGGAG TATGATATCA GGATGCGTGA ATACGCGACA GCCTTGCAGA AACGGATTTC CGTCAGCAAC TCTCGTGATA CTGATATTCT TCAGGTATCG GTGTCCAGTC CTTTCCCCGA TGAAGCGGCT TTGCTGACCA ACGCAATCTG CCGCTCCTAC ATGCGTAAGG ACATCGAGTG GAACGCCGAT CAGGCTATGT CGGTCAAGGA GTTTGTCAGT GATCAGCTTG CCAGTCAGCA GAAGGAGATC GGGGAGGTTG AAAACCAGCT CTCTTCCTAC ATGAAGAAGC AGAACATCTA CGAACTGACA GGCAACGCGG AAAAGCTTCT TGAAAAACTG GTTGAGGCGG AGTCACGGTA TAATGATGCC CAGGCGGAGT GCAACATCCT GAAAAATCGG CAGGACTACC TGATCAACAA GTTGTCGGCG GAAGAGAAGG CCTTGAGCGC GAAAATCGCC AAAAATGTGG ATCAGCAGTC CCGTGAGCTC AAGGAGCGTA TCAAGCAGGA AGAGAAGCAG CTTATCGCCA TGGCCGGAGA GGCCGGAACG CAGGATGCCG GCTACCTTGC CAAAAAGCAG CAGCTTGACA TGTTCAAGCA GCGGCTGCAG GAACTTACCA GCAACACGAT TGCCGGTGAA CTGGCGTTTG CCAGCAAGGC GCGTCAGTAC CAGTTCGACC TTATTTCAGA ACAGCTGCAG ACAGATGTTC GCCTTGCGGA ACTGAAATAC ATCGCTCAGG AGTTCCAGCG TTCCAAGAAC TACTATGAAA GCCAGCTCAA CCGGCTTCCT CAGAAACAGC TCAATTACGC CCGTCTGCAG CGTGACAGGG AGGTGCTCAA CAACACCTAC ACCTTCCTGA AGGAGAAGCT CGAGGAATCC CGGATCAAGA TCGCTTCCGA AGTGGGCAAG GTGGTGATTG TCGGCGCGGC ATTTCCTCCC ATAGAGCCTG TCGCTCCGGA TCTGAAGAAA AACCTGCTTA TCGGTCTTAT TCTCGGACTC GGACTTGGCG GCGCGCTGGT GTTCGTCCGT GAAATGCTCG ACCACTCCCT GAAGGATGAT GCCTTTCTCG AGGACCACGG CTATACCCCG CTTGCCGCGA TCCCATATGT GGGAGATGAA AATGAAAGCG ACCTTCAGGA TTCACTCAAA AAATCCATCA GGCAGCTGAG CGACAGCATT CCCGGTTTCG GCGCCAGCAG CAACGGAAAA GATGCAAGCC ATGTGAAAAT GAACAGCAGC GGCAAGTCGG TTCCTTCGAC CAGAGAGAGT AAACCGCGGC TTATGGCGGA CAGCCTCGCA TCCCCGTTCG CCGAAGCGTT TCGTGACCTG CGGACCAATC TTGTGTTTTC CCGGGCAGAC CGGAAACTGA AATCCATTCT GGTGACCGGC 
ACCGAGATCA GTGAAGGAAA GTCAACTGTC TGCGCGAACC TCGCGTTTGC CTTCGCGCTG AGCGGCAACC GCGTACTTAT CGTGGACTGT GACCTTCGCC GTCCCAGTCA GCACCGGATC TACAGTTGCA AAAAGACGCC CGGGCTCTCG GATTATCTTG CCGGAGTCGA GGACGATGCC GATGCGCTGA TCCAGTCGAC CATGCATGAA AACCTTTTCA TTCTTCCGGC GGGCAATAGC ACGCCGAGCC CCAACGAACT GCTCGGATCG AACAAGATGA CCGGTCTGGT AGAGCGTCTT GAAGAAGAGT GGGACTATGT GATTCTCGAC ACGCCGCCTA TGATGCTGCT CAGTGACGCC GCACTGATCT CCAGAGCCGC CGACGGCATT CTCATGGTGG TACGAATGGG CTACACCAAC AGAAACCTGC TCAAAGAGGT GCAGAAACTC GATCATATCA GAAAACAGAT GCTCGGCGTA GCCATCATCG GCCCATCAGA AAAAGCAGGG TACGGCAAAT ACGGCCGCTA CTACGGCCGC TACGGCTACA AAGGCTACTA CAGCTACAAA ACCTACAGCA GCTACATGGA ACCCGAAAAG ACCAAGGGAT GA
|
Protein sequence | MPNNDPNQFE QEINIQELLQ VLWQNRLIIG AVTGVVLVLV MLYHFSATPE YRSTSVVLIK SDKGGMSEMI NPFESMTGFE LQNDIELVKS FPLAEEVVRE LYARKDRDSL QLFGERRYIS PIGHLFSWMR FGTREKEEKE YDIRMREYAT ALQKRISVSN SRDTDILQVS VSSPFPDEAA LLTNAICRSY MRKDIEWNAD QAMSVKEFVS DQLASQQKEI GEVENQLSSY MKKQNIYELT GNAEKLLEKL VEAESRYNDA QAECNILKNR QDYLINKLSA EEKALSAKIA KNVDQQSREL KERIKQEEKQ LIAMAGEAGT QDAGYLAKKQ QLDMFKQRLQ ELTSNTIAGE LAFASKARQY QFDLISEQLQ TDVRLAELKY IAQEFQRSKN YYESQLNRLP QKQLNYARLQ RDREVLNNTY TFLKEKLEES RIKIASEVGK VVIVGAAFPP IEPVAPDLKK NLLIGLILGL GLGGALVFVR EMLDHSLKDD AFLEDHGYTP LAAIPYVGDE NESDLQDSLK KSIRQLSDSI PGFGASSNGK DASHVKMNSS GKSVPSTRES KPRLMADSLA SPFAEAFRDL RTNLVFSRAD RKLKSILVTG TEISEGKSTV CANLAFAFAL SGNRVLIVDC DLRRPSQHRI YSCKKTPGLS DYLAGVEDDA DALIQSTMHE NLFILPAGNS TPSPNELLGS NKMTGLVERL EEEWDYVILD TPPMMLLSDA ALISRAADGI LMVVRMGYTN RNLLKEVQKL DHIRKQMLGV AIIGPSEKAG YGKYGRYYGR YGYKGYYSYK TYSSYMEPEK TKG
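The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal translation sketch. The gene sequence as displayed begins with ATG, so it appears to be given in coding orientation even though the gene lies on the minus strand; the sketch relies on that observation. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which start codons it permits). The function name `translate` and the table construction are illustrative, not part of the record.

```python
# Build the 64-codon table in TCAG order (a common compact encoding).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGCCGAATA ACGATCCGAA TCAATTCGAA"))  # MPNNDPNQFE
    # Last 12 bases of the gene sequence, ending in the TGA stop codon.
    print(translate("ACCAAGGGAT GA"))  # TKG
```

The first ten and last three residues produced this way agree with the start (MPNNDPNQFE) and end (TKG) of the protein sequence listed above.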