Gene Information |
Locus tag | Lcho_3070 |
Symbol | |
ID | 6162633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | - |
Start bp | 3397431 |
End bp | 3398591 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641665845 |
Product | cellulose biosynthesis protein CelD |
Protein accession | YP_001792095 |
Protein GI | 171059746 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
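The coordinate and length fields above can be cross-checked with a little arithmetic. What follows is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3397431, 3398591)
print(length, protein_length(length))  # 1161 386
```

Under these assumptions the computed values agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.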
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAACG GCTGGCAGTT TTCGTGGTCG AGCGGTATCG CAGGTTTGAG GGAACTGGCG CCAGCATGGC AGGCGCTGGC TGACTCGCTC CCCGATGCCG AGTACTTTCA ACGTCCTCAG TGGTTTCATG CGCATCAGGC GATAAACGAA AATCCGGAAA AATCGATTTG GGTTTCCGTT CATCACGAGG GCCAGTTGAA GGCTGTGTTT GCGTTGCAGT CCGTGGTGCG AAAGGTGGGG CCGCTGCGTG TGCCCGAACT TCGCTTTGTC AATCACGGGC ACATGACACT GTCAGATGTC TGTGCAGATC GGGCCGATGT GACACTGTGG CCCGCGTTCT GGAATTGGTT GCAGGGGCGT GATGCGCCCG AGTGGGACCG GTTTGTCTTG CCTCAGATTC CCGCTGATGG CGTCATGGCG GCTTGGTTGC AGCACTTCGC GCCGCAGCGG ATGTTGCACT CAGTGGCTTC CAGCAGTGCG CGCGTCGACT GCCGGCGCTC GATGGAAGAA CTGCTGAAGT CATGCAGTGC AAATCACCGC AGCAGTGTGT CACGGGGGGG CAAGCGCGCA GAAGCCCTGG GTCCTCTTCG GTATGAACTT GCCCGCAGCC CTCAGGATCT GGCTCGTCTG ATGCCGATTT TCCTCGCCAT CGAGGCGTCA GGATGGAAGG GTGCGGCGGG CAGTGCGGTG GCCAGCAACC CGGCCTTGAT GCGGTTCTAC AACGCTCTGC TGGACGGATT CGGCTCGCGC GGCCAGTGTG AAATCGACGT GCTCCATGTC GGTGAACGTC CGGTTGCGTC GGTGCTCTGG TTTCGAACCG GGCGTCAGAT CCACCTGCAG AAGATCGGCT ATCTGGAGGA ACTCTCGCAG ATCGGCCCCG GCAAGCTGCT CTTGCGCGAG ACCTTCAAGC GGGCCTGTGA AGATCCGGAA CTGGATCGTC TGTGTTTCAT CACACATCCG GCATGGGCCG ATCCCTGGCG GCCGGAGGGC AATCCCGTGC TGGAGTTCAC CTTGTTCCGG GACAACTGGC GGGGTCTGGT GCTCTACCAG TTGAACAAGG CAAAACGGGC CCGGGCAGCT CGACTCGGGC AGGCGCAGGC ACGCAAGAAC CCTGGTCCCG AGAATTCGGA GCACTCGCCT GAGGCGATTG CCGAACGATA G
|
Protein sequence | MSNGWQFSWS SGIAGLRELA PAWQALADSL PDAEYFQRPQ WFHAHQAINE NPEKSIWVSV HHEGQLKAVF ALQSVVRKVG PLRVPELRFV NHGHMTLSDV CADRADVTLW PAFWNWLQGR DAPEWDRFVL PQIPADGVMA AWLQHFAPQR MLHSVASSSA RVDCRRSMEE LLKSCSANHR SSVSRGGKRA EALGPLRYEL ARSPQDLARL MPIFLAIEAS GWKGAAGSAV ASNPALMRFY NALLDGFGSR GQCEIDVLHV GERPVASVLW FRTGRQIHLQ KIGYLEELSQ IGPGKLLLRE TFKRACEDPE LDRLCFITHP AWADPWRPEG NPVLEFTLFR DNWRGLVLYQ LNKAKRARAA RLGQAQARKN PGPENSEHSP EAIAER
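The sequences above are printed in blocks of ten characters. The following is a minimal Python sketch of how one might strip that spacing, compute GC content, and translate the gene sequence. It assumes the gene sequence is written 5' to 3' on the coding strand (it begins with ATG and ends with TAG) and uses the codon assignments of translation table 11, which match the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the record are used for illustration.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 shares these codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = "ATGTCGAACG GCTGGCAGTT TTCGTGGTCG"
print(translate(prefix))  # MSNGWQFSWS
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.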