Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1556 |
Symbol | |
ID | 4710921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1691439 |
End bp | 1692575 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639856020 |
Product | cellulose biosynthesis protein CelD |
Protein accession | YP_001003122 |
Protein GI | 121998335 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCCCT CGGTTCAGCG ACAGCCTTTC GCGCAAGTGG CCTCTGCCTG GGATCGCCTG GCGCAAGATG CTCTCCCGAA TCCGTTCCTG ACGACGGCCT GGCACCGAAC CCTCCACAAA TTAGCTCCAG ACGACCCGGT CCCCCCGGCC CTGGAAACCG TCCTGTACCA GTGGGGCGGC GAGCCGAGAG CCTTGGCCAC GCTCGGCAGG GCCAGGGTTC GTCGGGCCCT TGTTTTCAGC AGTCGGATGC TCTTTCTCAA CGAGACGGGC GATCCGCGCC TTGACTACTT GACGGTCGAG CACAACGCGC CGCTGGCCCC GGTGGGGGCC GAGGCGAAGG CGTTCGCCGG CATGGTCGAG GGTCTGCTCA CGGACACAGA CTGGGACGAA CTCTGCCTGG GGTGGGTCGA GGCGGATCGT TGGCGGGCGT GCTGGTTGGA GTGTTCCCAC TTGCCGCTGA TGCCGGTGGT GATAGATCGT CGACCCTACT ATTTTCGCGG GTTGCAGTCT CGGGATGCCA GGCCGGATCA ACTCCTCAGC AGTCTCAGCA GCAACACGCG GCAACAGATT CGGCGGTCGA TCCGGCAGTA CGGTGGGCTC GACGCCCTCG CCTTTGAGGT CGCCACGGAC CCGGCCATGG CCGTGCGGTG GTTCGAGCAT ATGGTCGAGC TGCATCAAGC GCGCTGGCAG GCGCAGGGCA AGGTCGGCGC TTTCGCCGAT CCGTTCATGC GTGCATTCCA TGAGCACCTA ATCGAGGCGG GCGCCCAGGA TGGTAGCGCT CGCATGATCC GGGTGCAGAC CTCGGAGCGG GTGATTGGCT ATCTCTACAA CCTGCGGGCG GGGGGCTATG AGTGCAATTA CCAGAGTGGC CTCGCTTATG AGGCAGATCC CCGCAGCAAG CCGGGGCTCG TGAGCCATAT CCTCGCCATG GCGGCCGCGG CGGAGACCGG GGTCCACTGC TACGATTTCC TCGTTGGTGA GAGCCAGTAC AAGCGCAGCC TGGCTAGCGG GAAGGGAGAG ATGCTGCGGG TCTCCCTGCA GAGGCGGCGA CCGATGCTGT GGTTGGAGCG GCAACTTCGC GCAGCCCGGG ATCGGATCCT GCAAAAGAGG GGGCGCGAAC GAAAGGAGGC GTGGTAA
|
Protein sequence | MEPSVQRQPF AQVASAWDRL AQDALPNPFL TTAWHRTLHK LAPDDPVPPA LETVLYQWGG EPRALATLGR ARVRRALVFS SRMLFLNETG DPRLDYLTVE HNAPLAPVGA EAKAFAGMVE GLLTDTDWDE LCLGWVEADR WRACWLECSH LPLMPVVIDR RPYYFRGLQS RDARPDQLLS SLSSNTRQQI RRSIRQYGGL DALAFEVATD PAMAVRWFEH MVELHQARWQ AQGKVGAFAD PFMRAFHEHL IEAGAQDGSA RMIRVQTSER VIGYLYNLRA GGYECNYQSG LAYEADPRSK PGLVSHILAM AAAAETGVHC YDFLVGESQY KRSLASGKGE MLRVSLQRRR PMLWLERQLR AARDRILQKR GRERKEAW
|
| |