Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0208 |
Symbol | |
ID | 4710978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 240199 |
End bp | 241893 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639854667 |
Product | cellulose synthase (UDP-forming) |
Protein accession | YP_001001804 |
Protein GI | 121997017 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0946427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCAAC GCCGCTTCCT GAACCGCAGC CTGCTGATCA TCCTCACCGG CGCCGCCATC CTCAGCGTCT TCACCTACCT GATCGGTCGG ACGGCGCTGT TCCTGTTCGC CGACTACCTG TGGTACGAGA AGACCGCGGC CGCATTCCTG CTACTCGCCG AGGCGTTCAT CATGGTCCAC GCCCTCGGCT ACTTCCTGAA CATCTACCAC GCCAATCGCA CCCGGCCCGT GGACCCCGCA GCGCCCACCG CCGAACGCGA GATCCCCTCC GCCACGGCAC CGCGCTTTTC CTACGAGCAG GCCCTGAAGA TCCTCCCGAC GGAAAACCCG CCAGAACTCG CGGTGGTCGT GGCCGCACAC AACGAGCCCC TGTGGCTGAT TGAAGAGACC CTGACCTGCT TCTACAACCT GACCTACCCC AACAAGCGGA TCTTCCTGCT CGACGATACC CGCTACGACC CATCGGAACA GCGCTCCGCG GAGATGGCCA AGTACCGCCA GGCGATCGAG GACCTGTGCC AGGGCATCGG CGTCCACCTC TTCCGGCGCC CCTGGCGCGG GGCCAAGGCC GGCATGATCA ATGACTTCCT GGCCTTCCTC AACGACCGCG CGCCAGCCGG CTTCGAGTTC ACCCGCAGTC CGCTGGATGT CGAGCCCTTC AATATCGAAT ACGTGGCCGT CTTCGACGCC GATATGAACC CGTTGCCCGA ATTCGCCGAA CCGCTGATCG CCCAGCTCGA GAGCGAACCG AACCTGGCCT TCATCCAGAC ACCGCAGTAC TACTCCAACT TCGAGACCAA CCGGGTCGCG CGGGCCGCCG GCATGCAGCA GGCGATCTTC TACGAGTACA TCTGCGAGGG GAAAAGCTCC CAGGACGCCA TGTTCTGCTG CGGTACCAAT GTCATCTTCC GCCGCGCAGC GCTGGAAGAC GTCGGCGGCT TCGACGAGAC CTCGGTCACC GAGGATTTTG CCACCTCGCT GCGCTTTCAC GCACGCGGGT GGCGCTCCGC CTACATCAAC CGGCTCAGTG CGTTCGGGCA GGGTCCGCAG GACCTGGGCG CCTACTTCAA GCAGCAATTC CGCTGGGCGC TGGGCACCGT GGGGCTGTTC CGCACGGTCC TCCAGGCCAT GGTGCGCACG CCGCGCCAGC TTTCGCTGGC CAAGTGGTGG GAGTACTCCC TTTCCGGCAC CCACTACTTC GTCGGCTGGG TGTTCCTGAT CATGGTGCTC AGCCCGACCC TCTACCTCCT GCTGGGCGTG CCGAGCTTCT TCGCCCGGGC GGATATCTAC CTGCTGTTCT TCTTCCCCTA CATCCTGCTG ACGATCACCC TGTTCGCCTT CGCTATGAGT CAGCGCAAGT ATCGGCTGCG CGAGCTGGTC ACCGGGATCG TCCTTCAGGC CACGGCCTTC CCCGTCTATA TCAAGGCCAG CCTGCTGGGC ATTCTGGGGG TCCGGGGAAG CTTCACGGTC ACCCCGAAGG CCGGTAGCAA CGGGCTCCCC CTGCGCGCCC TGTGGCCCCA CCTGGCACTG ATCGTGCTGT GCACCGCGGC GATCACCTGG GGCGTACTGC GTGGCGTCTT TGAGCAGGAG CCGCTCTGGG CGCTGGTGGT CAACACCCTC TGGTGCAGCT ACCACCTCAC GCTGCTGCTG GCTGTCTTCT ACTTTAATCA TCAACCGCAG GAGCAGACGG CATGA
|
Protein sequence | MGQRRFLNRS LLIILTGAAI LSVFTYLIGR TALFLFADYL WYEKTAAAFL LLAEAFIMVH ALGYFLNIYH ANRTRPVDPA APTAEREIPS ATAPRFSYEQ ALKILPTENP PELAVVVAAH NEPLWLIEET LTCFYNLTYP NKRIFLLDDT RYDPSEQRSA EMAKYRQAIE DLCQGIGVHL FRRPWRGAKA GMINDFLAFL NDRAPAGFEF TRSPLDVEPF NIEYVAVFDA DMNPLPEFAE PLIAQLESEP NLAFIQTPQY YSNFETNRVA RAAGMQQAIF YEYICEGKSS QDAMFCCGTN VIFRRAALED VGGFDETSVT EDFATSLRFH ARGWRSAYIN RLSAFGQGPQ DLGAYFKQQF RWALGTVGLF RTVLQAMVRT PRQLSLAKWW EYSLSGTHYF VGWVFLIMVL SPTLYLLLGV PSFFARADIY LLFFFPYILL TITLFAFAMS QRKYRLRELV TGIVLQATAF PVYIKASLLG ILGVRGSFTV TPKAGSNGLP LRALWPHLAL IVLCTAAITW GVLRGVFEQE PLWALVVNTL WCSYHLTLLL AVFYFNHQPQ EQTA
|
| |