Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1568 |
Symbol | |
ID | 4711393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1708337 |
End bp | 1709416 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639856032 |
Product | hypothetical protein |
Protein accession | YP_001003134 |
Protein GI | 121998347 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCA GATACCCGTG GACTGTCGAG CGACTCGACT TGAACTCCAA CGAGCTTCTC CCCGATGACT GGCAGCGACT CAATCAACGC TGTTGTGGCG CCCACCCGCT GTTATCCGCC GATTTCTTCA CCGCTCTGCT CCGCCACTTT CCTGGCGAAG ACGTCTACCT AGCCGCGCTG CATGATCGCG AGCTACCGAT CGCTATGGCG CTGTTAACCC CCCTCACCGC GATGCGCTGG GCACTCTACC ATCCGAGCCA GGCCCCTTTG GGACCCATCC TGCTCGATTC GTCACAAGTG GATCCCGATC GCGCCCTGGC AGACCTCCTC GATACCCTAC CGGGGCAGAC CCTGCAACTG GATTTAACTC AGCTAGATCC CACTTTTTTC CCTGTGGCGT TCGACTCGAA TCGAGTCGAG AGGATCCACT ATGTCACGAC TATGGCGGTA TCCCCTGAGG GCAGCTTCGA AACATACTTA AGCAACCGAC CGCGCAAGTT ACGCTCCAAC CTGCGACGCT ACCGGCGCCG ACTCGAAGAG GCAGGCATGC AAATCGAACT GAAGTGCCTG ACCGACCCGC AGGCGGTAGC AGAGACAGTC GATGAGCACG GCAGATTGGA GAGTTCCGGA TGGAAGGGTG AAATCGGCAC GGCCATGCGC CCGGACAACT CTCAAGGACG ATTTTATCGT GACTTGATGA AAGATCATGC ATATGAGGGT AAGGGGTGGG TTTATTGTCT AAACGTCGAC GGCAGCATTG CCGCCACTTG GCTGGTCATC CGAGGCGGTG GCATGATCTC GATGCTAAAA ACCGCCTATG ATGAATCCCT TGCCAAGTAC TCAGTGGGTC GAGTCCTCCT AGTTGAGACC TTGGAGGAGT TGTTTAAGAT CGATGGCGTC CAAAGCATAG AGTTCTACAC GAATGCCAAT GCGGATCAAC TGGAGTGGGC GACGGATAGC CGTTCGATCG AGCACCTCAA TTTTTACCGC CACCAGGCCG TACAGCGGTC ACGTGCGGCG GCACGCCGGG CAAACCAGGT ATTCAGGCAG ATCCAGGCCG TTTGGAAGCG AAAACCTTGA
|
Protein sequence | MSTRYPWTVE RLDLNSNELL PDDWQRLNQR CCGAHPLLSA DFFTALLRHF PGEDVYLAAL HDRELPIAMA LLTPLTAMRW ALYHPSQAPL GPILLDSSQV DPDRALADLL DTLPGQTLQL DLTQLDPTFF PVAFDSNRVE RIHYVTTMAV SPEGSFETYL SNRPRKLRSN LRRYRRRLEE AGMQIELKCL TDPQAVAETV DEHGRLESSG WKGEIGTAMR PDNSQGRFYR DLMKDHAYEG KGWVYCLNVD GSIAATWLVI RGGGMISMLK TAYDESLAKY SVGRVLLVET LEELFKIDGV QSIEFYTNAN ADQLEWATDS RSIEHLNFYR HQAVQRSRAA ARRANQVFRQ IQAVWKRKP
|
| |