Gene Information |
Locus tag | Hhal_0697 |
Symbol | |
ID | 4710727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 777802 |
End bp | 780720 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639855160 |
Product | hypothetical protein |
Protein accession | YP_001002281 |
Protein GI | 121997494 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.418548 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTGCGA CTGGGCGGTC TGTTGCGGTC GCGGTTTGGC TCCTGGCCGC GGCGGGGGCG TTCTGGTCGC CCAGCGCGCC TGCCCCGCTT TCTGTGGAGA GCCTCCAGGA AGAGGATGAT GGCGAGGGGG TGGAGGATAT GGTGCTGCGC GTGGAGCTGG AAGGCTACCC CCTCTCCGAG AGCGAGTATA TGCTCCAGCG CAAAGGGGAG ATCTACTACC CATTGGGCCA GCTCGCCCGC CATCTGGAGC TGGCGGTCGA GGTCGACCCG CTTGAGGGGC AGGTCTCCGG CTGGGTCCTC GATCCGGACC GGACGGTCTC CTTCGATACC GCCAGCGGCA TGGGCGAGGT CGGCGACCGC CCGCTGATCC TTGAGGAAGA GGATTGGATC CAAGAGCACG ATGACATCTA CTTCCGTGAG GATGCCATCC AGGAGTTCCT GCCCGTTGGC TTCGAATATC GCCCGCGCCG CGCTGAGGTC CACATGCGTG CGGACGAGGA ATTGCCGATC CTTAGCCGTC TGGAGCGCGA GGCCCGGTGG GCACGACTGA CCGATGACGG GGCGGGTATC GGCGGTGAGG AGTGGCTGCC CCGTGAACCG CTGCCGGCCC GTTTCATGAC CCGTCCTGCA TGGTCCCTGA GCCTCAATCA CCAGCAAAGC GGGAACGGTA GCAGCACCAG CGGTAATCTC GCAGGCGCTG GAGATCTCCT CTGGCATGAG GCGGAGTGGC GGTTACAGGC GAATGACACC GACGGAATCC GGCGCTTCGA CGGGACCCTC GGGCAGCAGG TCGGCCACCC GGCCCTGGAG CGCTACGAGC TGGGGCGGGT GCGCGCGCCC CGGTACGACC TGATCCGGCG TGGCGGCTCC GGGATGGGGG TGACCCTGAC GAACCGACCG GAGGGTATGG CGCGTACCAC GTTTACCGAT CAGTCCATCG AGGTGGAGGT TCCCGAGGGC TGGGACGTCG AGCTCTATCG CGACGGCGAT CTGGTCGACT TTGCACACGA TGTGGACACG GATCGCCACG AGTTCGAGGA CATTGACGCC CGGCCGGGCA CCAACCCTTA CACCATTGAG TTCTACGGAC CCCACGGCGA GCGCGAGACC ATCTCGCGGA CGATTGAGAT CGGCCCGGGG CTGCTGCCGC CGGGCATGGT GCACTACAGC CTGGATGCCA GCCGAGAGGG GGTCAGTCTC TATGAGCACC AGCCGCGCGA TGACCAGGAC CGGCGCGCGG CGGCGCGGAT GGACGTCGGG GTGACCGAGA CCCTGACGGT CGGCGGCGAC CTCCACTACA TGGAGCCGGA TGACGAGGAG GACGAGCTGA CGCGCCGAGA GCTGGTCGGG GCGGACGCCT CGTTCAGTGC GTTCGGGGTT TGGGGCCGTC TTCGCGGTGC CTTCGAGAAT GACCAAGGCC GGGCTTGGCA GCTCGCGCTG GAGCGCGGTC TGGGCGCGTG GAGCCTGGAC TACGAGTACA CCCTGGTGGA TGGCCTGGAG ACCGACGAGC TGCGTAGCCG CGTCAGCGGT GACTTGCGTC ACGGCCACGA GTTGCAAGTG CGCGGCTCTT TGGGGCCGCT GCGCACCCGT CTGCGCTACG AGCATCAGGA GTCCGCGGAC GGCGAGTCGA CTTGGCAACG CCTGCGCACC CGCGAAAACG TGAGCATCGG CGGTCAGCGG CTCCGGCATT CGCTGACCGT GAGCCAGGCG GATGACGGCG ACCCTTCCGC GCAGGGCAGC CTGCAGACAC GCTGGGGGCG TTATCGGGAG GGGCGGCTGA ACCTCGGTGT GAGCTATGGC 
CTGGCGCCCG AGGCGGAGCT GGATAGCGCC AACGCCGAGT ACAGTCAGAG CCTTCATGAC AACTGGCGGG GCAGTGCCCA GGTGCGCGGC AGCTTCAATG ACCGTCCTCA TAACCTCCGT CTGGGCATTT CCCGTACTGC TCGCGAGTAC TGGCGCTTCT CGGGGCAAGG TCAGGTGGAT ACGGGGGGTC GGTGGCAGGT GTCGGTCGGC CTGGATGTGG GAGGGCTGCC CCATCCTGCC GGGGGGTGGT ACCCGGATCC CGAGGCCGGG CGCGGCTACG ATCACGGGGC GGTGATGGCA AGGGTTCATC AGGATGGCGA GCCCGTGGAG GGGGTCGATG TCTGTGCAGG CCGCAGCTGC GGGGGCACCG ATGAGAACGG GGAGGCCTGG GCCGCCCGTC TGGATCCCCA CGATCCGGTC AACGTGGAGG TGGATGTGGG GTCGATCGAC AATCCCTTCG TCCAACCTGC AACCCGGGGG GTAAGCTTCG AGCCGCGCCC GGGTCGGGTA CTCCCCCTGG ATTTCGAGCT GCACCTCACG GGCGAGGTGG ATGGCGTGGT TCGTCGCCAG CGTGGCACGG CAGAGCCATC GGAGGTCTCC GGTTTCGAGA TGGAGGCGGT GGATACCGAA ACCGGGGAGG TGGTCCAGAC GGATCGCAGT GTCTTCGATG GTCTCTTCAT CCTGGATCAG CTGCCGCCGG GAGGGTATCT GGTGCGCGCT TCGGAGGAGC AGGCAGAGCG CCTGGATGTC CCGCAGACCG CCACGGCGCA GCGCATCAGC GTTGAGGGGG ATGGTGATCT GGTGAGTCAT CAGGATCTCA CGATCCATGA CGGTGGTGAG ATCGTCCGGG TCGCTTCGCC GGCAGAGGCG GTCGAGCAGA TCACCTTCAG CTACGACTCG AGCCACACCC TCCGCGATCA CCTGGCGCAA CTCCTTGATC GCCTGGGTGC GGAGCAAGTC GAAGGTGGCC TGGAGTCGCT CTACGAGGAG GTGGTGATCC TCAACGGCGA GGTCGAAGAC GGTGATCGTG TGATGGTCCC GGCGGATCGT TCGCCGCTGG ACGAGATCAC CCACTGGCAG CATCAGGAAG AATTGGAACT GGCGGAAGAG GGTGACTGA
|
Protein sequence | MGATGRSVAV AVWLLAAAGA FWSPSAPAPL SVESLQEEDD GEGVEDMVLR VELEGYPLSE SEYMLQRKGE IYYPLGQLAR HLELAVEVDP LEGQVSGWVL DPDRTVSFDT ASGMGEVGDR PLILEEEDWI QEHDDIYFRE DAIQEFLPVG FEYRPRRAEV HMRADEELPI LSRLEREARW ARLTDDGAGI GGEEWLPREP LPARFMTRPA WSLSLNHQQS GNGSSTSGNL AGAGDLLWHE AEWRLQANDT DGIRRFDGTL GQQVGHPALE RYELGRVRAP RYDLIRRGGS GMGVTLTNRP EGMARTTFTD QSIEVEVPEG WDVELYRDGD LVDFAHDVDT DRHEFEDIDA RPGTNPYTIE FYGPHGERET ISRTIEIGPG LLPPGMVHYS LDASREGVSL YEHQPRDDQD RRAAARMDVG VTETLTVGGD LHYMEPDDEE DELTRRELVG ADASFSAFGV WGRLRGAFEN DQGRAWQLAL ERGLGAWSLD YEYTLVDGLE TDELRSRVSG DLRHGHELQV RGSLGPLRTR LRYEHQESAD GESTWQRLRT RENVSIGGQR LRHSLTVSQA DDGDPSAQGS LQTRWGRYRE GRLNLGVSYG LAPEAELDSA NAEYSQSLHD NWRGSAQVRG SFNDRPHNLR LGISRTAREY WRFSGQGQVD TGGRWQVSVG LDVGGLPHPA GGWYPDPEAG RGYDHGAVMA RVHQDGEPVE GVDVCAGRSC GGTDENGEAW AARLDPHDPV NVEVDVGSID NPFVQPATRG VSFEPRPGRV LPLDFELHLT GEVDGVVRRQ RGTAEPSEVS GFEMEAVDTE TGEVVQTDRS VFDGLFILDQ LPPGGYLVRA SEEQAERLDV PQTATAQRIS VEGDGDLVSH QDLTIHDGGE IVRVASPAEA VEQITFSYDS SHTLRDHLAQ LLDRLGAEQV EGGLESLYEE VVILNGEVED GDRVMVPADR SPLDEITHWQ HQEELELAEE GD
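The length fields in this record can be cross-checked against the coordinates and sequences. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based and inclusive, that the gene sequence is listed in coding orientation (it begins with GTG and ends with TGA), and that the initiator codon is read as Met, which is why a GTG start corresponds to the leading M of the protein. The names `gene_length` and `translate_cds` are hypothetical helpers. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts.

```python
from itertools import product

# Coordinates copied from the record above.
START_BP = 777802
END_BP = 780720

# NCBI-style amino-acid string with codons enumerated over bases T, C, A, G.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def translate_cds(dna):
    """Translate a coding-orientation CDS; spaces and newlines are ignored.

    Simplification (assumption): the first codon is taken to be a valid
    start codon and is always emitted as M; a trailing stop is dropped.
    """
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues and residues[-1] == "*":
        residues.pop()  # remove the terminal stop codon
    if residues:
        residues[0] = "M"  # initiator codon (GTG here) is read as Met
    return "".join(residues)


length_bp = gene_length(START_BP, END_BP)
print(length_bp)            # 2919, matching Gene Length
print(length_bp // 3 - 1)   # 972, matching Protein Length (stop codon excluded)
print(translate_cds("GTGGGTGCGA CTGGGCGGTC T"))  # MGATGRS
```

The last line translates only the first seven codons of the gene sequence, which reproduce the first seven residues of the listed protein sequence. Passing the full gene sequence to `translate_cds` would be the natural next check.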