Gene Information |
Locus tag | Hhal_0489 |
Symbol | |
ID | 4710720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 557137 |
End bp | 558627 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639854947 |
Product | hypothetical protein |
Protein accession | YP_001002078 |
Protein GI | 121997291 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
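The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon (the record lists "Is gene spliced | No"). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(557137, 558627)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed Gene Length (1491 bp) and Protein Length (496 aa).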
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACGAT ACCACGCCCA CCGCCGCCGG CCGGCCCTGC TGGCCACCGG CCTGGGCAGT GCGACGCTGC TGCTCGGCAG CGCCCACGCC CAGTCGCCCG CTGACGACCC CGCCGTGGAG CCGGGTCCGG AGCCCACGGT CAGCGCCGAT CCGGGGCCGA TGCAACGGGC CAGCGCCTCC AGCGAACAGA TCGGCGCCGG CACGCTGATG AACCCGTCGA TCTCGGTCAT CTTCGACGGC GTCTACGGCA ACGAGTTCTC CGGCCACGTC GGCGACCCCG GCGGCTTCGG CATGGGCCAC AGCCACGGGC ATGGCCACGG CCACGATCAC GCCCACGGCA TTGAGGACGG CTTCCAGCTG CGCGAGACGG AGTTCGCCTT CGAGGCCTCG GTGGACCCGT ACTTCGACGC CTTCGCCATG CTGGTGGTCG AGGGCACCGA CCACATCGAC CTGGAGGAGG CGTACTTCAC CACCCGCGCC CTGCCCTGGG GCCTGCAGGT GAAGGCCGGG CGCTTCCTCT CGGATATCGG CTACATCAAC AGCCAGCACC CCCACGAGTG GGACTTCGTG GACCGCCCGC TGGTCAGCGA ACACCTCTTC GGCGACCACG GCATCCAGGA GACCGGCGTG CAACTCAACT GGCTGGCACC GACCCGGACC TACCTGAAGT TCGGCGCCGA GATCCTGGAG GGGGAGACGA GCGGGATTGC GGCCTATGAG GGCGAGACAA GCACACGGCC CGGTTGGATC GATGGAGACG GCGCGCCGGA GCGCCACAGG ACGGAAGAAT TGGATCTCCC CTTCTCCGAC TCAACGGGAC CTCGGCTTGC CACGCTCTTC GCCAAGTGGG GGCCCGACCT GGGCTTCAAC CACGCCGCGC AGTTTGGCGC CTCGGCTGGA TACGCCAGCG CGTGGCAACG GATGGAAGAG CACAGCGAAG GCCTTCGCGT CGAAGCCTGG GATGGGGACG CCTGGTTTGC TGGCCTCGAT GCCGTCTACA AGTTCGATCC GCCAGGTAGC TACCAGGGCG CCGGCCAACT GACGCTCCAA GGCGAGTACT TCTACAGGAA CATCGATTCC GATTTCTATT ACTACAACCA CGACGATGCC AACAACTGGG AACGAGAGAC TGCAGATGCC GACGGGACCT CGGGCAGTTT CAAACAGGAT GGTCTCTACG TGCAGGCCGT CTATGGCATT GCCCCGCGCT GGCGCTCCGG CATCCGCGCC GAGGCGCTGG GCCTGCTCGA GAACCAGGCG TGGCATGATC GGGATGACGG CAACGGCTAC ACGGACCTGG ACACCTCCTA CCGCTACTCC GCCAATGTGA CCTTCTACCC GTCGCACTTC TCCTACATCC GGGCGCAGGT GAACTATTCG GACTTTGCTG ACGGCACGCC GGACGACCCG GATACGCACG ACGAGGACGC CTGGCAGGTC ATGCTCCAGT ACAACCTGAG CCTCGGCGCC CACGGCGCCC ATCCGTTCTG A
|
Protein sequence | MPRYHAHRRR PALLATGLGS ATLLLGSAHA QSPADDPAVE PGPEPTVSAD PGPMQRASAS SEQIGAGTLM NPSISVIFDG VYGNEFSGHV GDPGGFGMGH SHGHGHGHDH AHGIEDGFQL RETEFAFEAS VDPYFDAFAM LVVEGTDHID LEEAYFTTRA LPWGLQVKAG RFLSDIGYIN SQHPHEWDFV DRPLVSEHLF GDHGIQETGV QLNWLAPTRT YLKFGAEILE GETSGIAAYE GETSTRPGWI DGDGAPERHR TEELDLPFSD STGPRLATLF AKWGPDLGFN HAAQFGASAG YASAWQRMEE HSEGLRVEAW DGDAWFAGLD AVYKFDPPGS YQGAGQLTLQ GEYFYRNIDS DFYYYNHDDA NNWERETADA DGTSGSFKQD GLYVQAVYGI APRWRSGIRA EALGLLENQA WHDRDDGNGY TDLDTSYRYS ANVTFYPSHF SYIRAQVNYS DFADGTPDDP DTHDEDAWQV MLQYNLSLGA HGAHPF
|
| |
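The gene and protein sequences above are related by translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the tables differ in permitted start codons). The following is a minimal sketch, using plain Python rather than a bioinformatics library, of how the space-separated 10-base blocks could be cleaned, translated, and checked for GC content. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record above.
prefix = "ATGCCACGAT ACCACGCCCA CCGCCGCCGG"
print(translate(prefix))    # first residues of the protein
print(gc_fraction(prefix))  # GC fraction of this prefix only, not the whole gene
```

Applied to the full gene sequence, `translate` should stop at the terminal TGA codon, and `gc_fraction` can be compared with the GC content field of the record.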