Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0412 |
Symbol | |
ID | 4709332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 479943 |
End bp | 481178 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639854871 |
Product | type II restriction endonuclease, putative |
Protein accession | YP_001002004 |
Protein GI | 121997217 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.135566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATCGACT CGATTTCGGA ACTTTTCGAA GGGGCGGCGG CGAAATGGCT CAGCGCCGTA GACGCCGAAC CGGACAGATC GAACCAGCAT GAGATCGGAG GCCTGCCAAG CGCCGGATTC AGGAGCCATC TCGGCGAACC CTCGAAGAAC GATGTCGCCC GGTTTCCAGC CACAATGGCT TACCTTGGGG ACCATGAGCC TCCGGAAATC GTCCGCGATA CCGTGTCTTG GTACGACTGC CGGTGGAAAC AGGCGCACCG CTCCCCAGAG TACCGGCTCT ATTACCGCAG CAATTCGGTA ACGCTGCGGC TCAGTGCCGG TGATCTGATG GTGATCGCGA AGGCCCGTGA CGGTAGTCTG CTCATCGTTT TCGCGCCGGC GGAGTCGGAT GTCGAGGCCC AGGTACGCCA AGTGTTCGGG TTCGGCGAGA TGGGGCAGCG CTTCCAGGCC GCCAGAATGC CTCCATCCTC TCTTACGCTG CCACTGAAAC TGCTGCTGGA AGACGTCGGC GTAGAAGCCT TCGAACCGGA CGATGCGGCG GACGATCTCG AACTTGTCCT GAACCGTTTC CCAGAGCGTT TTCCCTCGAC CGCAGAATTC TCCGCGCTTG CGCGTGAGGT CACTCAGGGC GATCCCGTAG GCGAGCCGGA TCGGACACTC TTGGCATGGA TGGAACGGGA GGAAGCGCTC TTCCGGGCCT ATGAGCGCGA GATCGTTCAG GAACGCCTCG AACGGGGGTT CGCCGGCGAT GTCGACGAAT TCATAAGGTT CTCACTAAGC GTCCACAATC GGCGGAAATC CCGGGTCGGT CATGCCTTCG AGAACCATCT GACTGAACTG TTTGAGCGGC ACGGCCTGCG TTTCGAAAAA GGGGGGGTGA ACCGGGTCAC GGAGAACAAA TCAAAACCTG ACTTCCTGTT TCCCGGCTTC ACCGAGTATC ACGATCCGCA GTTTCCGAAT TCCCGCCTGT TCCTCCTTGG TGCAAAGACC ACCTGCAAGG AACGATGGCG GCAGGTCCTC GCCGAGGGCG AACGACTCAA ACGGAAACAC CTCGCGACGC TCGAGCCTGG GATAAGTCGT ACCCACACCG ACGAAATGTC GGCGCACGGC CTCCAGCTTG TGGTGCCACT ACCCATCCAT GCGACATATT CGGAAACACA GCGTATGAAA ATCATGGGAA TCCAGGATTT TATCAGCCAT GTCAGGCCCA AGGTATTCAA GACTCGGCCA GGCTGA
|
Protein sequence | MIDSISELFE GAAAKWLSAV DAEPDRSNQH EIGGLPSAGF RSHLGEPSKN DVARFPATMA YLGDHEPPEI VRDTVSWYDC RWKQAHRSPE YRLYYRSNSV TLRLSAGDLM VIAKARDGSL LIVFAPAESD VEAQVRQVFG FGEMGQRFQA ARMPPSSLTL PLKLLLEDVG VEAFEPDDAA DDLELVLNRF PERFPSTAEF SALAREVTQG DPVGEPDRTL LAWMEREEAL FRAYEREIVQ ERLERGFAGD VDEFIRFSLS VHNRRKSRVG HAFENHLTEL FERHGLRFEK GGVNRVTENK SKPDFLFPGF TEYHDPQFPN SRLFLLGAKT TCKERWRQVL AEGERLKRKH LATLEPGISR THTDEMSAHG LQLVVPLPIH ATYSETQRMK IMGIQDFISH VRPKVFKTRP G
|
| |