Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1151 |
Symbol | |
ID | 4710141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1252134 |
End bp | 1255313 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639855625 |
Product | hypothetical protein |
Protein accession | YP_001002729 |
Protein GI | 121997942 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.255903 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATA GTCGCGAAGC ACCCTTCCAA CAGGACATCA TCGACGAGCT GCGCGCCGCT GGCTGGCTCG TCGGCGAGGC GGGCGGCTAC GACCGCACCC ACGCCCTGTA TCCTGAGGAT CTGATTGGCT TCGTCCAGGA GGCCTACCCG GAGCGCTGGG AGAGGTTCAC CAAGAACAAC CCGCAGCACC CCGAGCAGGC GCTGATCAAG GCGACCGTCC GGGAACTGGA GAAGCAGGGC ACCCTCGATG TGCTGCGTCA CGGCTTCAAG GTCCCCGGCG TGCGCATCGC CACCTGCAGC TTCCGGCCCG ACCACGGCAT GAACCCCGAG GCGCAGGCTC GCTACCGGGC CAACCGGCTG CGCGTGGTCC CCGAGGTCTC CTACTCCCCC CACGCCCGCA CCGGAGAGTA CAACCCGCGG CTCGATCTGG TGCTCTTCGT CAACGGCATC CCCACGGCGA CACTGGAGCT TAAGAGCGCC TTCAAGCAGT CCGTTGAGCA GGCCAAGCGG CAGTACCGCA ACGATCGCCC CCCGAAGGAT CCGGTCACCC GCAAGGAGGA GCCGCTGCTG AGCTTCAAGC GCGGCGCGCT GGTGCACTTC GCCGTCAGTC AGAACGAGGT GGCCATGACC ACCCGCCTCG CCGGCGCCGA GACGACCTTC CTCCCGTTCA ACCAGGGCAC ACCCGACGGC GGCGCCGGCA ACCCCCCGCC GCCCGACGCC GACACCTACG CCACGAGCTA TCTCTGGCGC GAGGTCTTCC AGCCGGATGC CTGGCTGAAG ATCATCGCCC GCTTCCTGCA CCTGGAGCGC AAGACCGAGG AAGAATTCGA CGGCAAGCGC AAGACCCGGG AGAGTCTGAT CTTTCCCCGC TACCACCAGT GGGAGGCGGT CAATCGCCTG CTGGCCGCCA CGGCCGAGGA GGGCGCGGGC CGGCGCTATC TAATCCAGCA CAGCGCCGGG TCCGGCAAAT CCAACTCCAT CGCTTGGGTG GCCCACCAGC TGGCCGCGCT CTACGACGAC GACGGCAACC GGTTGTTCAA CTCGGTGATC GTCGTCACCG ACCGCACCGT TCTCGACGAC CAGCTCCAGC GCACGATCTA CCAGTTCGAG CACGCCCAGG GGGTCGTCCG ACCGATCACC CGGGAGGTCG GTAACCAGAG CAAGTCCGAG CAGCTCGCCG AGGCCCTGGC CGAGCAGACG CGGATCATCA TCGTCACCAT CCAGACCTTC CCGGCGCTTT TCGATGCCCT CGACGCCCGG CCGCAGCTGG CCGAGGGGCG CTATGCGGTT ATCGCCGACG AGGCCCACTC CTCACAAAGC GGCGCGGCAG CGAGCAAGCT CAAGACCATC CTCGGTGCCG ATGCCCCGGA GACCGACGAG GTCAGCGCCG AGGAACTGCT CGATGCGGCG GTGGCAGCGC GCAAGCCCAC GGAGCGGATC AGCTACTACG CCTTTACCGC CACGCCCAAG GGCAAGACCC TGGAGCTCTT CGGCCGGCCG CCAGACCCGG AGCAAGGGCG GAGCAGCGAG AACCTGCCGC AGCCGTTCCA CGTATACTCC ATGCGCCAGG CCATCGAGGA GGGATTCATC CTCGATGTGC TCAAGAACTA CACCACCTAC CGCACCGCCT GGCGCCTGGC CCACCCGGAT GACCAGGCCT ATGAGGTCAA CTCGCGCAAG GCCTCGGCCA AGCTCGCCCG GTGGGTGAAG CTCCACCCCT ACAACATCGG CCAGAAGGTG GAGGTCATCG TCGAGCACTT CCGCACCCGG GTACGCCACC TGCTCAACGG CCAGGCGAAG GCCATGGTGG TCACCGGCAG CCGGCAGGAG GCGGTGCGCT ACGCCCTGGC CCTGCGCCAA CACGTGGAGG CGCAGGGTTA CAACGACGTG CACGCCCTGG TGGCCTTCTC CGGCAGCGTG CCAGCGGATG ACACCATCCC CGAGGAGGTC ACCGAGCACA GCGCCCAGCT CAACCCCGGG CTGAACGGCC GCGATCATGC CGAGGCGCTG GATACCGACG ATTACAACGT GATGATCGTC GCCAACAAGT ACCAGACCGG CTTCGATCAG CCGAAGCTCT GCGCCATGTA CGTCGACAAG AAGCTCCAGG GCGTGGACTG CGTGCAGACC CTCTCGCGGC TCAACCGGAT CTTCCCCGGC AAGGAGACCT TCGTCCTCGA CTTCGTCAAC GACGCCGAGG AGATCCTCGC CGCCTTCCGG CCCTACTACA ACAAGGCCGA GCTCGCCGAC GTCTCCGACG CCAACGTGGT CTTCGACCTG CAGCGGACCC TGGATGCTGC CGGGGTCTAC CACTGGGAGG AGGTCGAGCA GTTCGCCCGC GCCTTCTTCG ACCCCAAGGC CAAGAACGCG CAGCTCAGCA CTGCCTGTCA GCCGGCCAAG GAGCGTTTTA CCCAGCGCTA TAAGGCGGCA CAGGAGCAGC GCCAGGCCTG GCAGGAGGCC AAGCGCCAGG CCGAGCGCAA CGGGGACGAA GCGGGTGTCT CGCGGGCCGA ACACGAGATC AAGGAGGCCG ACGAGGCCCT GGATGAGCTG GATCTCTTCC GCAAGAACCT GCAGAGCTTC GTGCGCAGCT ACGAGTTCCT CTCGCAGATC GTCGACTACG ACGACGTGGA GCTTGAGCAG CTCTGCGTCT ATGCCAAACA CCTCCACCCG CTGCTGCGCG TCGACCGGCT TGACGAGGAG GCGATCGACC TCTCCGAACT GGCGCTGACC CACTACCGCC TGACCAAGCA CCAGGAGCAG CGGCTGCAGC TTGAGGCCCG CGACGAGGCG GGGGATTACA ACCTCCAGCC GGTCAGCGAG GTCGGCTCCG GCAAGCCGCA CGCCCCCGAG AAGAAACCCC TGGCGGAGAT CATCGAACGG CTCAACGACC TCTTCGGCGC CGAGGTGGAC GAGCAGGACA AGCTCAACTT CGCCCAGGGC GTGGCGGACC GGATCGAGCG CGACGAGGCG GTCATGGCCG AGGTCCAGCG CAACAACCCG CAGCAGCTCA TGCACGGTCA ATTCCCCGAG CGGGTCGCCG ACATCGTGCT CGATGCCATG CACGACCACG AGAAGCTGTC CATGGAGATC CTCGACGACA AGGAGAAGGG GCGCGACTTC GCGCTGCTGA TCCTGCAGCT GCTGTCGATG CGGGGGGAGC CCGCGTCGCG GCGCGTGTGA
|
Protein sequence | MADSREAPFQ QDIIDELRAA GWLVGEAGGY DRTHALYPED LIGFVQEAYP ERWERFTKNN PQHPEQALIK ATVRELEKQG TLDVLRHGFK VPGVRIATCS FRPDHGMNPE AQARYRANRL RVVPEVSYSP HARTGEYNPR LDLVLFVNGI PTATLELKSA FKQSVEQAKR QYRNDRPPKD PVTRKEEPLL SFKRGALVHF AVSQNEVAMT TRLAGAETTF LPFNQGTPDG GAGNPPPPDA DTYATSYLWR EVFQPDAWLK IIARFLHLER KTEEEFDGKR KTRESLIFPR YHQWEAVNRL LAATAEEGAG RRYLIQHSAG SGKSNSIAWV AHQLAALYDD DGNRLFNSVI VVTDRTVLDD QLQRTIYQFE HAQGVVRPIT REVGNQSKSE QLAEALAEQT RIIIVTIQTF PALFDALDAR PQLAEGRYAV IADEAHSSQS GAAASKLKTI LGADAPETDE VSAEELLDAA VAARKPTERI SYYAFTATPK GKTLELFGRP PDPEQGRSSE NLPQPFHVYS MRQAIEEGFI LDVLKNYTTY RTAWRLAHPD DQAYEVNSRK ASAKLARWVK LHPYNIGQKV EVIVEHFRTR VRHLLNGQAK AMVVTGSRQE AVRYALALRQ HVEAQGYNDV HALVAFSGSV PADDTIPEEV TEHSAQLNPG LNGRDHAEAL DTDDYNVMIV ANKYQTGFDQ PKLCAMYVDK KLQGVDCVQT LSRLNRIFPG KETFVLDFVN DAEEILAAFR PYYNKAELAD VSDANVVFDL QRTLDAAGVY HWEEVEQFAR AFFDPKAKNA QLSTACQPAK ERFTQRYKAA QEQRQAWQEA KRQAERNGDE AGVSRAEHEI KEADEALDEL DLFRKNLQSF VRSYEFLSQI VDYDDVELEQ LCVYAKHLHP LLRVDRLDEE AIDLSELALT HYRLTKHQEQ RLQLEARDEA GDYNLQPVSE VGSGKPHAPE KKPLAEIIER LNDLFGAEVD EQDKLNFAQG VADRIERDEA VMAEVQRNNP QQLMHGQFPE RVADIVLDAM HDHEKLSMEI LDDKEKGRDF ALLILQLLSM RGEPASRRV
|
| |