Gene Hhal_1151 details


Gene Information

Locus tag: Hhal_1151
Symbol:
ID: 4710141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1252134
End bp: 1255313
Gene Length: 3180 bp
Protein Length: 1059 aa
Translation table: 11
GC content: 67%
IMG OID: 639855625
Product: hypothetical protein
Protein accession: YP_001002729
Protein GI: 121997942
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
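
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 1252134
end_bp = 1255313

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (3180 bp) and Protein Length (1059 aa) fields.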


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.255903
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGATA GTCGCGAAGC ACCCTTCCAA CAGGACATCA TCGACGAGCT GCGCGCCGCT 
GGCTGGCTCG TCGGCGAGGC GGGCGGCTAC GACCGCACCC ACGCCCTGTA TCCTGAGGAT
CTGATTGGCT TCGTCCAGGA GGCCTACCCG GAGCGCTGGG AGAGGTTCAC CAAGAACAAC
CCGCAGCACC CCGAGCAGGC GCTGATCAAG GCGACCGTCC GGGAACTGGA GAAGCAGGGC
ACCCTCGATG TGCTGCGTCA CGGCTTCAAG GTCCCCGGCG TGCGCATCGC CACCTGCAGC
TTCCGGCCCG ACCACGGCAT GAACCCCGAG GCGCAGGCTC GCTACCGGGC CAACCGGCTG
CGCGTGGTCC CCGAGGTCTC CTACTCCCCC CACGCCCGCA CCGGAGAGTA CAACCCGCGG
CTCGATCTGG TGCTCTTCGT CAACGGCATC CCCACGGCGA CACTGGAGCT TAAGAGCGCC
TTCAAGCAGT CCGTTGAGCA GGCCAAGCGG CAGTACCGCA ACGATCGCCC CCCGAAGGAT
CCGGTCACCC GCAAGGAGGA GCCGCTGCTG AGCTTCAAGC GCGGCGCGCT GGTGCACTTC
GCCGTCAGTC AGAACGAGGT GGCCATGACC ACCCGCCTCG CCGGCGCCGA GACGACCTTC
CTCCCGTTCA ACCAGGGCAC ACCCGACGGC GGCGCCGGCA ACCCCCCGCC GCCCGACGCC
GACACCTACG CCACGAGCTA TCTCTGGCGC GAGGTCTTCC AGCCGGATGC CTGGCTGAAG
ATCATCGCCC GCTTCCTGCA CCTGGAGCGC AAGACCGAGG AAGAATTCGA CGGCAAGCGC
AAGACCCGGG AGAGTCTGAT CTTTCCCCGC TACCACCAGT GGGAGGCGGT CAATCGCCTG
CTGGCCGCCA CGGCCGAGGA GGGCGCGGGC CGGCGCTATC TAATCCAGCA CAGCGCCGGG
TCCGGCAAAT CCAACTCCAT CGCTTGGGTG GCCCACCAGC TGGCCGCGCT CTACGACGAC
GACGGCAACC GGTTGTTCAA CTCGGTGATC GTCGTCACCG ACCGCACCGT TCTCGACGAC
CAGCTCCAGC GCACGATCTA CCAGTTCGAG CACGCCCAGG GGGTCGTCCG ACCGATCACC
CGGGAGGTCG GTAACCAGAG CAAGTCCGAG CAGCTCGCCG AGGCCCTGGC CGAGCAGACG
CGGATCATCA TCGTCACCAT CCAGACCTTC CCGGCGCTTT TCGATGCCCT CGACGCCCGG
CCGCAGCTGG CCGAGGGGCG CTATGCGGTT ATCGCCGACG AGGCCCACTC CTCACAAAGC
GGCGCGGCAG CGAGCAAGCT CAAGACCATC CTCGGTGCCG ATGCCCCGGA GACCGACGAG
GTCAGCGCCG AGGAACTGCT CGATGCGGCG GTGGCAGCGC GCAAGCCCAC GGAGCGGATC
AGCTACTACG CCTTTACCGC CACGCCCAAG GGCAAGACCC TGGAGCTCTT CGGCCGGCCG
CCAGACCCGG AGCAAGGGCG GAGCAGCGAG AACCTGCCGC AGCCGTTCCA CGTATACTCC
ATGCGCCAGG CCATCGAGGA GGGATTCATC CTCGATGTGC TCAAGAACTA CACCACCTAC
CGCACCGCCT GGCGCCTGGC CCACCCGGAT GACCAGGCCT ATGAGGTCAA CTCGCGCAAG
GCCTCGGCCA AGCTCGCCCG GTGGGTGAAG CTCCACCCCT ACAACATCGG CCAGAAGGTG
GAGGTCATCG TCGAGCACTT CCGCACCCGG GTACGCCACC TGCTCAACGG CCAGGCGAAG
GCCATGGTGG TCACCGGCAG CCGGCAGGAG GCGGTGCGCT ACGCCCTGGC CCTGCGCCAA
CACGTGGAGG CGCAGGGTTA CAACGACGTG CACGCCCTGG TGGCCTTCTC CGGCAGCGTG
CCAGCGGATG ACACCATCCC CGAGGAGGTC ACCGAGCACA GCGCCCAGCT CAACCCCGGG
CTGAACGGCC GCGATCATGC CGAGGCGCTG GATACCGACG ATTACAACGT GATGATCGTC
GCCAACAAGT ACCAGACCGG CTTCGATCAG CCGAAGCTCT GCGCCATGTA CGTCGACAAG
AAGCTCCAGG GCGTGGACTG CGTGCAGACC CTCTCGCGGC TCAACCGGAT CTTCCCCGGC
AAGGAGACCT TCGTCCTCGA CTTCGTCAAC GACGCCGAGG AGATCCTCGC CGCCTTCCGG
CCCTACTACA ACAAGGCCGA GCTCGCCGAC GTCTCCGACG CCAACGTGGT CTTCGACCTG
CAGCGGACCC TGGATGCTGC CGGGGTCTAC CACTGGGAGG AGGTCGAGCA GTTCGCCCGC
GCCTTCTTCG ACCCCAAGGC CAAGAACGCG CAGCTCAGCA CTGCCTGTCA GCCGGCCAAG
GAGCGTTTTA CCCAGCGCTA TAAGGCGGCA CAGGAGCAGC GCCAGGCCTG GCAGGAGGCC
AAGCGCCAGG CCGAGCGCAA CGGGGACGAA GCGGGTGTCT CGCGGGCCGA ACACGAGATC
AAGGAGGCCG ACGAGGCCCT GGATGAGCTG GATCTCTTCC GCAAGAACCT GCAGAGCTTC
GTGCGCAGCT ACGAGTTCCT CTCGCAGATC GTCGACTACG ACGACGTGGA GCTTGAGCAG
CTCTGCGTCT ATGCCAAACA CCTCCACCCG CTGCTGCGCG TCGACCGGCT TGACGAGGAG
GCGATCGACC TCTCCGAACT GGCGCTGACC CACTACCGCC TGACCAAGCA CCAGGAGCAG
CGGCTGCAGC TTGAGGCCCG CGACGAGGCG GGGGATTACA ACCTCCAGCC GGTCAGCGAG
GTCGGCTCCG GCAAGCCGCA CGCCCCCGAG AAGAAACCCC TGGCGGAGAT CATCGAACGG
CTCAACGACC TCTTCGGCGC CGAGGTGGAC GAGCAGGACA AGCTCAACTT CGCCCAGGGC
GTGGCGGACC GGATCGAGCG CGACGAGGCG GTCATGGCCG AGGTCCAGCG CAACAACCCG
CAGCAGCTCA TGCACGGTCA ATTCCCCGAG CGGGTCGCCG ACATCGTGCT CGATGCCATG
CACGACCACG AGAAGCTGTC CATGGAGATC CTCGACGACA AGGAGAAGGG GCGCGACTTC
GCGCTGCTGA TCCTGCAGCT GCTGTCGATG CGGGGGGAGC CCGCGTCGCG GCGCGTGTGA
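
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the hypothetical helper `gc_fraction` below strips the block separators and returns the fraction of G and C bases. It is applied here only to the first line of the sequence; how the database derived and rounded its GC content field is not stated in the record, so this is an assumption about the method, not a reproduction of it.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G/C bases in a whitespace-blocked DNA string."""
    # Join the 10-base blocks (and any line breaks) into one contiguous string.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCTGATA GTCGCGAAGC ACCCTTCCAA CAGGACATCA TCGACGAGCT GCGCGCCGCT"
first_line_gc = gc_fraction(first_line)
print(f"{first_line_gc:.2%}")
```

Passing the full multi-line sequence to the same function would give the gene-wide value for comparison with the GC content field.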
 
Protein sequence
MADSREAPFQ QDIIDELRAA GWLVGEAGGY DRTHALYPED LIGFVQEAYP ERWERFTKNN 
PQHPEQALIK ATVRELEKQG TLDVLRHGFK VPGVRIATCS FRPDHGMNPE AQARYRANRL
RVVPEVSYSP HARTGEYNPR LDLVLFVNGI PTATLELKSA FKQSVEQAKR QYRNDRPPKD
PVTRKEEPLL SFKRGALVHF AVSQNEVAMT TRLAGAETTF LPFNQGTPDG GAGNPPPPDA
DTYATSYLWR EVFQPDAWLK IIARFLHLER KTEEEFDGKR KTRESLIFPR YHQWEAVNRL
LAATAEEGAG RRYLIQHSAG SGKSNSIAWV AHQLAALYDD DGNRLFNSVI VVTDRTVLDD
QLQRTIYQFE HAQGVVRPIT REVGNQSKSE QLAEALAEQT RIIIVTIQTF PALFDALDAR
PQLAEGRYAV IADEAHSSQS GAAASKLKTI LGADAPETDE VSAEELLDAA VAARKPTERI
SYYAFTATPK GKTLELFGRP PDPEQGRSSE NLPQPFHVYS MRQAIEEGFI LDVLKNYTTY
RTAWRLAHPD DQAYEVNSRK ASAKLARWVK LHPYNIGQKV EVIVEHFRTR VRHLLNGQAK
AMVVTGSRQE AVRYALALRQ HVEAQGYNDV HALVAFSGSV PADDTIPEEV TEHSAQLNPG
LNGRDHAEAL DTDDYNVMIV ANKYQTGFDQ PKLCAMYVDK KLQGVDCVQT LSRLNRIFPG
KETFVLDFVN DAEEILAAFR PYYNKAELAD VSDANVVFDL QRTLDAAGVY HWEEVEQFAR
AFFDPKAKNA QLSTACQPAK ERFTQRYKAA QEQRQAWQEA KRQAERNGDE AGVSRAEHEI
KEADEALDEL DLFRKNLQSF VRSYEFLSQI VDYDDVELEQ LCVYAKHLHP LLRVDRLDEE
AIDLSELALT HYRLTKHQEQ RLQLEARDEA GDYNLQPVSE VGSGKPHAPE KKPLAEIIER
LNDLFGAEVD EQDKLNFAQG VADRIERDEA VMAEVQRNNP QQLMHGQFPE RVADIVLDAM
HDHEKLSMEI LDDKEKGRDF ALLILQLLSM RGEPASRRV