Gene Hhal_1153 details


Gene Information

Locus tag: Hhal_1153
Symbol:
ID: 4710143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1256434
End bp: 1257723
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 55%
IMG OID: 639855627
Product: restriction modification system DNA specificity subunit
Protein accession: YP_001002731
Protein GI: 121997944
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
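
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 1256434
END_BP = 1257723


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints: 1290 429
```

Under these assumptions the computed values agree with the reported Gene Length (1290 bp) and Protein Length (429 aa).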


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.487263
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTCC CGGCGTATCC CGAGTACAAG GATTCTGGGG TCGAGTGGCT GGGGGAGGTG 
CCGGAGCATT GGTCCGTCAG TGCGCTGAAG CGTGTTGCGC GTCTCGAAAG TGGCGACGCG
ATAAGCAGCG ATCACATCAG TGAAGAGGGG GAGTATGCCG TTTACGGCGG GAATGGCATA
AGGGGTTTTT CATCTGGATA CACTCACGAC GGTTTTTACC CTTTGATTGG GCGCCAAGGA
GCTCTTTGCG GTAACGTCAA TTACGCGAAA GGAAGGTTCT GGGCATCTGA GCATGCGGTT
GTTGTTTGGC CTGGAAGACA AATTGACGGT TTTTGGCTCG GTGAGCTTCT TCGCTCAATG
AATCTTAATC AATATGCGAC ATCGGCTGCG CAACCGGGTT TGTCGGTTGA GACTATTGAA
AATCTTTATG TTCCTGTTCC GCCGGATGAA GAGCAACAAA AGATAGCGGA GCTCCTCGAC
CACGAAACCG CCCGTATCGA CGCCCTGATC GAGGAGCAGC AGCGCCTGAT CGAGCTGCTC
AAGGAGAAGC GCCAGGCGGT GATCTCCCAT GCCGTCACCA AAGGCCTCGA CCCCGATGTG
CCGATGAAGG ACTCCGGCGT GGAGTGGTTG GGGGAAGTGC CGGCGCATTG GGATGTCGTG
AAGTTCGTCC GGTGTGCAAA AATTGCTGAG GGTCAGGTTG ATCCAAAGCA GGAGCCATAT
AGGAGCATGA TGCTTGTTGC TCCAAATCAC ATTGAGTCAG GGACTGGACG ACTCATGGCT
CGTGAGACTG CAGAAGAGCA GGGGGCAGAG AGTGGCAAGT ATTATTGCTA TGCTGGCGAC
GTAATATACA GCAAGATTCG ACCGTCATTG AGAAAAGCAT GTGTAGCCTA CGAAGATTGC
CTATGCAGCG CTGATATGTA TCCTCTCAGG GCGCAAAGTG GGGTGTATGG CGATTATCTG
CGCTGGACGA TTCTGTCTGA ATCGTTCTCG ACGCTAGCTT TTCTGGAATC AGAGCGCGTG
GCGATGCCGA AAGTCAATCG GGAGTCGATT GAAGAGATTC GAATCCCTAT GCCGCCACCG
GAAGAGCAGC TACAGATATC CCGTACCCTC GAAAAAGAAA CGGCCCGCAT CGACGCGTTG
ATGGAGGAGG CTGAATCGGG TATCCAGTTG CTCCAAGAAC GCCGCTCCGC CCTGATCTCC
GCCGCCGTCA CCGGCAAGAT CGACGTGCGT GACTGGGCGC CGCCGGCCGC TGCCGAACCG
GAGCAGGAAC GCGAAGGAGC GGCGCTATGA
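
The GC content field can be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene sequence; how the database rounds the value is not stated in the record, and `gc_percent` is an illustrative name.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(dna.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # Paste the full gene sequence here; only a short toy string is shown.
    print(gc_percent("ATGC"))  # prints: 50.0
```

Applied to the full 1290 bp sequence, the result can be compared with the 55% reported in the record.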
 
Protein sequence
MSFPAYPEYK DSGVEWLGEV PEHWSVSALK RVARLESGDA ISSDHISEEG EYAVYGGNGI 
RGFSSGYTHD GFYPLIGRQG ALCGNVNYAK GRFWASEHAV VVWPGRQIDG FWLGELLRSM
NLNQYATSAA QPGLSVETIE NLYVPVPPDE EQQKIAELLD HETARIDALI EEQQRLIELL
KEKRQAVISH AVTKGLDPDV PMKDSGVEWL GEVPAHWDVV KFVRCAKIAE GQVDPKQEPY
RSMMLVAPNH IESGTGRLMA RETAEEQGAE SGKYYCYAGD VIYSKIRPSL RKACVAYEDC
LCSADMYPLR AQSGVYGDYL RWTILSESFS TLAFLESERV AMPKVNRESI EEIRIPMPPP
EEQLQISRTL EKETARIDAL MEEAESGIQL LQERRSALIS AAVTGKIDVR DWAPPAAAEP
EQEREGAAL
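
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code only in the start codons it permits, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in TCAG order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cleaned), 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 60 bases of the gene sequence in this record.
    first_line = "ATGAGTTTCC CGGCGTATCC CGAGTACAAG GATTCTGGGG TCGAGTGGCT GGGGGAGGTG"
    print(translate(first_line))  # prints: MSFPAYPEYKDSGVEWLGEV
```

The output matches the first 20 residues of the protein sequence, and the final line of the gene sequence translates to EQEREGAAL followed by the TGA stop codon.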