Gene Hhal_1770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1770 
Symbol 
ID4710967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1943243 
End bp1945198 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content64% 
IMG OID639856240 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_001003336 
Protein GI121998549 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.292571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGACA TGGCCAAGAA TTTGATCCTT TGGGTCATCA TCGCCGTCGT GCTGATGTCC 
GTATTCAGCA ACTTCCAGGA ACAATCGGCG GATGTGACCG AGAAGGTTCC CTACTCGGAG
TTCCTCAACG AGGTCGAGCG GGGGAACATC CGCGAGGTGC TGATCCGGGG TGAGGAGATC
ACCATCCAGC ACGCCGACGG CAACGAGTAC CGGACCTTCA ACCCGGAGGT CGACAACCGC
GCGCTGATCG GCGAACTGCT TGAGCACGGC GTCACCATCG ATGCCGAGCA GGCTGAAAGC
GACAGCATGC TCATGCAGAT CCTCATCTCG TGGACGCCCT TCCTGCTGCT GATCGCCGTG
TGGATCTACT TCATGCGCCA GATGCAGGGA GGCGGCGGTG GCCGCGGGGC GATGTCCTTT
GGCAAGAGCA AGGCCAAGAT GATGACCGAG GAGCAGAGCA AGCACAGCTT CTCGGACGTG
GCCGGTTGCG ATGAGGCCAA GGAGGACGTC AAGGAACTGG TGGACTTCCT GCGTGACCCG
AGCAAATTCC AAAAGCTTGG CGGGACGATC CCGCGGGGCG TGCTCATGGT GGGTCCTCCG
GGAACCGGCA AGACCCTGCT CGCCAAGGCG ATCGCCGGGG AGGCCCGGGT GCCGTTCTTC
TCGATCTCCG GTTCGGACTT CGTCGAGATG TTCGTCGGTG TCGGCGCCTC GCGGGTTCGC
GACATGTTCC AGCAGGCGAA GAAGCAGGCC CCGTGCATCA TCTTCATCGA CGAGCTCGAC
GCCGTGGGGC GGCAGCGTGG GGCCGGTCTC GGCGGTGGGC ACGATGAGCG CGAGCAGACG
CTGAACCAGA TGCTCGTCGA GATGGATGGA TTCGAGGGCA GTGAAGGGAT CATCGTCATC
GCTGCCACCA ACCGTCCCGA CGTCCTCGAC CCGGCGCTGC TGCGTCCGGG GCGCTTCGAC
CGTCAGGTGG TGGTGCCGCT GCCCGATGTC CGAGGGCGTG AGCAGATCCT CAACGTGCAC
ATGCGCAAGG TACCGACGGC GGACGATGTC CGGCCCGAGA TCATCGCCCG CGGGACGCCG
GGCTTCTCCG GCGCCGACCT GCAGAACCTG GTCAATGAGG CAGCGCTGTT CGCGGCCCGA
GCCAACAAGG AGGCCGTCGA TCAGACGGAC TTCGAGCAGG CCAAGGACAA GATCATGATG
GGCTCCGAGC GCAAGTCCAT GGTGATGAAA GAGGACGAGA AGAAGCTCAC GGCCTACCAC
GAGGCCGGGC ACGCCATCGT CGGCTTGCTC ACCCCGGAGC ACGATCCGGT TCACAAGGTG
ACGATCATCC CGCGGGGGCG CGCGCTGGGC GTGACCATGT TCCTTCCTGA GGAGGATCGC
TACAGCTACA CCAAGCAGCG CCTGGACAGC ATGATCGCCA GCCTCTTCGG TGGGCGGATT
GCCGAGGAGC TGATCTTCGG CAACGACCGG GTCACTACCG GTGCCCAGAA CGACATCCAG
CGGGCCACCG AGATTGCCCG CAACATGGTC ACCAAGTGGG GGCTTTCGGC GCGGCTCGGT
CCGCTCGCCT ACGGCGAGGA GGAGGGCGAG GTGTTCCTCG GCCGCTCCAT GGCGCAGCAG
AAGGACGTCT CCGACGAGAC GCAGCACGCC ATCGACGAAG AAGTGCGCGC AGTGATCGAC
AACAACTACA CTGCGGCTGA GAAGATCCTC CAGGAGAACC TGGAGAAGCT GCACCTGATG
GCTGATGCGC TGATGAAGTA CGAGACCATC GACCGCGATC AGATCGACGA CATCATGCGG
GGCGACGAGC CGCGACCGCC CAAAGGGTGG CAGGATCGGG ATCACGGTGG TGGCTCGGGC
GACGAGGGTG AGACTGCCGG GGCCGATGAC CAGCCCGAGG CCGAAGGTAA AGACGGCCGC
GAGGGGCCCA TCGGCGGACC TGTAGGCGAG CACTGA
 
Protein sequence
MSDMAKNLIL WVIIAVVLMS VFSNFQEQSA DVTEKVPYSE FLNEVERGNI REVLIRGEEI 
TIQHADGNEY RTFNPEVDNR ALIGELLEHG VTIDAEQAES DSMLMQILIS WTPFLLLIAV
WIYFMRQMQG GGGGRGAMSF GKSKAKMMTE EQSKHSFSDV AGCDEAKEDV KELVDFLRDP
SKFQKLGGTI PRGVLMVGPP GTGKTLLAKA IAGEARVPFF SISGSDFVEM FVGVGASRVR
DMFQQAKKQA PCIIFIDELD AVGRQRGAGL GGGHDEREQT LNQMLVEMDG FEGSEGIIVI
AATNRPDVLD PALLRPGRFD RQVVVPLPDV RGREQILNVH MRKVPTADDV RPEIIARGTP
GFSGADLQNL VNEAALFAAR ANKEAVDQTD FEQAKDKIMM GSERKSMVMK EDEKKLTAYH
EAGHAIVGLL TPEHDPVHKV TIIPRGRALG VTMFLPEEDR YSYTKQRLDS MIASLFGGRI
AEELIFGNDR VTTGAQNDIQ RATEIARNMV TKWGLSARLG PLAYGEEEGE VFLGRSMAQQ
KDVSDETQHA IDEEVRAVID NNYTAAEKIL QENLEKLHLM ADALMKYETI DRDQIDDIMR
GDEPRPPKGW QDRDHGGGSG DEGETAGADD QPEAEGKDGR EGPIGGPVGE H