Gene Hhal_1717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1717 
Symbol 
ID4710221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1882469 
End bp1884874 
Gene Length2406 bp 
Protein Length801 aa 
Translation table11 
GC content68% 
IMG OID639856185 
Productpeptidase S16, lon domain-containing protein 
Protein accessionYP_001003283 
Protein GI121998496 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCAC CCGAGCCCCT TTCACTGGAG CGCCTGTACC GGGTCTGTGA TCCGGAGCAG 
CTCGGTTTTC GCACCACCGA GGAGCTGGCG GGCATGGACC GCCCACCCGG GCAGGAGCGG
GCCTTGGAGG CGATGGATCT GGGCGCGAAC ATGCGCGCCC CGGGCTTCAA CCTCTTCGTC
ATGGGCCCGG AGGGCGACGG CAAGCTGGAG ATGGTCCAGC GCCTACTGGC CGAACGCGCC
GCTCGCGAGC CGACGCCCTC GGACTGGTGC TACCTGAACA ACTTCGACGA GCCCACACAA
CCCCGCCTTC TGCGGCTACC CCCGGGCCAA GGGGCACGCT GGCGCCACGA TCTGGAGCAG
CTGATCGAGG AGCTGCGCAG CACCATCCCG GCCACCTTCG AGAGCGACGA GTACCAGAAC
CGGCTGCAGG AGCTGCAGCA GCAGCTCAAC CGCCGCCAGC GTGAGGCCTT CGAGACCATC
CAGAAGGAGG CCGAACAGTA CGACGTCACC CTGCTGCAGA CGCCCTCGGG ATTCAGCTTC
GCCCCGGTGA AGGATGGCGA GGTGATCGAG CCGGAACAGT TCCAGCAGCT ACCCGATGAG
GAGCGCAAGC GCTACCAGGA GGCCATCGAG TTCCTGCAGG AACGACTGCA GTCCGTGGTG
CAGCAGATCC CCAAGTGGCG CAAGGAGATC CAGCAGCAGG TCCGCAAGCT CAACGAGGAG
ATGACCCTGC TCGCGGTCGG TCAGCGCATC CAGGAGCTGC GCCAGCGCTA CGGCGAGCTA
CCGGTGGCGG CGGCCCATCT GGACGCCATC CGCAACGACA TCATCGAGCA CGTGGACGCC
TTCCGCTCCG GGGAGCAGGA CCACGTGGAG TACATCCTCG GCCGCTACCG GGCCAATCTA
TTACTCGCCC ACGATCCAGC CGACGGCGCC CCGGTGGTCT ACGAGGACAT GCCCACCCAC
CAGAGACTGG TGGGTCGAAC GGAGCACCAC GTCCATCAGG GCGCCCTGCT CACCGACTTC
AATCTGATCC GCCCTGGCTC GCTGCATCAG GCCAATGGCG GTTACCTGGT CGTGGACGCC
CACCGCATCC TCACCCAGCC ACTGGCCTGG CCGTCGCTCA AGCGCACCTT GTCTGCCGGA
GAGATCCGCA TCGAATCCCT GGAGCAGGTC CACGGCTTCT GGACCACCGT CACCCTGGAG
CCGGAGCCGA TGCCGCTGCG TACCAAGGTG GTGCTGCTCG GCGACCGGAT GGTCTACTAC
CTGCTCTCCG CCTACGACCC GGACTTCCCG GAACTGTTCA AGGTTGAGGC CGACCTGGAG
GACGACCTCC CCCGGGACAC CGAGACCCAG CAGCTCTACG CCCGCATGCT CGCCACCCTG
GTTCGGCAGC GCCGTCTGCG CCACCTGGAC CGCTTCGCCG TGGCGCGGGT GATCGAGCAC
GGCAGCCGCA TGGCCGATGA CAGCGAGCGG CTGGCCGCCG GCGGGCGGGC CATCACCGAT
TTACTGCAGG AGGCGGATCA CTACGCCACC GGCGACGGCG CCGAGATCAT CGGCCAGGAT
CACATCGAGC GCGCCCTCGC CGCCCAAGAG CGCCGCGCCG GGCGCATCCG CGATCGCAGC
CAGGAGACCA TCGAGCGCGG CACCCTGGTG ATCCACACCG AGGGGCACCA CACCGCCTCG
GTCAACGGGC TCTCGGTCCT GCAGCTGGGC GATTTCGGTT TCGGCCGTCC GACACGGATC
ACCGCCACCG CCCGCCCCGG GCGCGGGCAG CTGGTGGATA TCGAGCGCGA GGCCAAGCTC
GGTGGCAAGA TCCACTCCAA GGGGGTGATG ATCCTCTCGC GCTTTCTGGC CAGCCGCTTC
GCCCCGGAGG GCGACCTGTC GCTGTCGGCA AGCCTCGCCT TCGAGCAATC CTACGGCGGA
ATCGACGGCG ACAGCGCGTC GGTGGCCGAG CTCTGTGCAC TCTTCTCGGC CATCGGCCGC
GTCCCACTGG ATCACGGCAT CGCCGTCACC GGCTCGCTGA ACCAGCTTGG CGAGGTCCAG
GCCGTGGGCG GGGTGAACGA GAAGATTGAG GGCTTCTTCG AGGTCTGTCG GCGGCGCGGG
CTGACCGGGC AGCAGGGCGT GGCGCTGCCG GCAACCAACG TGCCGCATCT GATGCTGCGC
CAGGAGGTGC GCGATGCGGT GGCCGCCGGG CAGTTCCACA TTTACCCGCT GAGCCGCGTG
GACGAGGCCC TGGAGCTGCT CACCGGTCTA CCCGCCGGTG TCTGCGACGA CGCCGGCGAG
TACCCGCAGG GGTCGGTGAA CCGCGCTGTC GCCGACCGCC TGGTGCAGTT CGCCAAGAGC
CAGCGTCGCC GCGGCGACGG CGACGCCGGA GACACCCCGG ACACCGCGGA GGATGACGAT
GACTGA
 
Protein sequence
MAAPEPLSLE RLYRVCDPEQ LGFRTTEELA GMDRPPGQER ALEAMDLGAN MRAPGFNLFV 
MGPEGDGKLE MVQRLLAERA AREPTPSDWC YLNNFDEPTQ PRLLRLPPGQ GARWRHDLEQ
LIEELRSTIP ATFESDEYQN RLQELQQQLN RRQREAFETI QKEAEQYDVT LLQTPSGFSF
APVKDGEVIE PEQFQQLPDE ERKRYQEAIE FLQERLQSVV QQIPKWRKEI QQQVRKLNEE
MTLLAVGQRI QELRQRYGEL PVAAAHLDAI RNDIIEHVDA FRSGEQDHVE YILGRYRANL
LLAHDPADGA PVVYEDMPTH QRLVGRTEHH VHQGALLTDF NLIRPGSLHQ ANGGYLVVDA
HRILTQPLAW PSLKRTLSAG EIRIESLEQV HGFWTTVTLE PEPMPLRTKV VLLGDRMVYY
LLSAYDPDFP ELFKVEADLE DDLPRDTETQ QLYARMLATL VRQRRLRHLD RFAVARVIEH
GSRMADDSER LAAGGRAITD LLQEADHYAT GDGAEIIGQD HIERALAAQE RRAGRIRDRS
QETIERGTLV IHTEGHHTAS VNGLSVLQLG DFGFGRPTRI TATARPGRGQ LVDIEREAKL
GGKIHSKGVM ILSRFLASRF APEGDLSLSA SLAFEQSYGG IDGDSASVAE LCALFSAIGR
VPLDHGIAVT GSLNQLGEVQ AVGGVNEKIE GFFEVCRRRG LTGQQGVALP ATNVPHLMLR
QEVRDAVAAG QFHIYPLSRV DEALELLTGL PAGVCDDAGE YPQGSVNRAV ADRLVQFAKS
QRRRGDGDAG DTPDTAEDDD D