Gene Hhal_1783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1783 
Symbol 
ID4710892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1956984 
End bp1958456 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content67% 
IMG OID639856253 
Producthypothetical protein 
Protein accessionYP_001003349 
Protein GI121998562 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1030] Membrane-bound serine protease (ClpP class) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGATTC GACGTCTGCT CGCCGTCCTT CTTGCCTACT TCCTCTCGGC GGTCCCGCTG 
GGTAGTGCCG AGACGCCTCC GGACGCCGGC GAAGATGCCG CCGGCACTGC GCTGATGCTT
GACGTCAAGG GGGCCATCGG TCCGGCCACC ACCGACTACA TCGTGCGCGG CCTGGCCGAG
GCTCAGGAAC GGGGCGCCGG TCTGGTCATC CTGCGCATGA ACACGCCCGG TGGTCTCGAC
GATGCCATGC GGGACATCAT CAGCGAGATC CTCGCCAGCG ACGTCCCTGT GGCCACCTAC
GTCGCGCCCA GCGGTTCCCG CGCGGCGAGT GCCGGGACCT ATATCCTCTA TGCCTCGCAC
GTGGCCGCCA TGGCGCCGGC GACCAACCTC GGTGCGGCCA CTCCGGTGCA GATCGGCGGG
GGAGGCGGTG GGATCTTCCC CGGTGGCGAT GACGAGGAGC AGGGAGGCGA CGCCCTGGAG
GAGCTCCGGG AGCGGCTCGG CGGCGAGGAG GAAGACGTCG ACGAGGAGGC GCGTGAGGAA
GACGAGGCGG TCGAGCAGGA AGAGCCGGCG GAAGAGCGGG CCGACCGCCC GGACGCGATG
GAGCGCAAGA TCATCGAGGA TGCGGTCTCC TACATCAAGG GGCTGGCCGA GCTGCGCGGG
CGCAATGCCG AGTGGGCCGA GAGGGCAGTG CGCGAATCGA TCAGCGCCTC GGCGAGCGAG
GCGGCGGAGC TGGGGGTGAT CGACTTCGTC GCCGAGGACG TGGATGAGCT CCTGGCCAAG
GCCGATGGCG TGGTGGTCAA GCTCCCCGGG GGCGAGCGGG CCATCGAGAG CGCCGGACTG
GAGGTGGATC TGGTCGAACC GGACTGGCGC AATCGGCTGC TTTCGGTGAT CACCAACCCG
AACGTGGCCT ACATCCTGAT GCTGGTGGGC ATCTACGGCA TCATCTTCGA ACTGATCAAT
CCCGGTTCCC TGGTGCCTGG TGTCCTCGGT GGCATCAGTC TGCTGCTGGC CCTGTACGCC
TTCCAGGCGT TGCCGATCAC CTACGCCGGC CTGGGGCTGA TCGGGCTGGG GATCGCCTTC
ATGATCGCCG AGGCGTTCAT GCCCAGTTTC GGGATCATGG GCATTGGCGG TGCCGTCGCC
TTCGTCCTCG GCTCGATCAT GCTGTTCGAC ACCGATCTTG AGGCCTTCCA GGTCTCGCTT
GGGGTGATCG CCGGGTTTAC CGTGGCCAGT CTGATCATCT TCATCGGCGT GGCGATGATG
GCCGCCCGGG CCTGGCAACG ACCCAAACTC GGCGGGGCTG ATGAACTCAT CGATGCCGAG
GCCATCGCCG AGGAGAGTTT CGAGGGGGCT GGCCACGTGC GTTACGCCGG CGAGCGCTGG
AATGCCGTGG CGGTGAGCCC GGTGCGTAGC GGCGAGCGGG TGCGCGTGGT CAGTAAGGAA
GGACTGACAC TGAAGGTGGA GCCCAATGAC TGA
 
Protein sequence
MWIRRLLAVL LAYFLSAVPL GSAETPPDAG EDAAGTALML DVKGAIGPAT TDYIVRGLAE 
AQERGAGLVI LRMNTPGGLD DAMRDIISEI LASDVPVATY VAPSGSRAAS AGTYILYASH
VAAMAPATNL GAATPVQIGG GGGGIFPGGD DEEQGGDALE ELRERLGGEE EDVDEEAREE
DEAVEQEEPA EERADRPDAM ERKIIEDAVS YIKGLAELRG RNAEWAERAV RESISASASE
AAELGVIDFV AEDVDELLAK ADGVVVKLPG GERAIESAGL EVDLVEPDWR NRLLSVITNP
NVAYILMLVG IYGIIFELIN PGSLVPGVLG GISLLLALYA FQALPITYAG LGLIGLGIAF
MIAEAFMPSF GIMGIGGAVA FVLGSIMLFD TDLEAFQVSL GVIAGFTVAS LIIFIGVAMM
AARAWQRPKL GGADELIDAE AIAEESFEGA GHVRYAGERW NAVAVSPVRS GERVRVVSKE
GLTLKVEPND