Gene Hhal_2136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2136 
Symbol 
ID4709740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2335468 
End bp2338893 
Gene Length3426 bp 
Protein Length1141 aa 
Translation table11 
GC content74% 
IMG OID639856610 
Producthypothetical protein 
Protein accessionYP_001003702 
Protein GI121998915 
COG category[S] Function unknown 
COG ID[COG3164] Predicted membrane protein 
TIGRFAM ID[TIGR02099] conserved hypothetical protein TIGR02099 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCCTT CGATGCGCCG CTTCGGCCTG CGCGTGCTGC GTGGCCTGCG TCCGGTGCTC 
CTGGTCGCGC TGGCCGGCGC GGTCGCCCTG CGTCTGCTCG TCTGGCTGGC GCCCCCACTG
ACATCCCCGG TTGAACGGCT GGCCAGCCAG GCCCTTGAGG CTCCGGTGAC GGTGGAGCGC
GCCGAACTCG CCTGGCGGGG GGTATGGCCG GGGATCGAGC TGCACGGTGT GCGGGTTGGT
GACGAGGCGT CCATCACCCT GGAGCGGGCC ACCCTGACGC CGGCTCCGCT GGAGAGCCTG
CGTGCCGGGG CGCCACGCTG GGCCGCCGCC TCGGCGGCGG GCGTGCGCGC TACCCTGGTG
GCGGATCGGC ACGGCTGGTC GCTGCCGGGG TTGACGCCCG GCGGGGAGGG TGGCGAGCCC
CGGCTGGATC TGGATCAGAT GCCCCGCCGC TTGGCGGTGG CTGACGCTGC GCTGGAGTTG
ATCCCCCGCC GCGGTGCGGC GCTGGAGAGC GCCCGCTTCG ATATCGCCGG CCGGCGCAGT
GCCGAGGCCC TGCAGTTGGG GCTGCGCAGT GTTGAGGGGA GTGCCGGGCT GGGCCGGGGG
GTGCAGCTCG CTGCGCGACT GCCCCGGGGG GATCCCGGCT CGGCCACCGT TTACGCGGCC
CTCGACGAGG TTGCGGTGGC GCCGTGGGCG GCGGCCTTGG CCGAGACCTG GGCGGCCGGG
ACGCTCGACG GCCAGCTCTG GCTGGAGTGG GCGGACGGTC GGGCGGATGC CGCCCACGGC
GCCCTGCGGG CCAACGCCCT GGGCATGGAG CCGCGACGCC CGGCACCGGG GTTCGACAAC
CTGTCGGCGA CGGCCCGTTT CGCGGGCCGC GCGTGGGTGG CCGAGCTCGA TGCCGCGGAG
ACCGCGGTCG ATCTGCCCTG GCTGTTTGCC GAGGCCATCG CCGTGGAGCA GGCCGAGGCC
CAGCTCGTCG GCAGCTGGCA GCCCGGCGGG CGCTGGTCGC TCCGCCTGCC GCGGGTCGTC
GCCGAGAACA CGGATACCCG GGCCCGCGGC CGCGGGTCGA TCACCGGGGG CGATGGCGAC
ACGCCGCGGC TGTTCATTCG CGCCAGCGCC GATGAGGCCC CTGTTGAACG GGTACCGGCC
TACGTCCCCA CTGGGATCAC GCCGCCGGCG GTGGTGGAGT GGCTCGAGGA TGGCCTGCTC
GACGGCACCG CCCACGGGAT CGAGGTGCTG TTCTTCGGCC GCCCCGACCG CTTCCCCTTC
GACGGCGGCG AGGGGGTCTT CGACGTCCGC GCCAGGGTGA GCGACGCCCA GCTGCGCTAC
GACCGTAGCT GGCCTCGGCT GACCGGGATC GACGCGGGCC TGCGCTTCCA CAACGCCAGC
ATGACCATCA CCGGACGCGG TGTGACCAAC GGCGTGGACC TGCGCGACGT GGCAGTGGGC
ATCGACGATC TGCGCGAACC CATCCTGACC GTCCGCGGCC GCGGCGAGGG GGATGCCGCC
GGCGGTCTGG GCTTCCTCGC CGGCTCGCCG CTGGGGCGCG ACTGGATCGG GGATCCGGTG
CCGCTGTACG CCAGTGGCCC GATGAGCCTC GACCTGGACT TCGCCCTGCC GCTGGAGATG
GCCCACCCCG AGACCCTGCT GCTCGATGGG CGCGTCCAGC TTGACGGCGT CGAGGCTGGG
ATCGACTCCT GGGTGCGTAC CGAGGCCCTG GTCGGTGCCC TGGGCTTCGA TGCCCGGGGG
GTGCACAGCG CGGAGGGGCT ACACGGTCGC TGGGGCGGGG AGCCGGTGAC CATCGGGGTC
ACCACCGCCT CGGTGGGGGG GCAGTCGCGC ATCCTCTTGG ATGCCGCCGG ACACGGCTCC
CCGGGTAGCC TGCTGGAGGG GGTCGCGGAC GATCCGGCCT GGCTCAGCGG GGCGGCCCCC
TGGGCGGTCC GGGCTCGCCT GCCGGCCTTT CAGCCGCATC TGCAGCCGCC GGAGGTCAGC
CTGCGGGTGC GCTCGGGCCT CCAGGGGGTG GGGCTGGATC TGCCGGCGCC GTTCGGTGTC
CAGCCCGACG AGCGTCGCGA TGTGCAGGTG GATGTGGGAT TCTCGGCCCG CGGCCTGGAG
TCGTACTGGT TGCACTACGG TGACGAGTTG CTGCGCGCCG GGGCCGCGGC CGACGAAGAC
GGCCTGCCGC AGGCCCTGGC GATGCGCCTC GGGGCCGGCG CGCTGCGGCT GCCGAGCCGG
GGGAGCGTCA TCGAGGGCCG ACTGGAGCAG CTCGATGCCG ACGTCTGGCG CCGCTGGTTG
ATCGCGCGCC TGGGTGAAGG TCTGTCCCGG ATGGAGGCCT CGGCGTCGCA CTGGTTGCCG
CCCCTGCCGC TGCGCGGTCA TCTGCGTATC GGCGATCTGA CCCTGGGGGG GCGGGGCTAC
GGCGATCAGA CTCTGGCGGC GGACTGGCCC GCCGATCAAC CGGGTCAGCT GGCGCTGCGG
GGGGACCTGG TCAGCGGGCA GGTGACCTGG GACGAGGCGC TGCAGAGGGT CCGGGCCGAT
CTCGATCACC TGGATCTGCC GGTGCCGCGG CGGGCACCGG ACGCCGGGCC GGCGCTGGCC
GATCCCCGGC CGCGCATGGG GACGCCCGCG GCGGCGGCCT GGCCCGAGGT GGACGTCGAC
ATCGCCAGTG TGCGTCTCGC CGGTCGTGTC GCCGGTGTCG GGCGGGTGCG TCTGCGCCCC
CGTGGGGAGG TGCTATACCT GGACGAGGCC AGTATCCGCG GGCCGACGCT CCACTTGGTT
ACCAGCGGGG AGTGGCGCTC CACCGGTACG GCGCTGCGCG GGCGGCTGCG CAGCGGCGAT
GCGGGGGATC TGTTCGGGCT GCTGCAGGCG CCGCGCGCGG TCACGGTGGC CGATGTCGAC
ATCTCCGCCG AACTCGGCTG GCCGCAGGCC CCCTGGGGGG TGGAGCTGGC TGACCTGGTT
GGCGCCGTGC GGGTGCGCAT GAACGACGGA CGCATCACCG ATGTGGACCC GGGCGCTGGC
CGCCTGGTGG GTCTGCTGGG GCTGCGCATG CTGCCGCGGC GGATCCTGCT CGACTTCGCC
GACCTCTCCG GGGAGGGGTT CGCCTTTGAT CGGATCAGCG GACGGATCAC CGCCGCCGGC
GGTTACGCCC ACGTCGACGA CCTGCGCATC GCGGGTCCGG CCGCGCGCGT TACCATATCG
GGGAACATGG ATCTGACGCA GCGTGTCTAC GACAACCACG TCGTTGTCGA GCCGCGTCTG
GGGGCAACGC TGCCGCTGCT TGGGGCGCTG CTCGGTGGCG GCGTGGGTGC CGCCGGCGGC
TTTCTGGCCG ACCAATTGCT CGGTCAGGGT GTGGATCGGG CAGCGGCGGT GCGCTATCGC
GTGGCGGGGC CGTGGCACGA GCCCCGCGTC ATGAGGCTTG GAGTCGAGGA TGCAACACAG
CGCTAA
 
Protein sequence
MAPSMRRFGL RVLRGLRPVL LVALAGAVAL RLLVWLAPPL TSPVERLASQ ALEAPVTVER 
AELAWRGVWP GIELHGVRVG DEASITLERA TLTPAPLESL RAGAPRWAAA SAAGVRATLV
ADRHGWSLPG LTPGGEGGEP RLDLDQMPRR LAVADAALEL IPRRGAALES ARFDIAGRRS
AEALQLGLRS VEGSAGLGRG VQLAARLPRG DPGSATVYAA LDEVAVAPWA AALAETWAAG
TLDGQLWLEW ADGRADAAHG ALRANALGME PRRPAPGFDN LSATARFAGR AWVAELDAAE
TAVDLPWLFA EAIAVEQAEA QLVGSWQPGG RWSLRLPRVV AENTDTRARG RGSITGGDGD
TPRLFIRASA DEAPVERVPA YVPTGITPPA VVEWLEDGLL DGTAHGIEVL FFGRPDRFPF
DGGEGVFDVR ARVSDAQLRY DRSWPRLTGI DAGLRFHNAS MTITGRGVTN GVDLRDVAVG
IDDLREPILT VRGRGEGDAA GGLGFLAGSP LGRDWIGDPV PLYASGPMSL DLDFALPLEM
AHPETLLLDG RVQLDGVEAG IDSWVRTEAL VGALGFDARG VHSAEGLHGR WGGEPVTIGV
TTASVGGQSR ILLDAAGHGS PGSLLEGVAD DPAWLSGAAP WAVRARLPAF QPHLQPPEVS
LRVRSGLQGV GLDLPAPFGV QPDERRDVQV DVGFSARGLE SYWLHYGDEL LRAGAAADED
GLPQALAMRL GAGALRLPSR GSVIEGRLEQ LDADVWRRWL IARLGEGLSR MEASASHWLP
PLPLRGHLRI GDLTLGGRGY GDQTLAADWP ADQPGQLALR GDLVSGQVTW DEALQRVRAD
LDHLDLPVPR RAPDAGPALA DPRPRMGTPA AAAWPEVDVD IASVRLAGRV AGVGRVRLRP
RGEVLYLDEA SIRGPTLHLV TSGEWRSTGT ALRGRLRSGD AGDLFGLLQA PRAVTVADVD
ISAELGWPQA PWGVELADLV GAVRVRMNDG RITDVDPGAG RLVGLLGLRM LPRRILLDFA
DLSGEGFAFD RISGRITAAG GYAHVDDLRI AGPAARVTIS GNMDLTQRVY DNHVVVEPRL
GATLPLLGAL LGGGVGAAGG FLADQLLGQG VDRAAAVRYR VAGPWHEPRV MRLGVEDATQ
R