Gene Hhal_0697 details

Gene Information

Locus tag: Hhal_0697
Symbol:
ID: 4710727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 777802
End bp: 780720
Gene Length: 2919 bp
Protein Length: 972 aa
Translation table: 11
GC content: 66%
IMG OID: 639855160
Product: hypothetical protein
Protein accession: YP_001002281
Protein GI: 121997494
COG category:
COG ID:
TIGRFAM ID:
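
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the record itself: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(777802, 780720)
print(length)                  # 2919
print(protein_length(length))  # 972
```

Both values agree with the Gene Length and Protein Length fields of this record.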


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.418548
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGTGCGA CTGGGCGGTC TGTTGCGGTC GCGGTTTGGC TCCTGGCCGC GGCGGGGGCG 
TTCTGGTCGC CCAGCGCGCC TGCCCCGCTT TCTGTGGAGA GCCTCCAGGA AGAGGATGAT
GGCGAGGGGG TGGAGGATAT GGTGCTGCGC GTGGAGCTGG AAGGCTACCC CCTCTCCGAG
AGCGAGTATA TGCTCCAGCG CAAAGGGGAG ATCTACTACC CATTGGGCCA GCTCGCCCGC
CATCTGGAGC TGGCGGTCGA GGTCGACCCG CTTGAGGGGC AGGTCTCCGG CTGGGTCCTC
GATCCGGACC GGACGGTCTC CTTCGATACC GCCAGCGGCA TGGGCGAGGT CGGCGACCGC
CCGCTGATCC TTGAGGAAGA GGATTGGATC CAAGAGCACG ATGACATCTA CTTCCGTGAG
GATGCCATCC AGGAGTTCCT GCCCGTTGGC TTCGAATATC GCCCGCGCCG CGCTGAGGTC
CACATGCGTG CGGACGAGGA ATTGCCGATC CTTAGCCGTC TGGAGCGCGA GGCCCGGTGG
GCACGACTGA CCGATGACGG GGCGGGTATC GGCGGTGAGG AGTGGCTGCC CCGTGAACCG
CTGCCGGCCC GTTTCATGAC CCGTCCTGCA TGGTCCCTGA GCCTCAATCA CCAGCAAAGC
GGGAACGGTA GCAGCACCAG CGGTAATCTC GCAGGCGCTG GAGATCTCCT CTGGCATGAG
GCGGAGTGGC GGTTACAGGC GAATGACACC GACGGAATCC GGCGCTTCGA CGGGACCCTC
GGGCAGCAGG TCGGCCACCC GGCCCTGGAG CGCTACGAGC TGGGGCGGGT GCGCGCGCCC
CGGTACGACC TGATCCGGCG TGGCGGCTCC GGGATGGGGG TGACCCTGAC GAACCGACCG
GAGGGTATGG CGCGTACCAC GTTTACCGAT CAGTCCATCG AGGTGGAGGT TCCCGAGGGC
TGGGACGTCG AGCTCTATCG CGACGGCGAT CTGGTCGACT TTGCACACGA TGTGGACACG
GATCGCCACG AGTTCGAGGA CATTGACGCC CGGCCGGGCA CCAACCCTTA CACCATTGAG
TTCTACGGAC CCCACGGCGA GCGCGAGACC ATCTCGCGGA CGATTGAGAT CGGCCCGGGG
CTGCTGCCGC CGGGCATGGT GCACTACAGC CTGGATGCCA GCCGAGAGGG GGTCAGTCTC
TATGAGCACC AGCCGCGCGA TGACCAGGAC CGGCGCGCGG CGGCGCGGAT GGACGTCGGG
GTGACCGAGA CCCTGACGGT CGGCGGCGAC CTCCACTACA TGGAGCCGGA TGACGAGGAG
GACGAGCTGA CGCGCCGAGA GCTGGTCGGG GCGGACGCCT CGTTCAGTGC GTTCGGGGTT
TGGGGCCGTC TTCGCGGTGC CTTCGAGAAT GACCAAGGCC GGGCTTGGCA GCTCGCGCTG
GAGCGCGGTC TGGGCGCGTG GAGCCTGGAC TACGAGTACA CCCTGGTGGA TGGCCTGGAG
ACCGACGAGC TGCGTAGCCG CGTCAGCGGT GACTTGCGTC ACGGCCACGA GTTGCAAGTG
CGCGGCTCTT TGGGGCCGCT GCGCACCCGT CTGCGCTACG AGCATCAGGA GTCCGCGGAC
GGCGAGTCGA CTTGGCAACG CCTGCGCACC CGCGAAAACG TGAGCATCGG CGGTCAGCGG
CTCCGGCATT CGCTGACCGT GAGCCAGGCG GATGACGGCG ACCCTTCCGC GCAGGGCAGC
CTGCAGACAC GCTGGGGGCG TTATCGGGAG GGGCGGCTGA ACCTCGGTGT GAGCTATGGC
CTGGCGCCCG AGGCGGAGCT GGATAGCGCC AACGCCGAGT ACAGTCAGAG CCTTCATGAC
AACTGGCGGG GCAGTGCCCA GGTGCGCGGC AGCTTCAATG ACCGTCCTCA TAACCTCCGT
CTGGGCATTT CCCGTACTGC TCGCGAGTAC TGGCGCTTCT CGGGGCAAGG TCAGGTGGAT
ACGGGGGGTC GGTGGCAGGT GTCGGTCGGC CTGGATGTGG GAGGGCTGCC CCATCCTGCC
GGGGGGTGGT ACCCGGATCC CGAGGCCGGG CGCGGCTACG ATCACGGGGC GGTGATGGCA
AGGGTTCATC AGGATGGCGA GCCCGTGGAG GGGGTCGATG TCTGTGCAGG CCGCAGCTGC
GGGGGCACCG ATGAGAACGG GGAGGCCTGG GCCGCCCGTC TGGATCCCCA CGATCCGGTC
AACGTGGAGG TGGATGTGGG GTCGATCGAC AATCCCTTCG TCCAACCTGC AACCCGGGGG
GTAAGCTTCG AGCCGCGCCC GGGTCGGGTA CTCCCCCTGG ATTTCGAGCT GCACCTCACG
GGCGAGGTGG ATGGCGTGGT TCGTCGCCAG CGTGGCACGG CAGAGCCATC GGAGGTCTCC
GGTTTCGAGA TGGAGGCGGT GGATACCGAA ACCGGGGAGG TGGTCCAGAC GGATCGCAGT
GTCTTCGATG GTCTCTTCAT CCTGGATCAG CTGCCGCCGG GAGGGTATCT GGTGCGCGCT
TCGGAGGAGC AGGCAGAGCG CCTGGATGTC CCGCAGACCG CCACGGCGCA GCGCATCAGC
GTTGAGGGGG ATGGTGATCT GGTGAGTCAT CAGGATCTCA CGATCCATGA CGGTGGTGAG
ATCGTCCGGG TCGCTTCGCC GGCAGAGGCG GTCGAGCAGA TCACCTTCAG CTACGACTCG
AGCCACACCC TCCGCGATCA CCTGGCGCAA CTCCTTGATC GCCTGGGTGC GGAGCAAGTC
GAAGGTGGCC TGGAGTCGCT CTACGAGGAG GTGGTGATCC TCAACGGCGA GGTCGAAGAC
GGTGATCGTG TGATGGTCCC GGCGGATCGT TCGCCGCTGG ACGAGATCAC CCACTGGCAG
CATCAGGAAG AATTGGAACT GGCGGAAGAG GGTGACTGA
 
Protein sequence
MGATGRSVAV AVWLLAAAGA FWSPSAPAPL SVESLQEEDD GEGVEDMVLR VELEGYPLSE 
SEYMLQRKGE IYYPLGQLAR HLELAVEVDP LEGQVSGWVL DPDRTVSFDT ASGMGEVGDR
PLILEEEDWI QEHDDIYFRE DAIQEFLPVG FEYRPRRAEV HMRADEELPI LSRLEREARW
ARLTDDGAGI GGEEWLPREP LPARFMTRPA WSLSLNHQQS GNGSSTSGNL AGAGDLLWHE
AEWRLQANDT DGIRRFDGTL GQQVGHPALE RYELGRVRAP RYDLIRRGGS GMGVTLTNRP
EGMARTTFTD QSIEVEVPEG WDVELYRDGD LVDFAHDVDT DRHEFEDIDA RPGTNPYTIE
FYGPHGERET ISRTIEIGPG LLPPGMVHYS LDASREGVSL YEHQPRDDQD RRAAARMDVG
VTETLTVGGD LHYMEPDDEE DELTRRELVG ADASFSAFGV WGRLRGAFEN DQGRAWQLAL
ERGLGAWSLD YEYTLVDGLE TDELRSRVSG DLRHGHELQV RGSLGPLRTR LRYEHQESAD
GESTWQRLRT RENVSIGGQR LRHSLTVSQA DDGDPSAQGS LQTRWGRYRE GRLNLGVSYG
LAPEAELDSA NAEYSQSLHD NWRGSAQVRG SFNDRPHNLR LGISRTAREY WRFSGQGQVD
TGGRWQVSVG LDVGGLPHPA GGWYPDPEAG RGYDHGAVMA RVHQDGEPVE GVDVCAGRSC
GGTDENGEAW AARLDPHDPV NVEVDVGSID NPFVQPATRG VSFEPRPGRV LPLDFELHLT
GEVDGVVRRQ RGTAEPSEVS GFEMEAVDTE TGEVVQTDRS VFDGLFILDQ LPPGGYLVRA
SEEQAERLDV PQTATAQRIS VEGDGDLVSH QDLTIHDGGE IVRVASPAEA VEQITFSYDS
SHTLRDHLAQ LLDRLGAEQV EGGLESLYEE VVILNGEVED GDRVMVPADR SPLDEITHWQ
HQEELELAEE GD
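
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an illustrative example under stated assumptions, not the method used to produce the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon-to-residue mapping is the one used by translation table 11, and `START_CODONS` is a simplified subset of that table's initiation codons. A start codon such as GTG is read as M at the first position of a CDS, which is why the protein begins with M although the gene begins with GTG.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the same residue mapping as the
# standard code and differs only in which codons may initiate translation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified subset of the table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in tens and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "GTGGGTGCGA CTGGGCGGTC TGTTGCGGTC GCGGTTTGGC TCCTGGCCGC GGCGGGGGCG"
last_line = "CATCAGGAAG AATTGGAACT GGCGGAAGAG GGTGACTGA"

print(translate(first_line))                      # MGATGRSVAVAVWLLAAAGA
print(translate(last_line, is_cds_start=False))   # HQEELELAEEGD
```

The two excerpts are the first and last lines of the gene sequence above; each full line holds 60 bases, so every line starts in frame. Passing the complete gene sequence to `gc_fraction` would give the value to compare against the GC content field.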