Gene Hhal_0190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0190 
Symbol 
ID4711038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp222667 
End bp223953 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content70% 
IMG OID639854648 
Producthypothetical protein 
Protein accessionYP_001001786 
Protein GI121996999 
COG category[S] Function unknown 
COG ID[COG2311] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.55784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTATC CTAGCGGACC CATGCACCAC CCGGGGCATA CAGAGCAGCC TATGGGCAGC 
GGCCAACGCC TGGAGAGCCT CGATGCCGTC CGCGGGGCGG CAGTGCTGGG TATCCTGGTC
ATCAACATCC AGCTCTTCGC CATGCCGGTG GCCGGGCTGT TGAGCCCGAC CCTGCTCGGC
GGTTTCGAGG GCTGGGACTA CGTCGTCTGG GCCATCGGCC ACGTCTTCTT CGAGGGCAAG
TTCATCGCCC TCTTCGCCGC CCTGTTCGGC GCCGGCGCCG TCCTGCTCGC CGAACGCCAC
CGGGCGGCCG GCATCAACCC CTGGGTGGTC CACCGCCGGC GCATGCTCGC CCTCGGCGCC
ATCGGCCTGG CCCACGGCAC CCTGTTGTGG ATGGGGGATA TCCTGTTCAT CTACGCCGCG
ATGGGGCTGC TCGCCTTCCT GTTCATCGAC CGCACGCCGC GTGCGCTCCT GGCCTGGGGG
GCCGCGCTCT ACGCCCTGCC CATCCTGCTG ACCATGGCCG CCGGCTGGGG GCTGACCCTG
CTGCCCACCG CCGGCTTCAT CAAGCTCGCC TCCGCCTCGC CGCCCGTCCA CGAGGAGATC
ACCGCGGCCA TCGAGGCCTA TCAGGGCGGA TGGCTGACGC AGATGGAGCA GCGCCTCCCC
GAGGCCCTCA CCCGCTATCT GGTCGGCACC CCGGCACGCC TGGGCTGGCT GACCCTCGGC
TGCATGCTGA TCGGCATGGC CGCGTACAAG AACGGCTTCC TCACCGGCGC CTGGAGCTCC
CGGGCCTACG CGCGGGTGGT GGGCTACGGC CTGGGCATCG GCGTCCCGAT GAGCATCGTC
GGGATCGCTT ACCGCGAGTG GCGCGACTGG GAATTGCTCA GCGGCTTCTT CTTCAGCACC
CAGCTCAATC AGCTGGCGGT GCCCTTCGTC GCCGCAGGGT GGGCCGCCCT GATCATCCTC
GCCTTCCAAC GCGGCTGGCT CGGACGCCTG CACTGGCCGT TGACCGCCGT AGGACGGACA
GCCTTGAGTG GTTACCTGCT ACAGTCGGTG CTGTGTACCC TGGTCTTCTA CGGCCACGGG
CTGGGGCTGT ACGGCGAGAT GGGGCGGCCG ACCCAGCTGC TGGTGGTACT CGGCGTCTGG
CTGGTCCTGC TGATCGCCGC ACCCCTGTGG CTGCGCGCCT TCCGCATGGG ACCGGCGGAA
TGGCTCCTCC GCCAGGCCAC ACAGCTGCCG AGACCGGCGC CACCATGCCC CCCGGTCCCC
CCGCCGCGCA GCCCCGACGC CGGCTGA
 
Protein sequence
MLYPSGPMHH PGHTEQPMGS GQRLESLDAV RGAAVLGILV INIQLFAMPV AGLLSPTLLG 
GFEGWDYVVW AIGHVFFEGK FIALFAALFG AGAVLLAERH RAAGINPWVV HRRRMLALGA
IGLAHGTLLW MGDILFIYAA MGLLAFLFID RTPRALLAWG AALYALPILL TMAAGWGLTL
LPTAGFIKLA SASPPVHEEI TAAIEAYQGG WLTQMEQRLP EALTRYLVGT PARLGWLTLG
CMLIGMAAYK NGFLTGAWSS RAYARVVGYG LGIGVPMSIV GIAYREWRDW ELLSGFFFST
QLNQLAVPFV AAGWAALIIL AFQRGWLGRL HWPLTAVGRT ALSGYLLQSV LCTLVFYGHG
LGLYGEMGRP TQLLVVLGVW LVLLIAAPLW LRAFRMGPAE WLLRQATQLP RPAPPCPPVP
PPRSPDAG