Gene Hhal_0236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0236 
Symbol 
ID4709927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp270988 
End bp272064 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content59% 
IMG OID639854696 
Producthypothetical protein 
Protein accessionYP_001001832 
Protein GI121997045 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000273252 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAACA AGACCAAGAG CCTGGTCCTC TCCGGGGCGG CGGTCGCCCT GGCCACGCCG 
ATGTCCTCGG CGCTGGCGCT GGACGACGTC CTGAGCTTCG GGGCGTGGGT GAACTACAAC
TACAACGTTG ATGACGACGC CAGCGAGGAC CGCTTCGGCG ATCTCGACTT CGAGTCCTTC
AACATCTACG CCAACCACGA GCACGGCGAC TGGTTCCTCG ACTCGGAAGT CCGGTTGGGT
AAGGGCAGTT TCCAGGGTTC CGGCATCAGC GAGCAAGGAT CTGAAGCCAC AGCAGACCGC
AATGTCATCG GCATTAAGGA ACTGGCCATC GGCCGCCACT ACGGCGAAGA GTGGACCATG
ACCATTGGTA AGACCACGGT CCCGTTCACC TACAGCCGTT TTAACTTCTG GCCGGGCGCC
CGCAACATGG CCGGCTTCGA CGACCAGGAC GGGGTCGGTA TCCGGTTCGA TAACGACCCG
GTCAACACCC CGTTCGACAT GAGCCTCATG TTCGTCAAGA GCCAGAACTT CGGCAACGAG
ACCACCTCGC TTGACGACGG GCAGCGGACT TTCTGGGGAA CCGATGACAC TTACCACGTC
ATGAACACCC TGGTGGGTGA CTTCGGTTTC ACCACGGGTG ATTTCCGCCA CGGTGTCTCG
GTGCAGGCAG GTCAGCTGGC CGACCAAGAT GATACCGACG AGATCGAGGG CCACTACGCG
GCCGGTCTGT ACTCCGAGGG CACCGTTGGC GCCCTGGACC TCTCCGCGCA GTTCGTGCAC
TACGACCTCG ACGAAGTCGA TGGCGACGTA ACTCAAGGCT CCGGCCAGAA AGCCATGGTC
AACGTGGGCA CCGACGTAGG CAGCTGGTAC ACCTACAGCG ACCTCTCCAT GAGTATGCCC
GACAGCGATA TAGCAGATGA TGACCAGATC GACCTGGTCC TTGGTGGTCG CTACAACTAC
GGCCCGGGTA ACATCTACGT CGAGGTGCTG CTTGAAAACC TCACCGATGA TGAAGATGTC
GAGACGAACG ATGAGTTCTC TCAGAGCATC GACCTGACCA TGGACTACTA CTTCTAA
 
Protein sequence
MNNKTKSLVL SGAAVALATP MSSALALDDV LSFGAWVNYN YNVDDDASED RFGDLDFESF 
NIYANHEHGD WFLDSEVRLG KGSFQGSGIS EQGSEATADR NVIGIKELAI GRHYGEEWTM
TIGKTTVPFT YSRFNFWPGA RNMAGFDDQD GVGIRFDNDP VNTPFDMSLM FVKSQNFGNE
TTSLDDGQRT FWGTDDTYHV MNTLVGDFGF TTGDFRHGVS VQAGQLADQD DTDEIEGHYA
AGLYSEGTVG ALDLSAQFVH YDLDEVDGDV TQGSGQKAMV NVGTDVGSWY TYSDLSMSMP
DSDIADDDQI DLVLGGRYNY GPGNIYVEVL LENLTDDEDV ETNDEFSQSI DLTMDYYF