Gene Hhal_1150 details

Gene Information

Locus tag: Hhal_1150
Symbol:
ID: 4710140
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1250925
End bp: 1252103
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 64%
IMG OID: 639855624
Product: hypothetical protein
Protein accession: YP_001002728
Protein GI: 121997941
COG category:
COG ID:
TIGRFAM ID:
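
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The page does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1250925, 1252103)
print(length)                     # 1179
print(protein_length_aa(length))  # 392
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.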


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.732796
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCCCAACT ACGCCGATAT GCTCCTCAGG CACGAGGAGA GCAGCGGCTT TATCAAGGCT 
CGCAGGGCCC CTCTGGGGTT TACCGACGGC CGTGATGAGG CGTGGTTACG TGACCTTTTG
GCCGACAACC CGGATGTTCT GCCGATCGAG GAGATCGACC CGTCCTTTGC GCCGTTAGCC
CCCCTCTGCA CCGAGCTTGA GACCGAGGCG GGTCCGGTAG ACGCCGCCTT TATCAATCCC
TCCGGCCGGC TGACCCTGGT GGAGTGCAAG CTCTGGCGCA ACCCGGAAGC GCGGCGAAAG
GTGATCGCGC AGATCCTCGA TTACGCCCGG GCGATTGCCC AATGGGACTA TGCCGACCTC
CAGCGCCGGG TGGCCTCCGC CTCCGGAGAC AAGGCGAATC GACCGTTTGA GGCCGCCCGG
CAGCTGCAAA CGGACCTGGA CGAGGCCGTC TTCGTCGATG CAACGGCCCG CGCCCTGCGA
GAGGGGCGAT TCCTGTTGCT GATCGCAGGG GATGGTATCC GTGAAGGGGT CAGCGGGATG
ACCGACCTGA TCAGCCGCAA TGCGGCCCTT GGCTTCAGCT TTGGCCTCGT CGAGGTCGCC
CTGTATCAGT TCGGCGAACA GGGGCTCGCG GTCCAGCCGC GTGTCATCGC CAAGACCCAC
ACGATCGAGC GAACCTTTGT GGTCATGCAA GGTCCCAATG GCGCCGTCCT TCAAGAGGGT
GAGGGCGACG CGGACCAACC GTCGCAATCG CGCCCTACCG AGGACGAGGT GGCTTGGTGG
GAGCCGTTAA CCCGAATCGC CTTCAATGAC CCCGAGCAGG AGCCACCGGT CTACCGCCCA
CGCAACCACG TGAGGGTCGC CATGCCGTCA ACCGGGATGT GGGTGACGGC GTTTCGTGCC
ATGAGCCATG GCATCTGCGG CGTATTTCTG GGCGGTAGGA AGCCCGAGCG CCTCGAGGTG
CTCGACGCCC TGAACGAGGA GCGCGAGCAG ATCCTAAGCG AACTCCCTGA GGGGACGCAC
CAGGGCATGG ACGGCACTGA AGAGCGCCCC GGATTTGCCA TCTACGCCCA GCTCGACGAC
TTCGCCAGCG ACGAGGCGTG CCGCGCCTGG CTGTCGGAGC AGCTTAATCG GTTCGTGAAT
GCGTTTCGTC CGCGGCTCAA GCGTGTGGAG AAACGCTGA
 
Protein sequence
MPNYADMLLR HEESSGFIKA RRAPLGFTDG RDEAWLRDLL ADNPDVLPIE EIDPSFAPLA 
PLCTELETEA GPVDAAFINP SGRLTLVECK LWRNPEARRK VIAQILDYAR AIAQWDYADL
QRRVASASGD KANRPFEAAR QLQTDLDEAV FVDATARALR EGRFLLLIAG DGIREGVSGM
TDLISRNAAL GFSFGLVEVA LYQFGEQGLA VQPRVIAKTH TIERTFVVMQ GPNGAVLQEG
EGDADQPSQS RPTEDEVAWW EPLTRIAFND PEQEPPVYRP RNHVRVAMPS TGMWVTAFRA
MSHGICGVFL GGRKPERLEV LDALNEEREQ ILSELPEGTH QGMDGTEERP GFAIYAQLDD
FASDEACRAW LSEQLNRFVN AFRPRLKRVE KR
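
The sequences above are laid out in space-separated blocks of 10 residues, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above, used as sample input.
first_line = "ATGCCCAACT ACGCCGATAT GCTCCTCAGG CACGAGGAGA GCAGCGGCTT TATCAAGGCT"
seq = clean_sequence(first_line)
print(len(seq))   # 60
print(seq[:3])    # ATG
print(round(gc_fraction(seq), 2))
```

Applying the same two helpers to the full gene sequence would allow a comparison against the Gene Length and GC content fields, assuming the reported GC content is a rounded percentage over the whole gene.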