Gene Hhal_0304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0304 
Symbol 
ID4711236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp341697 
End bp343325 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content66% 
IMG OID639854764 
Producthypothetical protein 
Protein accessionYP_001001900 
Protein GI121997113 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACTG AGCCCGGCCG CGCGGCCGCC CCGCTGGTTC TGGCCGCTGC CGTTCTTGGC 
GGCACCGCCG GCTACATCCC CCTGGTCGCA GCGGAGGAGC GCATCGGCAT CGTCCTCGCC
AACCGCGGCG ACCCGGTGCT GATCCGCGAC GGCGAAGAAC AGCCCGTGGG GCGCCGCGAC
GAGATCCGCG CCGGCGACCG GCTCAAGACG GACGATCGCT CGCGGCTGCA GATGCGGCTC
AACGACGGCC AGACCCTGAG CCTGACCGAG AACACCGAGC TGCACATCGA GGCCTACCAC
TTCGACGAAG ACAGCGGTGC CGGGGAGAGC CGCAAGAGCC TGATCGAGGG CGGCCTGCGC
GCCATCACCG GGCAGATCAG TGGCGAGGAC GATTACACCA TCGAGACCGA GGTGGCCACC
ATCGGCATCC GCGGGACCAT CTTCGAACTC GCCCACAGCG ACGGCATCAC CGCCGGCGGC
ACGCCCCGCG GTCGCGGCTA TGCCGAGAAC CGCGGTGATA ACCGGCAACG GATCAACACC
GGCGACGACG CCCGCACCGA CTACTACCGG GTGGTGGACG CCGACCTGCC GCCCGAGGCG
CTGCCCGAGC GGCCGGCGGA GCTGGCGGCG CTCGATGAGG CCGCGCCTGA TACGCCGGAG
GACTCGGAAG CGACGGACGA GAACGGTTCC CAGCCATCCG AAGACGACGC CGAGGAGACC
TCAACCGACG AGCAGCAGGC CTCGGAAGAG ACGGAAGGCG ACATGGACAC GCCGGAGGAC
AGCGATCTCC CGGCGACCGT CGATGCACCG GTGGAGCCGG CAACCGATGA CGGGATCGCG
GTGGCGCAGC AAGAAACCGC AGCCAAGGAG GCCGAAGACG ACACCGACGA CATCGATGAC
ACCGTAGAGC TGGACCACGA CCTACTCGTC GACATCAACC CCGACGCCGC GGGCGTCGTC
CCCGAGCCCG GTGAATCCTT CGTCTACATC CGCGGCGACG CCCGGGGGGC CGACGATCCG
CTCTTCTTCC GCACTCCGAC ACTCGAGGAG CGGGAGGAGA TGGCCCGTGA AGGGGACTGG
GATTTCCACG CGGACGACGA GCAGGAGGAG TTCGGTCCCG CGACCGGGGA GTGGGGATAC
TGGTTTGAAA CCGATAGCGA CGAGATCGGG GGGTTCTGGA TACAGGGGGA GAGCTACGTC
GATGAGGTCG ATCTCGAGCC GGAGGAAGAA CTCCCCTTTT ATGGCGGGTG GGTGGAGGCG
GGTTGGTCTC GCGGCCTCCT TGGCGACCAA GGTGCTGCTT TTGAGGATGC GCGGATCACC
ATTGCTGAGG ACCAAGCCGT CACACTTGAG CATTTCGAGA TCCGGCAGTA CCGACCCGAC
CACGAGAATC ACCTGATCTG GCGCACCGAC AGAAAGGACA CAAACCTAGG AGCGCTCGCA
GAAGGCATCA ACGTCTCCGG CGACATCCTG ATCCTTGGGG AGTCGGCCGA TGGCTTCTCC
GGCGACCTCG CCGCCATGCT CTCGCAGATG GACGAGACCC TGGAGCTTTT GGGTGAGTTC
GACTTCCAGG CGGATGACAA CGAAGACTGG GCGGTGGACG GCGTCTTTTT CCTGGAGCTG
GACGACTGA
 
Protein sequence
MTTEPGRAAA PLVLAAAVLG GTAGYIPLVA AEERIGIVLA NRGDPVLIRD GEEQPVGRRD 
EIRAGDRLKT DDRSRLQMRL NDGQTLSLTE NTELHIEAYH FDEDSGAGES RKSLIEGGLR
AITGQISGED DYTIETEVAT IGIRGTIFEL AHSDGITAGG TPRGRGYAEN RGDNRQRINT
GDDARTDYYR VVDADLPPEA LPERPAELAA LDEAAPDTPE DSEATDENGS QPSEDDAEET
STDEQQASEE TEGDMDTPED SDLPATVDAP VEPATDDGIA VAQQETAAKE AEDDTDDIDD
TVELDHDLLV DINPDAAGVV PEPGESFVYI RGDARGADDP LFFRTPTLEE REEMAREGDW
DFHADDEQEE FGPATGEWGY WFETDSDEIG GFWIQGESYV DEVDLEPEEE LPFYGGWVEA
GWSRGLLGDQ GAAFEDARIT IAEDQAVTLE HFEIRQYRPD HENHLIWRTD RKDTNLGALA
EGINVSGDIL ILGESADGFS GDLAAMLSQM DETLELLGEF DFQADDNEDW AVDGVFFLEL
DD