Gene Hhal_0489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0489 
Symbol 
ID4710720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp557137 
End bp558627 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content66% 
IMG OID639854947 
Producthypothetical protein 
Protein accessionYP_001002078 
Protein GI121997291 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACGAT ACCACGCCCA CCGCCGCCGG CCGGCCCTGC TGGCCACCGG CCTGGGCAGT 
GCGACGCTGC TGCTCGGCAG CGCCCACGCC CAGTCGCCCG CTGACGACCC CGCCGTGGAG
CCGGGTCCGG AGCCCACGGT CAGCGCCGAT CCGGGGCCGA TGCAACGGGC CAGCGCCTCC
AGCGAACAGA TCGGCGCCGG CACGCTGATG AACCCGTCGA TCTCGGTCAT CTTCGACGGC
GTCTACGGCA ACGAGTTCTC CGGCCACGTC GGCGACCCCG GCGGCTTCGG CATGGGCCAC
AGCCACGGGC ATGGCCACGG CCACGATCAC GCCCACGGCA TTGAGGACGG CTTCCAGCTG
CGCGAGACGG AGTTCGCCTT CGAGGCCTCG GTGGACCCGT ACTTCGACGC CTTCGCCATG
CTGGTGGTCG AGGGCACCGA CCACATCGAC CTGGAGGAGG CGTACTTCAC CACCCGCGCC
CTGCCCTGGG GCCTGCAGGT GAAGGCCGGG CGCTTCCTCT CGGATATCGG CTACATCAAC
AGCCAGCACC CCCACGAGTG GGACTTCGTG GACCGCCCGC TGGTCAGCGA ACACCTCTTC
GGCGACCACG GCATCCAGGA GACCGGCGTG CAACTCAACT GGCTGGCACC GACCCGGACC
TACCTGAAGT TCGGCGCCGA GATCCTGGAG GGGGAGACGA GCGGGATTGC GGCCTATGAG
GGCGAGACAA GCACACGGCC CGGTTGGATC GATGGAGACG GCGCGCCGGA GCGCCACAGG
ACGGAAGAAT TGGATCTCCC CTTCTCCGAC TCAACGGGAC CTCGGCTTGC CACGCTCTTC
GCCAAGTGGG GGCCCGACCT GGGCTTCAAC CACGCCGCGC AGTTTGGCGC CTCGGCTGGA
TACGCCAGCG CGTGGCAACG GATGGAAGAG CACAGCGAAG GCCTTCGCGT CGAAGCCTGG
GATGGGGACG CCTGGTTTGC TGGCCTCGAT GCCGTCTACA AGTTCGATCC GCCAGGTAGC
TACCAGGGCG CCGGCCAACT GACGCTCCAA GGCGAGTACT TCTACAGGAA CATCGATTCC
GATTTCTATT ACTACAACCA CGACGATGCC AACAACTGGG AACGAGAGAC TGCAGATGCC
GACGGGACCT CGGGCAGTTT CAAACAGGAT GGTCTCTACG TGCAGGCCGT CTATGGCATT
GCCCCGCGCT GGCGCTCCGG CATCCGCGCC GAGGCGCTGG GCCTGCTCGA GAACCAGGCG
TGGCATGATC GGGATGACGG CAACGGCTAC ACGGACCTGG ACACCTCCTA CCGCTACTCC
GCCAATGTGA CCTTCTACCC GTCGCACTTC TCCTACATCC GGGCGCAGGT GAACTATTCG
GACTTTGCTG ACGGCACGCC GGACGACCCG GATACGCACG ACGAGGACGC CTGGCAGGTC
ATGCTCCAGT ACAACCTGAG CCTCGGCGCC CACGGCGCCC ATCCGTTCTG A
 
Protein sequence
MPRYHAHRRR PALLATGLGS ATLLLGSAHA QSPADDPAVE PGPEPTVSAD PGPMQRASAS 
SEQIGAGTLM NPSISVIFDG VYGNEFSGHV GDPGGFGMGH SHGHGHGHDH AHGIEDGFQL
RETEFAFEAS VDPYFDAFAM LVVEGTDHID LEEAYFTTRA LPWGLQVKAG RFLSDIGYIN
SQHPHEWDFV DRPLVSEHLF GDHGIQETGV QLNWLAPTRT YLKFGAEILE GETSGIAAYE
GETSTRPGWI DGDGAPERHR TEELDLPFSD STGPRLATLF AKWGPDLGFN HAAQFGASAG
YASAWQRMEE HSEGLRVEAW DGDAWFAGLD AVYKFDPPGS YQGAGQLTLQ GEYFYRNIDS
DFYYYNHDDA NNWERETADA DGTSGSFKQD GLYVQAVYGI APRWRSGIRA EALGLLENQA
WHDRDDGNGY TDLDTSYRYS ANVTFYPSHF SYIRAQVNYS DFADGTPDDP DTHDEDAWQV
MLQYNLSLGA HGAHPF