Gene Hhal_1451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1451 
Symbol 
ID4710289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1563625 
End bp1564929 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content70% 
IMG OID639855918 
Producthypothetical protein 
Protein accessionYP_001003020 
Protein GI121998233 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.92705 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAGG GTCGCCCGAA ATACTCGCCG GAACTTCGGC GCAAGCTGGT CGAACTGGTC 
GAGGCGGGAC ACACACCGCA GGAACTGGCG CGCACCTACG AGCCAGCGGC CAAGACGATC
CGCCGCTGGT ATCAGGAGGA CCGACAAGAG GCCAGCGAGC GAGGCAGCGG AAGCGCCAGC
ACGGGCCGCG CCGGCAAGGC AGGGAAAAAG CGTGGAGGAG CCACAGGCGC CTCGGCGTCC
TCGGGCCGTT CCGGGGAGCA GGGGGGCGCG GCGAAGACCC GGGGTAAGCC GGGGAGTAGC
AAGGGGCGCC GGCTCGCGGG CAAGCGGGTC CTGGAGAGCC CGGCGCCGGA GGCCGATGCC
GGAGGGGCGA CAGCGGAGTC CATCGAGGCC GAACAACCTG AGGAGGCCCT CCCTTCGGTC
ACCGGCGAGG TGATCGTCGA ACCCGAGCCC GAGGTCGTCG TCGATGAGCA GCGCGATCCC
GCGGATGAAG CGGAGGGCGC AGCCGACGCG GAGATGGATC CGGAGGAGCT GCTCGAGATC
GGCCTCGAGC GTGCCGACCG GGGAGACGCG GTGTACGACT CGGAGGGCCC CGAGGCGGCA
CTGCCGCTCT ATCAGGAGGC GCTCGATCTC CTCGATGCGG CGATCAGTGC CGGGGTCGGA
GGGGATGCTG CGCGGCGCGA TTGGGGCATT ACCCTGGAGC GCTTCGGTGA TGCCATCTTC
GAGGTCGACG GAGCGGGTGC CGCCCGGCCC TGCTACGAGG CGTGGTGCGA TCTGGCCGAG
GCCCTGGCCA CCGAGCATGC CACGGCCCAG TCGCTGCGGG ACTGGAGCGT GGCGCTCGGT
CGGTACGGCC AGGTGGTGCT GGTCGAGAAA GGGCCCGAGG CGGCGCTACC CTACTATCAG
CAGGTGGTTG AACTGCGTGA GGACATCGTC CGCGAGCGGC AGAGCGCGGA CGCCTACCAG
GACTGGGCGC TGGCCCTGGA GCGCTATGGC GCCGTGCAGG AGGCGGTGGA AGGGCTGGCC
GAGACGGTTG AGACCTACCG CCATGCCCTC GAGGTCCGCG AGGCTCTCGC CGAGCAGCAC
GATACGCCGC AGGCCCGTCG GGCCCTAGGG GTGGCCCAGG AGCGCCTGGC GATGGCGGTA
CTTGCTGCCG ACGGAGCCGA GGCCGCCCTG CCGTACTTCG AACAACTGGC GGAGCTCTTC
ACCGAGCTGG CCGAGGAACT TGGTACCGAA GAGGCCGACG CCGAACGGCG GCAGGCCGAT
ACGATGCTCG GCAAGGTGCA GATGGCGGCG CTGGACCTAG AGTGA
 
Protein sequence
MAEGRPKYSP ELRRKLVELV EAGHTPQELA RTYEPAAKTI RRWYQEDRQE ASERGSGSAS 
TGRAGKAGKK RGGATGASAS SGRSGEQGGA AKTRGKPGSS KGRRLAGKRV LESPAPEADA
GGATAESIEA EQPEEALPSV TGEVIVEPEP EVVVDEQRDP ADEAEGAADA EMDPEELLEI
GLERADRGDA VYDSEGPEAA LPLYQEALDL LDAAISAGVG GDAARRDWGI TLERFGDAIF
EVDGAGAARP CYEAWCDLAE ALATEHATAQ SLRDWSVALG RYGQVVLVEK GPEAALPYYQ
QVVELREDIV RERQSADAYQ DWALALERYG AVQEAVEGLA ETVETYRHAL EVREALAEQH
DTPQARRALG VAQERLAMAV LAADGAEAAL PYFEQLAELF TELAEELGTE EADAERRQAD
TMLGKVQMAA LDLE