Gene Hhal_1427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1427 
Symbol 
ID4710325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1540818 
End bp1541828 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content66% 
IMG OID639855894 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001002996 
Protein GI121998209 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02394] RNA polymerase sigma factor RpoS
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.818946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCTCG ATACGGCGGT TGTCAGCGTG CACGCGTACG CTGATGTGGA CGCAGAGCGC 
GATCCCCCGA TGATCGATGA CCCCGGCGAG CTGATCCAGC CCTACGGCAT CGATGAGGGC
AGCGAGCCCG AGCCGGCTCG CCGGCCCCGT AGGAGCCGAT CCAGCACGGA GTGTCGCACC
GCGCTGGACG CCACGCAGCT CTACCTGAAT GAGATCGGCC ACGCCTCCCT GCTCACGGCC
GAGGAGGAGG TCGCGCTGGC GAGACGCGTT CAGCAGGGCG ATGCGGCAGC CCGGGCGCGC
ATGATCGAGA GCAACCTCCG GCTGGTGGTC AAGATCGCCC GGCGCTACAT GAACCGCGGC
CTCGCCTTCC TGGACCTCAT CGAAGAGGGC AACCTGGGGT TGATCCGCGC CGTGGAGAAG
TTTGATCCCG AGCGCGGGTT CCGTTTCTCG ACCTATGCGA CCTGGTGGAT CCGGCAGACC
ATCGAGCGCG GCATCATGAA TCAGACACGC ACCATCCGGC TGCCGATCCA CGTCATCAAA
GAGATCAACC AGTACCTGCG GACCCAGCGC CGCCTGACCC AGACACTGGA TCACGAACCC
ACGGTCGATG AGATCGCCGA TGCTATGGGG CGTTCGCCGG AAGACGTCCG ACGCATGCGC
GGTCTCAACG AGGGGACCAC CTCGGTGGAT GTGCCGATCG GCAAGGATTC CGACCGGGTG
CTACTCGACG CCATTCCCGA CGAGACGCAG GGCATGCCGG AGAATGTCCT CGAGGACGAC
GACGTCGTGC GGCATCTGCA AGACTGGCTC GGCTTCCTCA CTGACAAGCA GCGGGCCGTG
CTGGAGCGGC GTTTCGGGCT CAACGGCCAC GAGCGCTACA CGCTCGAGCA GGTGGGCACG
CAGGTGGGGG TGACCCGCGA GCGGGTACGA CAGATCCAGA TCGATGCACT GCGGCGGCTG
CGCGAGCTCA TGGAGCGCGA TGGCTACTCC CAGGAGGCGG TCTTCGGCTA G
 
Protein sequence
MTLDTAVVSV HAYADVDAER DPPMIDDPGE LIQPYGIDEG SEPEPARRPR RSRSSTECRT 
ALDATQLYLN EIGHASLLTA EEEVALARRV QQGDAAARAR MIESNLRLVV KIARRYMNRG
LAFLDLIEEG NLGLIRAVEK FDPERGFRFS TYATWWIRQT IERGIMNQTR TIRLPIHVIK
EINQYLRTQR RLTQTLDHEP TVDEIADAMG RSPEDVRRMR GLNEGTTSVD VPIGKDSDRV
LLDAIPDETQ GMPENVLEDD DVVRHLQDWL GFLTDKQRAV LERRFGLNGH ERYTLEQVGT
QVGVTRERVR QIQIDALRRL RELMERDGYS QEAVFG