Gene Hhal_1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1101 
Symbol 
ID4709935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1192926 
End bp1194734 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content67% 
IMG OID639855572 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001002679 
Protein GI121997892 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.248126 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACAGG ATCAACAGTC GCAAATCAAG CAGCTCATCG CCAAGGGCAA AGAGCAGGGC 
TTCCTAACCT ACGCAGAGGT TAACGACCAC CTCCCGGACG ATATCGTCGA CCCGGATCAA
ATCGACGATA TCATCGGGAT GATCAACGAC ATGGGGATCA ACGTCCACGA GACGGCCCCG
GACTCGGACG AACTGCTGCT CGCCGAAACC ACTGTCGCCA CAGACGAGGA CGAGGCCGAG
GAGGCCGCCG CTGCCCTCGC CGCAGTGGAC GCGGAGTTCG GCCGCACCAC GGACCCGGTG
CGCATGTACA TGCGCGAGAT GGGCAGTGTC GAGCTGCTCA CGCGCGAGGG CGAGATCCAG
CTGGCCAAGC GCATCGAGGA CGGGCTCGAC CGCGCCCTGG CCGCGCTGTC CTCTTACCCG
GAGGCAGCCC GCCAGCTGAT CATGCTCTAC GATCGCGCCC AGGAGGGCGA GGCCCGGCTA
ACCGATATCG TCGCCGGATT CCGGGATCGC GAAGACGACT CCGAGCAACC GCCGGAGGCC
CCTGCCGCCC CGGAGGCCGC TGCGGACGAG GACGAGGAGG CGACCAGCGC CGAGACCGGC
CCCGACCCGG AGGCCGTGGC CGAGCTCTTC GAGCGTCTGC GCGCCGCCTA CAACGAGATG
CAGGAGGTCC TGGCCACTGA GGGCTCGGCG TCCCCTCGCA TTGCCGAGCT GCGCGGCGAA
CTGGAAGAGG TCTTCCTCAG CATCAAGTTC ACCCCCAAGG TGGTCGACGC GGTGGCCGAC
CGGCTGCGCG GCACGGTGGA TACGATCCGC GCCCGTGAGC GCGAGATCAT GAACCTGTGC
ACCCGCGAGG GGCGCATGGC GCGCAAGGAG TTCGTCAAGA GCTTCCAGGA CCGCGAGACC
GACCCCACCT GGCTGGACGA CCTGCTGGCC GAAGGCGAGG AGCGTGCCGA GCGGCTGCAG
CCCCACGCCG AGGCGATCCG CAAGGCGCAG TCAGAACTCA AGGAGATCGC CGACAAGAAC
GGCCTCTCGG TGGCGGAGAT CAAGGAGATC AACCGCCGCA TGTCCATCGG CGAGGCCAAG
GCCCGCCGCG CCAAGAAGGA GATGGTCGAG GCCAACCTGC GCCTGGTGAT CTCCATCGCC
AAGAAGTACA CCAACCGCGG GCTGCAGTTC CTCGACCTGA TCCAGGAAGG CAACATCGGC
CTGATGAAGG CGGTGGACAA GTTCGAGTAC CGGCGCGGCT ACAAGTTCTC GACCTACGCC
ACCTGGTGGA TCCGGCAGGC GATCACCCGC TCCATCGCCG ACCAGGCGCG GACCATCCGC
ATCCCGGTGC ACATGATCGA AACGATCAAC AAGCTGAACC GGGTGTCGCG GCAGATGCTC
CAGGAGATGG GCCGCGAGGC CACCCCCGAA GAGCTGGCCG AGCGCATGGA GATGCCCGAG
GACAAGGTGC GCAAGGTCCT CAAGATCGCC AAGGAGCCGA TCTCCATGGA GACGCCGATC
GGCGACGACG AGGACAGCCA CTTGGGCGAC TTCATCGAGG ACACCAGCGT CACCTCGCCG
GTGGACTCGG CCACCTCGGA GGGACTGCGC GAGTCGGTCC GCGAGGTGCT CTCGGGGCTG
ACCCCGCGCG AGGCCAAGGT CCTGCGCATG CGCTTCGGCA TCGACATGAA CACCGACCAC
ACCCTGGAGG AGGTCGGCAA GCAGTTCGAC GTCACCCGTG AGCGGATCCG GCAGATCGAG
GCCAAGGCGC TGCGCAAGCT GCGCCACCCC ACCCGCTCCG ACGGGCTGCG CAGCTTCCTC
GAAGAGTAA
 
Protein sequence
MSQDQQSQIK QLIAKGKEQG FLTYAEVNDH LPDDIVDPDQ IDDIIGMIND MGINVHETAP 
DSDELLLAET TVATDEDEAE EAAAALAAVD AEFGRTTDPV RMYMREMGSV ELLTREGEIQ
LAKRIEDGLD RALAALSSYP EAARQLIMLY DRAQEGEARL TDIVAGFRDR EDDSEQPPEA
PAAPEAAADE DEEATSAETG PDPEAVAELF ERLRAAYNEM QEVLATEGSA SPRIAELRGE
LEEVFLSIKF TPKVVDAVAD RLRGTVDTIR AREREIMNLC TREGRMARKE FVKSFQDRET
DPTWLDDLLA EGEERAERLQ PHAEAIRKAQ SELKEIADKN GLSVAEIKEI NRRMSIGEAK
ARRAKKEMVE ANLRLVISIA KKYTNRGLQF LDLIQEGNIG LMKAVDKFEY RRGYKFSTYA
TWWIRQAITR SIADQARTIR IPVHMIETIN KLNRVSRQML QEMGREATPE ELAERMEMPE
DKVRKVLKIA KEPISMETPI GDDEDSHLGD FIEDTSVTSP VDSATSEGLR ESVREVLSGL
TPREAKVLRM RFGIDMNTDH TLEEVGKQFD VTRERIRQIE AKALRKLRHP TRSDGLRSFL
EE