Gene Hhal_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0038 
Symbol 
ID4710927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp40313 
End bp41302 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content68% 
IMG OID639854496 
Productsigma E regulatory protein, MucB/RseB 
Protein accessionYP_001001635 
Protein GI121996848 
COG category[T] Signal transduction mechanisms 
COG ID[COG3026] Negative regulator of sigma E activity 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.622863 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGATCC CTGACCGTGC GTGGTTCCTC CGCGCACTCC TGTGCGCCGG CCTTTTCTGG 
ATCGCCAACG GCCAGGTGGC CCTGGCGGAG GGTACGGATC GGGGCGCGGG CGAAGAGGGC
CGGGAGGCCC TCAACCGGAT GGTCGAGGCC CTGGATCAGC ACTCCTATGA GGGGATCTTT
GTCTACGCCC GGGCCGGGGT GGTGGAGACC GTGCACATCG TGCACAGAGC CGGTGATTCG
GGGCGCCATC AGCGGCTGGA GATGCTCACC GGCTCCCATC GGGAGATGAT CCGCACCCCG
GAGGGCGCCC TGCTTGCCGG CCCCATTGCT AAGGGCGATC GCCTGCGTGG CAGTGGCGTG
GTCACCTCCC TCGGCGAGGG GTGGCCCCGG GCGGACAGCA TCTCCGAAGA CCTCTACCGC
ATCCAGCTCC AAGGCGAGGG ACGGGTCGCC GGCCGCCCGA CTACGGTCAT CGCCGTCGAA
CCACTGGATC AGTTGCGTTA CGGTCACCGG CTGTGGGTCG ACGAGGAGAG CGGCCTACCG
CTTCATGCCC AGTTGCTCGA CGGGCGCCGC GTCATCGAAC GCCTGCTGTT CACGAGCTTC
ACCCTTCGTG ACGATATCGA CGCCGAAGAG TTGGAGCCCG TTGGTGAGGC GCAGCGGCGG
GTCGAGCGGC GCCTGGCCGA GGAGAGCGAC GACAGCGGCG AGCCGGAGTG GGAAGTGATC
GATGTACCGG CGGGTTTCTC ACGCATCGCA CACGGTCGTA TTCCCTCACC CGAGCCGGGG
CAGGACCCCA TCGAGCACCT GCTGTTTAGC GATGGCCTGG CCTCCGTCTC CGTCTATATC
GCCCCAGAGC CGGGCGGTAA CGAGGGCCAG GCGCGGGCCG GCGCCCTCCA TGCCTACGAG
CGCCCCGTGG AGGACCATCA GGTCACCGCC ATCGGCGATG TCCCGGCCAA GACCGTGGAA
CGGTTCGCGA GGCAGACGCG ACGGCGCTAG
 
Protein sequence
MGIPDRAWFL RALLCAGLFW IANGQVALAE GTDRGAGEEG REALNRMVEA LDQHSYEGIF 
VYARAGVVET VHIVHRAGDS GRHQRLEMLT GSHREMIRTP EGALLAGPIA KGDRLRGSGV
VTSLGEGWPR ADSISEDLYR IQLQGEGRVA GRPTTVIAVE PLDQLRYGHR LWVDEESGLP
LHAQLLDGRR VIERLLFTSF TLRDDIDAEE LEPVGEAQRR VERRLAEESD DSGEPEWEVI
DVPAGFSRIA HGRIPSPEPG QDPIEHLLFS DGLASVSVYI APEPGGNEGQ ARAGALHAYE
RPVEDHQVTA IGDVPAKTVE RFARQTRRR