Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0038 |
Symbol | |
ID | 4710927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 40313 |
End bp | 41302 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639854496 |
Product | sigma E regulatory protein, MucB/RseB |
Protein accession | YP_001001635 |
Protein GI | 121996848 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3026] Negative regulator of sigma E activity |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.622863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATCC CTGACCGTGC GTGGTTCCTC CGCGCACTCC TGTGCGCCGG CCTTTTCTGG ATCGCCAACG GCCAGGTGGC CCTGGCGGAG GGTACGGATC GGGGCGCGGG CGAAGAGGGC CGGGAGGCCC TCAACCGGAT GGTCGAGGCC CTGGATCAGC ACTCCTATGA GGGGATCTTT GTCTACGCCC GGGCCGGGGT GGTGGAGACC GTGCACATCG TGCACAGAGC CGGTGATTCG GGGCGCCATC AGCGGCTGGA GATGCTCACC GGCTCCCATC GGGAGATGAT CCGCACCCCG GAGGGCGCCC TGCTTGCCGG CCCCATTGCT AAGGGCGATC GCCTGCGTGG CAGTGGCGTG GTCACCTCCC TCGGCGAGGG GTGGCCCCGG GCGGACAGCA TCTCCGAAGA CCTCTACCGC ATCCAGCTCC AAGGCGAGGG ACGGGTCGCC GGCCGCCCGA CTACGGTCAT CGCCGTCGAA CCACTGGATC AGTTGCGTTA CGGTCACCGG CTGTGGGTCG ACGAGGAGAG CGGCCTACCG CTTCATGCCC AGTTGCTCGA CGGGCGCCGC GTCATCGAAC GCCTGCTGTT CACGAGCTTC ACCCTTCGTG ACGATATCGA CGCCGAAGAG TTGGAGCCCG TTGGTGAGGC GCAGCGGCGG GTCGAGCGGC GCCTGGCCGA GGAGAGCGAC GACAGCGGCG AGCCGGAGTG GGAAGTGATC GATGTACCGG CGGGTTTCTC ACGCATCGCA CACGGTCGTA TTCCCTCACC CGAGCCGGGG CAGGACCCCA TCGAGCACCT GCTGTTTAGC GATGGCCTGG CCTCCGTCTC CGTCTATATC GCCCCAGAGC CGGGCGGTAA CGAGGGCCAG GCGCGGGCCG GCGCCCTCCA TGCCTACGAG CGCCCCGTGG AGGACCATCA GGTCACCGCC ATCGGCGATG TCCCGGCCAA GACCGTGGAA CGGTTCGCGA GGCAGACGCG ACGGCGCTAG
|
Protein sequence | MGIPDRAWFL RALLCAGLFW IANGQVALAE GTDRGAGEEG REALNRMVEA LDQHSYEGIF VYARAGVVET VHIVHRAGDS GRHQRLEMLT GSHREMIRTP EGALLAGPIA KGDRLRGSGV VTSLGEGWPR ADSISEDLYR IQLQGEGRVA GRPTTVIAVE PLDQLRYGHR LWVDEESGLP LHAQLLDGRR VIERLLFTSF TLRDDIDAEE LEPVGEAQRR VERRLAEESD DSGEPEWEVI DVPAGFSRIA HGRIPSPEPG QDPIEHLLFS DGLASVSVYI APEPGGNEGQ ARAGALHAYE RPVEDHQVTA IGDVPAKTVE RFARQTRRR
|
| |