Gene Information |
Locus tag | Hneap_1944 |
Symbol | |
ID | 8535102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2079276 |
End bp | 2080307 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 646384325 |
Product | RNA polymerase, sigma 70 subunit, RpoD subfamily |
Protein accession | YP_003263813 |
Protein GI | 261856530 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02394] RNA polymerase sigma factor RpoS; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
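
The Start bp, End bp, Gene Length, and Protein Length fields above are consistent with one another, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Consistency check for the coordinate and length fields of this record.
# Assumptions (not stated in the record itself): coordinates are 1-based
# and inclusive, and the last codon is a stop codon excluded from the
# protein length.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2079276, 2080307)
print(length)                     # 1032
print(protein_length_aa(length))  # 343
```

Both values match the Gene Length (1032 bp) and Protein Length (343 aa) rows.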
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000945772 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAGTG TGACGCAATA CAAAAAGTCT GTCGGCAGCG ACGGCGTTCA GATTTCAACA CGTCAACGGA TAGCACCTGA TCCTGTGGAC GAAGAAGAAC TGCTTGCTGT GGATGATGTC GAGTTGGACG ACGATATTGA CTCTCCGGAG AAAAATGAGG AAGAACTGAC AACGACCTTT AGTGCCAGCG ATTCTGAGTC GGTTGATCCT GTTCGCTTGT ATCTTCGTGA TGTTGAAACC GAGCGCTTGC TGACGGCGGA AGAAGAGGTG ATTTATTCAC GGCGAGCGCA AAGCGGTGAT GAATCCGCAC GACAAAAAAT GATCGTCTGC AATTTGCGCC TGGTGATCAA GATTGCGCGA CGTTACATGA ATCGAGGCCT CGATTTGCTT GATTTGATCG AGGAAGGAAA CATCGGTTTG ATGAGAGCGG TCGAAAAGTT CGATCCCGAG CGCGGATTTC GTTTTTCCAC CTATGCCACG TGGTGGATTC GGCAGACGAT CGAACGCGGT TTGATGAATC AATCCAGAAC CGTTCGTTTG CCAGTGCATG TCGTCAAGGA AGTGCATACT TTTAACCGAG CGGCCAGAAA GATTGCGCAA AACCAGAATG CCGAAGTGAC CGCTGAACAA ATCGCTGATC ACCTGGACCG ACCCGTAAAT GAAGTGCTAA AAATCATGGC ATTCAACGAG CGGCAGTCCT CGTTCGATAG CCCGCTGAAA GGCGATGGGG AATTCTCGTT GCTGGACCTG ATACCCGATC AGGAAAGCAA CCAGCCAGAT GAACTATTGC AGGATGCTGG CGTTCATGGA TTTCTCGATC AACTCATCGC CCAGCTTGAG AGCAAGCAGC GAGAAGTATT GGTGCGTCGG TATGGATTGC GCGGGTACGA AACGCATACA CTCGAAGAAG TGGGCAACCA TTTGGGCGTA ACACGCGAAC GCGTTCGGCA AATTCAGTTG GAAGCAGTGC GGCGATTGAA ATCACTGGCT AAAAAGGCTG GTGTTACCGC CGAAGTTATT TTTCCCGACT GA
|
Protein sequence | MGSVTQYKKS VGSDGVQIST RQRIAPDPVD EEELLAVDDV ELDDDIDSPE KNEEELTTTF SASDSESVDP VRLYLRDVET ERLLTAEEEV IYSRRAQSGD ESARQKMIVC NLRLVIKIAR RYMNRGLDLL DLIEEGNIGL MRAVEKFDPE RGFRFSTYAT WWIRQTIERG LMNQSRTVRL PVHVVKEVHT FNRAARKIAQ NQNAEVTAEQ IADHLDRPVN EVLKIMAFNE RQSSFDSPLK GDGEFSLLDL IPDQESNQPD ELLQDAGVHG FLDQLIAQLE SKQREVLVRR YGLRGYETHT LEEVGNHLGV TRERVRQIQL EAVRRLKSLA KKAGVTAEVI FPD
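
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate a coding sequence. It is an illustration under stated assumptions, not the method the database used. The codon table is built from the standard NCBI amino-acid string, whose codon assignments translation table 11 shares; table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. All function names are hypothetical, and only short excerpts of the gene sequence are used rather than the full 1032 bp.

```python
# Sketch: clean a block-formatted sequence, compute GC content, translate.
# Codon order follows the NCBI convention (bases T, C, A, G). Translation
# table 11 uses the same codon-to-amino-acid assignments as the standard
# table; its alternative start codons are NOT handled in this sketch.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


first_blocks = "ATGGGTAGTG TGACGCAATA CAAAAAGTCT"  # first 30 bp of the gene
last_codons = "TTTCCCGACT GA"                      # last 12 bp of the gene

print(translate(first_blocks))  # MGSVTQYKKS
print(translate(last_codons))   # FPD*
```

The two excerpts translate to the first block (MGSVTQYKKS) and the last residues (FPD) of the listed protein, followed by the TGA stop codon. This suggests that the gene sequence is shown in coding orientation even though the gene lies on the minus strand. Applying `gc_fraction` to the full gene sequence would give the value to compare against the GC content row.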