Gene Information |
Locus tag | Hneap_0347 |
Symbol | |
ID | 8533467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 349342 |
End bp | 350337 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 646382730 |
Product | DNA-directed RNA polymerase, alpha subunit |
Protein accession | YP_003262257 |
Protein GI | 261854974 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
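
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon, which is consistent with the values in this record. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Hneap_0347.
length = gene_length_bp(349342, 350337)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the Start bp and End bp values listed here, both results agree with the Gene Length (996 bp) and Protein Length (331 aa) fields.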
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000103496 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTAG GTTCTACCGA ACTGCTTAAG CCACGACTGG TCGATGTTCA GGTTTTGGAC GCGACCCGGG CCCGTGTCGT GCTGGAGCCG CTTGAGCGTG GCTTCGGTTA CACACTTGGT AATGCGCTTC GCCGTATTCT TCTTTCTTCC ATCCCAGGCG TTGCCGTAAC GGAAGCGGAA ATTGAAGGTG TTGTTCATGA GTACACTACC ATTGAGGGTG TGCACGAGGA CGTGCTTGAA ATTCTGCTCA ACCTGAAAGG TCTTTCTGTC ATGCTTCATG GTCGCGACGA GGCCATGCTG AGCATCAAGA AAACTGGTGC TGGCGCCATC ACGGCTGCCG ATATCAAGGC CGATCACGCT GTTGAAATCA TCAATCCTGA GCATCACATT GCTACATTGA ATGATCAGGG TAGCTTGGTT ATGACCTTCA AGGTCACACG TGGTAAAGGA TATGTTCCTG CCAGTGCGCG TCGTGAAGAG GCGGGTGGTC AAATTGGCCG ACTGCTGGTT GACGCCTCAT TTTCTCCACT GCGTCGCGTT TCTTATTCTG TCGAGCGCGC TCGTGTTGAA CAGCGTACCG ATCTTGACTC GCTTGTTCTT GATATCGAGA CCAACGGTTC TATTTCTGCC GAAGATGCCA TTCGTCAAGC CGCTGGCATT TTGGTTGATC AGCTATCGGT CTTTGTTGAC CTCAAGGCTG AAAAAGTAGA GCCAGTGGTT GAGCAAGCGC CTGCAGTTGA TCCGGTTCTG CTGCGTCCAG TCGATGATCT TGAGTTGACC GTTCGTTCGG CTAACTGCCT GAAGGCAGAA AATATTTATT ACATCGGTGA TCTCATTCAG CGTACGGAAA TCGAGTTGCT AAAAACTCCG AATCTGGGTA AAAAATCGCT GACTGAAATC AAGGACGTTC TGGCCAAGCA GGGTTTGTCT CTGGGTCAAC GTCTCGAAAA CTGGCCACCT GCAGAGCTTT TGAATCTCGC TGAGTCGAAG ATTTAA
|
Protein sequence | MQLGSTELLK PRLVDVQVLD ATRARVVLEP LERGFGYTLG NALRRILLSS IPGVAVTEAE IEGVVHEYTT IEGVHEDVLE ILLNLKGLSV MLHGRDEAML SIKKTGAGAI TAADIKADHA VEIINPEHHI ATLNDQGSLV MTFKVTRGKG YVPASARREE AGGQIGRLLV DASFSPLRRV SYSVERARVE QRTDLDSLVL DIETNGSISA EDAIRQAAGI LVDQLSVFVD LKAEKVEPVV EQAPAVDPVL LRPVDDLELT VRSANCLKAE NIYYIGDLIQ RTEIELLKTP NLGKKSLTEI KDVLAKQGLS LGQRLENWPP AELLNLAESK I