Gene Hneap_0347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_0347 
Symbol 
ID8533467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp349342 
End bp350337 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content50% 
IMG OID646382730 
ProductDNA-directed RNA polymerase, alpha subunit 
Protein accessionYP_003262257 
Protein GI261854974 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000103496 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTAG GTTCTACCGA ACTGCTTAAG CCACGACTGG TCGATGTTCA GGTTTTGGAC 
GCGACCCGGG CCCGTGTCGT GCTGGAGCCG CTTGAGCGTG GCTTCGGTTA CACACTTGGT
AATGCGCTTC GCCGTATTCT TCTTTCTTCC ATCCCAGGCG TTGCCGTAAC GGAAGCGGAA
ATTGAAGGTG TTGTTCATGA GTACACTACC ATTGAGGGTG TGCACGAGGA CGTGCTTGAA
ATTCTGCTCA ACCTGAAAGG TCTTTCTGTC ATGCTTCATG GTCGCGACGA GGCCATGCTG
AGCATCAAGA AAACTGGTGC TGGCGCCATC ACGGCTGCCG ATATCAAGGC CGATCACGCT
GTTGAAATCA TCAATCCTGA GCATCACATT GCTACATTGA ATGATCAGGG TAGCTTGGTT
ATGACCTTCA AGGTCACACG TGGTAAAGGA TATGTTCCTG CCAGTGCGCG TCGTGAAGAG
GCGGGTGGTC AAATTGGCCG ACTGCTGGTT GACGCCTCAT TTTCTCCACT GCGTCGCGTT
TCTTATTCTG TCGAGCGCGC TCGTGTTGAA CAGCGTACCG ATCTTGACTC GCTTGTTCTT
GATATCGAGA CCAACGGTTC TATTTCTGCC GAAGATGCCA TTCGTCAAGC CGCTGGCATT
TTGGTTGATC AGCTATCGGT CTTTGTTGAC CTCAAGGCTG AAAAAGTAGA GCCAGTGGTT
GAGCAAGCGC CTGCAGTTGA TCCGGTTCTG CTGCGTCCAG TCGATGATCT TGAGTTGACC
GTTCGTTCGG CTAACTGCCT GAAGGCAGAA AATATTTATT ACATCGGTGA TCTCATTCAG
CGTACGGAAA TCGAGTTGCT AAAAACTCCG AATCTGGGTA AAAAATCGCT GACTGAAATC
AAGGACGTTC TGGCCAAGCA GGGTTTGTCT CTGGGTCAAC GTCTCGAAAA CTGGCCACCT
GCAGAGCTTT TGAATCTCGC TGAGTCGAAG ATTTAA
 
Protein sequence
MQLGSTELLK PRLVDVQVLD ATRARVVLEP LERGFGYTLG NALRRILLSS IPGVAVTEAE 
IEGVVHEYTT IEGVHEDVLE ILLNLKGLSV MLHGRDEAML SIKKTGAGAI TAADIKADHA
VEIINPEHHI ATLNDQGSLV MTFKVTRGKG YVPASARREE AGGQIGRLLV DASFSPLRRV
SYSVERARVE QRTDLDSLVL DIETNGSISA EDAIRQAAGI LVDQLSVFVD LKAEKVEPVV
EQAPAVDPVL LRPVDDLELT VRSANCLKAE NIYYIGDLIQ RTEIELLKTP NLGKKSLTEI
KDVLAKQGLS LGQRLENWPP AELLNLAESK I