Gene Hneap_1944 details


Gene Information

Locus tag: Hneap_1944
Symbol:
ID: 8535102
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 2079276
End bp: 2080307
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 50%
IMG OID: 646384325
Product: RNA polymerase, sigma 70 subunit, RpoD subfamily
Protein accession: YP_003263813
Protein GI: 261856530
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02394] RNA polymerase sigma factor RpoS
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
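
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2079276
end_bp = 2080307

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1032 bp) and Protein Length (343 aa).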


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000945772
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTAGTG TGACGCAATA CAAAAAGTCT GTCGGCAGCG ACGGCGTTCA GATTTCAACA 
CGTCAACGGA TAGCACCTGA TCCTGTGGAC GAAGAAGAAC TGCTTGCTGT GGATGATGTC
GAGTTGGACG ACGATATTGA CTCTCCGGAG AAAAATGAGG AAGAACTGAC AACGACCTTT
AGTGCCAGCG ATTCTGAGTC GGTTGATCCT GTTCGCTTGT ATCTTCGTGA TGTTGAAACC
GAGCGCTTGC TGACGGCGGA AGAAGAGGTG ATTTATTCAC GGCGAGCGCA AAGCGGTGAT
GAATCCGCAC GACAAAAAAT GATCGTCTGC AATTTGCGCC TGGTGATCAA GATTGCGCGA
CGTTACATGA ATCGAGGCCT CGATTTGCTT GATTTGATCG AGGAAGGAAA CATCGGTTTG
ATGAGAGCGG TCGAAAAGTT CGATCCCGAG CGCGGATTTC GTTTTTCCAC CTATGCCACG
TGGTGGATTC GGCAGACGAT CGAACGCGGT TTGATGAATC AATCCAGAAC CGTTCGTTTG
CCAGTGCATG TCGTCAAGGA AGTGCATACT TTTAACCGAG CGGCCAGAAA GATTGCGCAA
AACCAGAATG CCGAAGTGAC CGCTGAACAA ATCGCTGATC ACCTGGACCG ACCCGTAAAT
GAAGTGCTAA AAATCATGGC ATTCAACGAG CGGCAGTCCT CGTTCGATAG CCCGCTGAAA
GGCGATGGGG AATTCTCGTT GCTGGACCTG ATACCCGATC AGGAAAGCAA CCAGCCAGAT
GAACTATTGC AGGATGCTGG CGTTCATGGA TTTCTCGATC AACTCATCGC CCAGCTTGAG
AGCAAGCAGC GAGAAGTATT GGTGCGTCGG TATGGATTGC GCGGGTACGA AACGCATACA
CTCGAAGAAG TGGGCAACCA TTTGGGCGTA ACACGCGAAC GCGTTCGGCA AATTCAGTTG
GAAGCAGTGC GGCGATTGAA ATCACTGGCT AAAAAGGCTG GTGTTACCGC CGAAGTTATT
TTTCCCGACT GA
 
Protein sequence
MGSVTQYKKS VGSDGVQIST RQRIAPDPVD EEELLAVDDV ELDDDIDSPE KNEEELTTTF 
SASDSESVDP VRLYLRDVET ERLLTAEEEV IYSRRAQSGD ESARQKMIVC NLRLVIKIAR
RYMNRGLDLL DLIEEGNIGL MRAVEKFDPE RGFRFSTYAT WWIRQTIERG LMNQSRTVRL
PVHVVKEVHT FNRAARKIAQ NQNAEVTAEQ IADHLDRPVN EVLKIMAFNE RQSSFDSPLK
GDGEFSLLDL IPDQESNQPD ELLQDAGVHG FLDQLIAQLE SKQREVLVRR YGLRGYETHT
LEEVGNHLGV TRERVRQIQL EAVRRLKSLA KKAGVTAEVI FPD
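
The gene and protein sequences are printed in blocks of ten characters, so the spaces and line breaks have to be removed before the two can be compared. The following is a minimal sketch of translating such a block-formatted fragment. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code, and the helper names `clean` and `translate` are hypothetical. Only the first and last lines of the gene sequence are used as examples.

```python
# Codon table in TCAG order; table 11 uses the same amino acid
# assignments as the standard code (it differs only in start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(block_text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(block_text.split()).upper()


def translate(block_text: str) -> str:
    """Translate a nucleotide fragment in frame; '*' marks a stop codon."""
    seq = clean(block_text)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, as printed above.
first_line = (
    "ATGGGTAGTG TGACGCAATA CAAAAAGTCT GTCGGCAGCG ACGGCGTTCA GATTTCAACA"
)
# Last line of the gene sequence, which is in frame because the
# preceding 1020 bases are a multiple of three.
last_line = "TTTCCCGACT GA"

n_terminus = translate(first_line)
c_terminus = translate(last_line)
print(n_terminus)
print(c_terminus)
```

The translated first line corresponds to the first twenty residues of the protein sequence, and the last line yields the final three residues followed by the stop codon.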