Gene Hore_02070 details


Gene Information

Locus tag: Hore_02070
Symbol:
ID: 7312526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 213196
End bp: 214350
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 41%
IMG OID: 643610629
Product: Stage II sporulation E family protein
Protein accession: YP_002507963
Protein GI: 220931055
COG category:
COG ID:
TIGRFAM ID:
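
As a quick consistency check on the coordinate and length fields above, the following Python sketch recomputes the gene and protein lengths from the start and end positions. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in this record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 213196
END_BP = 214350


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded, assuming one terminal stop codon (an assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1155
print(protein_length(length_bp))  # 384
```

Both results agree with the Gene Length and Protein Length fields listed above.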


Plasmid Coverage information

Num covering plasmid clones: 69
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGGGATGA AAGTTGATGT TGGAGTTGCC AGCCTTTCAA AACACGGTGA AGAGGTCTGT 
GGTGATAGTT ATCAGGTCAT CAGGTCTAGG GATGCGACAA CAGTAATTTT ATCTGATGGT
CTTGGTAGTG GGATCAAAGC CAGTATACTC TCGTTATTAT CTACGAAAAT TGCTTCAAGA
TTACTGGAAA GAAATATCAA TGTAGAACAG GTTTTTGCCA CCATAGCAGA TACCCTCCCT
ATCTGTCAGA CCAGGGAGAT AGCCTATTCA ACTCTGTCGA TTTTAAAAAT AACAGATAGT
GGCTATGCCC ATTTAATTGA ATACGACAAT CCCTCCCTGA TATTGATTCG TAACGGGCAA
AGGGTCAAAT TAGATAAAGA ACAAAAAATA ATCGCCGGTA AAAAAGTAAG TGAAGTCCAT
TTTAAACTTA AACTGGGTGA TTTATTGCTT GTAGTTAGTG ATGGAGTCAT TAATGCCGGG
GTTGGAGGGC TATTTCACCT GGGGCTGGGG CGAGAAAGAC TGGTAGAACA TGTTACCAAA
TATGGATTAT ATAAAAAGGA TTCCCTTCAT GTTGCCCGGG ATATCATTGA ATTAACTGAA
GCCTGTTATA TCTGTAAACC GGGTGATGAT TCGACATGTA TTGCTTTAAA ATTAAGAGAA
CCCCGTTCTG TTGTGGTGTT AACGGGTCCT CCCACTGATC CAGATCTGGA TGGAAAGGTG
GTTAAGGAGT TTCTTAAACG TAATAATTCA GAGAAAGTAG TCTGTGGCGG GGCGACCGGA
AATATGGTGG CTCGGGAACT TGGTGAAGAT ATAGAAACAA GCTTAACCTA TGATGACCCC
AGTGTTCCCC CCCTTGCTTC TATAAAGGGA ATTGATCTGG TGACAGAAGG CATATTAACC
CTCAATAAAT GTCTGGAAAA GATTTTGCAG TTAAAAAAGG GACAGAGTAT TGATGAAAAA
AAAGATGGGG CCAGCCTTTT AGCCAGGACA TTATTTAAGG CCGATCAAAT ACATTTTTTG
GTAGGAACTG CTGTAAACCC CGCCCACCAG GAATTAATGC AGTCCTTACA GTTAAAGCCC
AGGCCGGTAA TAGTTAATAA ACTGATAAAA GAGCTTGCTG AACTGGGTAA AGAGATAAAG
ATAAAGAGGT ATTAA
 
Protein sequence
MGMKVDVGVA SLSKHGEEVC GDSYQVIRSR DATTVILSDG LGSGIKASIL SLLSTKIASR 
LLERNINVEQ VFATIADTLP ICQTREIAYS TLSILKITDS GYAHLIEYDN PSLILIRNGQ
RVKLDKEQKI IAGKKVSEVH FKLKLGDLLL VVSDGVINAG VGGLFHLGLG RERLVEHVTK
YGLYKKDSLH VARDIIELTE ACYICKPGDD STCIALKLRE PRSVVVLTGP PTDPDLDGKV
VKEFLKRNNS EKVVCGGATG NMVARELGED IETSLTYDDP SVPPLASIKG IDLVTEGILT
LNKCLEKILQ LKKGQSIDEK KDGASLLART LFKADQIHFL VGTAVNPAHQ ELMQSLQLKP
RPVIVNKLIK ELAELGKEIK IKRY
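
The relationship between the gene sequence and the protein sequence above can be sketched in Python. This is a minimal illustration, not the database's own pipeline: it assumes that translation table 11 uses the standard codon-to-amino-acid assignments, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here. The two test fragments are the first 30 and last 15 nucleotides of the gene sequence as listed.

```python
# Codon table built in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 15 nt of the gene sequence in this record.
print(translate("ATGGGGATGA AAGTTGATGT TGGAGTTGCC"))  # MGMKVDVGVA
print(translate("ATAAAGAGGT ATTAA"))                  # IKRY
```

The final fragment is in frame because the 1140 nucleotides preceding it are a multiple of three; its last codon, TAA, is the stop codon and is not represented in the protein sequence.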