Gene Hore_11020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_11020 
Symbol 
ID7312844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1195887 
End bp1196861 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content40% 
IMG OID643611539 
Productpeptidase M23B 
Protein accessionYP_002508851 
Protein GI220931943 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases
[COG3409] Putative peptidoglycan-binding domain-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.185646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAATTA GTAGTCCCGT CCTTGCCACC ACCCTGAAGT TGGGTGATAG GGGAAGGGAA 
GTAAAAAAGG TTCAACAGAT TTTAAGGGAC CTGGGGTATG ACATTGAAGT TGACAGTGTC
TTTGGTTATA GAACAAAACA GGTTGTGCAG GCTTTTCAGT TAAATAATGG ATTGGATGTA
GATGGTATTG TTGGTGATAA GACTTTAAAT TTACTGCATG AAATGGTAGA AGAAACTAAA
TATATAGTTA ACAAGGGGGA TACCCTGTCT GAAATCGCCC TGAAAACAGG GTCTACCGTA
CAGGCAATAA AAGATAGAAA TAATCTTAAA TCTTCTAAGA TTTATACAGG TCAAAAATTA
TATATCCCTA AAACAGGTAT TGGTGGCGGT AAGGATGGAA TACTCTATGA TAGAATAATA
CATGTTGTCC AGCGGGGAGA TGCCCTCTAT AATCTTGCTA AAAGATATGG GACTGAAGTG
GAAACAATTA AACTGGCCAA TAACCTTCAT AGCAACCGTA TTTTTGTAGG ACAGACCCTG
GTTATACCCC ACCTTAAAGA AGGAAGGAAA CATAATTTCA GGCTTAAAAA GGGAGCTTTT
ATATGGCCTG TGCTCGGCCG GATTTCTTCA CCATATGGGT ACAGGATCCA TCCTATAACC
AACAAACGTG AATTTCATGG CGGTATAGAT ATAGCTGTTC CTATTGGTAC CAGAATTAGA
GCTGCTGCCA GTGGTACCGT TATCCAGAGT GGCTGGATAA GGGGGTTTGG GAAAACCATA
ATTATTGACC ATGAAAATGG TATCAGAACC CTTTATGCCC ATAATTCACG TCTGTTAATA
AGAGCCGGTC AAAAGGTGAA ACTGGGGGAT GTTATTGCAC TGGCAGGGAG TACCGGGATG
AGTACCGGTC CCCATCTGGA CTTCAGGATT TATAATAAGG GGAAAACAGT GAACCCGATT
AATTATTTAC CTTAA
 
Protein sequence
MIISSPVLAT TLKLGDRGRE VKKVQQILRD LGYDIEVDSV FGYRTKQVVQ AFQLNNGLDV 
DGIVGDKTLN LLHEMVEETK YIVNKGDTLS EIALKTGSTV QAIKDRNNLK SSKIYTGQKL
YIPKTGIGGG KDGILYDRII HVVQRGDALY NLAKRYGTEV ETIKLANNLH SNRIFVGQTL
VIPHLKEGRK HNFRLKKGAF IWPVLGRISS PYGYRIHPIT NKREFHGGID IAVPIGTRIR
AAASGTVIQS GWIRGFGKTI IIDHENGIRT LYAHNSRLLI RAGQKVKLGD VIALAGSTGM
STGPHLDFRI YNKGKTVNPI NYLP