Gene Hore_22030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_22030 
Symbol 
ID7313751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2398101 
End bp2399330 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content37% 
IMG OID643612655 
ProductPeptidoglycan-binding LysM 
Protein accessionYP_002509943 
Protein GI220933035 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1388] FOG: LysM repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAGA AGATAATTGT TTTAAGTCTA TTGCTTCTCT TCCTGGTAAG CCTGATACCA 
CTCCATACGG CCGGTGCTGC AGTTATTTAT CAGGTTAAAA GTGGAGATAC CATCTGGGAA
ATTTCCCGGG AATTTAATGT GCCTGTCGAA ATAATAATCC AGCAGAATAA TATTTCAAAC
CCTGCCAGTA TTTATACCGG TCAAAAATTA TTAATAACTT CTGAAAATGA TAAAATAATT
ATTACCATTG GAGACAACCA GACTGGTCAG AACCAGAATA CCCATAAATA TACGGTAAAA
CCTGGTGATA CTCTGTGGAA ACTGGCCCAG AAATTTAATA CTTCTATTAC TGAACTCGTT
GACCTCAATA ATTTAGAACA GTATAGTATC TATATCGGCC AAAAACTCCA GATACCATCC
AGTAATGCAC CTGAGAATGA TAACAATTTT ATCTATTACA CCATTCAACC CGGTGATATT
CTCTGGAATA TAGCCCAGAA ATATGATACT ACAGTCGAAC AACTTATAGA GTTAAATAAC
ATCAAGGATG CCTATGACCT TTATCCTGGA AGGAAACTCC TGGTTCCTCT CTCCGGAGGA
AACACACCTG CCGGGCAGGA AACCAGTAAT CCCTCCAGTG TCCCCTATAC AGCGTATTAT
TTCTATAAAA TACAGGAAGG GGATAAAATC TGGAAAATAG CCGATACTTT CGGAGTCAGG
GTCTCAGAAC TGGTGGGTTA TAATAATATA GAAAATATTA ATCAAATACA AACAGGCCAG
ATCTTAATAA TTCCTCTAGA AAAATCGACT AAGCTCTCCT ATGTTCAAAA AGCAGCCGCT
AAATTAAAGA ATTATTACCG TGTAAAAAAT AATGAAACCC TCGTTGATAT TGCCAAATAC
TTTATGGTCC CGGAAGAAGG GTTGAGGGCC ATCAACCATC TTCAGGAAGA TGAAACTGTA
TACCCGGGAC AACTGCTTTT AATGCCTGTT AGTAAGGCTC TGTTTAACAA ACATGAATTG
TATAAAGTTA AGAGTGGTGG CGAGTATATT TTCGACATTG CCTACCATAA AGGAGTATCT
ATCAAATCTA TCTTAAAAGT CAATTACCTT AAAAATCCTA ATCAGAAATT TGATGAAGAA
AAAGTTATTA TTATCCCCCT TGATGAAGAA AGTAAGGTTA CCTGGATTGA CTATGAAAAC
GGCAAGCCCC AGAATTCCTG GCTTAATTAA
 
Protein sequence
MTKKIIVLSL LLLFLVSLIP LHTAGAAVIY QVKSGDTIWE ISREFNVPVE IIIQQNNISN 
PASIYTGQKL LITSENDKII ITIGDNQTGQ NQNTHKYTVK PGDTLWKLAQ KFNTSITELV
DLNNLEQYSI YIGQKLQIPS SNAPENDNNF IYYTIQPGDI LWNIAQKYDT TVEQLIELNN
IKDAYDLYPG RKLLVPLSGG NTPAGQETSN PSSVPYTAYY FYKIQEGDKI WKIADTFGVR
VSELVGYNNI ENINQIQTGQ ILIIPLEKST KLSYVQKAAA KLKNYYRVKN NETLVDIAKY
FMVPEEGLRA INHLQEDETV YPGQLLLMPV SKALFNKHEL YKVKSGGEYI FDIAYHKGVS
IKSILKVNYL KNPNQKFDEE KVIIIPLDEE SKVTWIDYEN GKPQNSWLN