Gene Hore_04190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_04190 
Symbol 
ID7314094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp442073 
End bp443410 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content39% 
IMG OID643610842 
Productglycoside hydrolase family 30 
Protein accessionYP_002508172 
Protein GI220931264 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5520] O-Glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones75 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCTA TTAGTGTAAT TTTAACCGCC AGAGACACCG GAGATAGATT AAGTTTAAAA 
GGTGAAAAAG TATTTAAATC GGGAATAGGG AGACAGGATA TAGACCTGGA ATTATATCCT
GATACAAGAT ATCAGAAAAT AATCGGTTTT GGTGGGGCAT TTACTGAGGC CGCTGCATAT
ACACTGTCTA AAATAAGTTC TGATAAGAGA CTTAAAATTA TCGAAAGCTA TTTTGATAGG
GATAAAGGTC TCGGGTATAA TATGGGGCGT GTTCATATCA ATAGCTGCGA TTTTGCCCTG
GAGAACTATA CTTATGTAGA AGATGGAGAT AGAGAGTTAA AGACATTTGA TATTTCCCGG
GAACGGCAAT GGGTGATACC TTTGATCAGG GATGCTATAA AGGCCAGGGG TGGTGAAATA
AAATTACTGG CCTCACCCTG GAGCCCACCC GCCTGGATGA AGAGCAATGA AAATATGAAT
TATGGCGGTA AATTGCTGCC TGAATATAGA GATGTCTGGG CTAAATATTA TACTAAATAT
ATTAAAGCCT TTCAGGAAGA AGGATTAAAT ATCTGGGGAA TTACTGTTCA GAATGAACCT
GCAGCAGTTC AGACCTGGGA TTCCTGTACA TATACTGCTG AAGAAGAGCG TGATTTTGTT
AAAAACCACC TCGGCCCGGT TATGCATGAA GAAGGCCTTG GTGACATTAA TATCCTTATC
TGGGATCATA ATAGAGATAT TATTGTTGAC AGAGTAAAAC CCATTCTGGA TGACCTTGAA
GCTGCTAAAT ATGTATGGGG GACCGCCTTT CACTGGTATG TGAGTGAAGA CTTTGATAAT
GTGGGCCAGG TACATGAAAT GTATCCTGAC AAGCATTTGC TTTTTACTGA AGGTTGTCAG
GAGGGTGGCT GTCAAATTGG CGAATGGTTT ACGGGTGAGA GATATGGGCG TAATATCATC
GGTGATTTAA ATAACTGGAC TGAAGGGTAT CTGGACTGGA ACATGGTATT GAATGAGGAA
GGTGGTCCAA ACCATGTGGG CAATTACTGT GATGCCCCGG TAATTGTGGA TACAAATACA
GAAGAGATAT ATTATAATAG TTCATATTAT TATATTGGCC ATTTCAGTAA ATATATCAGG
CCTGGTGCTG TCCGGATTGG TGTATCCTGT ACTAATGATA ATTTAAAGGC AACATCTTTC
CTTAATAGTG ATGGTAGTAT TATACTAATT GTTATGAATG AGACAGATAA TCCCACAGAT
TTTGCAGTAT CTCTTGATAA TAAGGTAGCT GACCTTACAT TGCCAGCCCA TGCTATTGCA
ACTTATATCA TTACTTAA
 
Protein sequence
MNSISVILTA RDTGDRLSLK GEKVFKSGIG RQDIDLELYP DTRYQKIIGF GGAFTEAAAY 
TLSKISSDKR LKIIESYFDR DKGLGYNMGR VHINSCDFAL ENYTYVEDGD RELKTFDISR
ERQWVIPLIR DAIKARGGEI KLLASPWSPP AWMKSNENMN YGGKLLPEYR DVWAKYYTKY
IKAFQEEGLN IWGITVQNEP AAVQTWDSCT YTAEEERDFV KNHLGPVMHE EGLGDINILI
WDHNRDIIVD RVKPILDDLE AAKYVWGTAF HWYVSEDFDN VGQVHEMYPD KHLLFTEGCQ
EGGCQIGEWF TGERYGRNII GDLNNWTEGY LDWNMVLNEE GGPNHVGNYC DAPVIVDTNT
EEIYYNSSYY YIGHFSKYIR PGAVRIGVSC TNDNLKATSF LNSDGSIILI VMNETDNPTD
FAVSLDNKVA DLTLPAHAIA TYIIT