Gene Hore_06440 details


Gene Information

Locus tag: Hore_06440
Symbol:
ID: 7314549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 698694
End bp: 699911
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 36%
IMG OID: 643611074
Product: polysaccharide deacetylase
Protein accession: YP_002508396
Protein GI: 220931488
COG category: [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
        [COG3858] Predicted glycosyl hydrolase
TIGRFAM ID:
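
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another. The short Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. Both are assumptions, not statements made by the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 698694
end_bp = 699911

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values match the 1218 bp and 405 aa listed in the record.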


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.467697
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT TTTACTATCA TTATTTGAAA TCTTTATCAA TGTACTTATC CCTTATTATT 
ATATTAACAC TTTTTATGCC TCTCAATATT GAAGCTGCTA AAGTACACCG GGTAAAACCC
GGGGAAAATC TTTATAGTAT AGCCAGAAAA TATGGTGTAA CCGTTAATGA AATTATCAGA
ACAAACAATT TACGTAATCC CGGTCAATTT TTTTCAACCC AGGCATTAAT TATCCCTGAT
GCTTACCGCC CCAATATTTA TCGTGTCCAG AAAGGAGATA CTCTTTATCT AATCTCTAAA
AAGCTGGAAA TACCAATGGA TTTTTTAGCA AAAGTAAATT ACCTGACCGA TAAAGATGAA
CTATATGAAC GCCAGGTTCT CTATGTTCCT GCCTGGCCCA GGTTTCATAA AGTTAAACCC
GGTGACACCC TTTATAAAAT ATCAAAAAGA TATGGAATTT CTTTAAAGAG AATTAAAGAA
GCAAACCAGT TATACAGTCA TAATAACCTG AAAATAGGTC AGTATATCAA AGTACCGGCT
CCGGAGTATA AACCACCTGA AAGAAAAGAC CCCCAGTATA CAAAGTTATT TCCTGATACC
TTTTTCTTAT CAGGTAAAAC CAACGGATAT AATATAGCCT TAACCTTTGA TGACGGTCCT
GATAACATTT ATACCCCTAA AATCCTTGAT ATACTAAAAA AATATAATGT AAGAGCCACC
TTCTTTTTAA TGGGTAGCAG GGCTGACAGA TACCCTGATA TTGTAAAACG AATGGTCAGG
GAAGGCCATA TAGTAGCTAA CCACACCTGG TCCCATGTAA ACCTGAACAA AACAACATCG
TCACGTTTCT ATAATGAAAT TACAAATACC AGTAATACAA TAAAAAATCA TACCAGTCTT
ACACCGGCTT TAGTACGGCC ACCTTATGGA GCAGTGTCTA CCAGTGTAAT TAAGCGACTT
AAAAATATGG GTTTTAAGGT TATATTCTGG TCAGTAGATT CCAGGGACTG GAATACCCAG
GATGTTGATA AAATTTTAAT TAATACCCTG CCAAATGTCA GAAAGGACTC CATAATTCTC
TTTCATTCAG CCGGTGGTGA AGGCCACGAT CTCTCGGCAA CTGTCAGGGC TTTACCGGAG
TTAATTAATA CCTTAAGAAT GCTTGACTAT CGCTTTGTAA ACCTGACCCA GCTTCTTGGC
ATTAAGGCAT ACCAGTAA
 
Protein sequence
MKKFYYHYLK SLSMYLSLII ILTLFMPLNI EAAKVHRVKP GENLYSIARK YGVTVNEIIR 
TNNLRNPGQF FSTQALIIPD AYRPNIYRVQ KGDTLYLISK KLEIPMDFLA KVNYLTDKDE
LYERQVLYVP AWPRFHKVKP GDTLYKISKR YGISLKRIKE ANQLYSHNNL KIGQYIKVPA
PEYKPPERKD PQYTKLFPDT FFLSGKTNGY NIALTFDDGP DNIYTPKILD ILKKYNVRAT
FFLMGSRADR YPDIVKRMVR EGHIVANHTW SHVNLNKTTS SRFYNEITNT SNTIKNHTSL
TPALVRPPYG AVSTSVIKRL KNMGFKVIFW SVDSRDWNTQ DVDKILINTL PNVRKDSIIL
FHSAGGEGHD LSATVRALPE LINTLRMLDY RFVNLTQLLG IKAYQ
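
The gene and protein sequences above can be cross-checked by translating the DNA. The following is a minimal sketch and not the database's own pipeline. It builds a codon table from the standard codon assignments, which NCBI translation table 11 shares with table 1 (the two differ only in permitted start codons). The function name `translate` and the variable names are hypothetical. To keep the example short, it checks only the first and last lines of the gene sequence against the corresponding protein residues.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (first base slowest).
# These codon assignments are the same in NCBI translation tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; any trailing 1-2 bases are ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
first_gene_line = "ATGAAAAAAT TTTACTATCA TTATTTGAAA TCTTTATCAA TGTACTTATC CCTTATTATT"
last_gene_line = "ATTAAGGCAT ACCAGTAA"

# Matching protein residues from the record: the first 20 and the last 5.
first_protein_block = "MKKFYYHYLK SLSMYLSLII".replace(" ", "")
last_protein_residues = "IKAYQ"

print(translate(first_gene_line) == first_protein_block)
print(translate(last_gene_line) == last_protein_residues)
```

Translating the last line on its own works here because the preceding 1200 bases are a multiple of three, so that line starts on a codon boundary.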