Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_23300 |
Symbol | |
ID | 7314213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 2549402 |
End bp | 2550700 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643612782 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_002510070 |
Protein GI | 220933162 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 56 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAAA TAACATTTCT CGGGGCTGGA AGTACTGTAT TTGCTAAAAA TGTTCTGGGG GATTGTATGT TAACCCCGGC CTTAAGAGAT TCCCATATCG CCCTGTATGA CATTGATCAC CAGCGCTTAC GGGAATCTGA ACAAATGCTT AAAAGTATTA ATAGAAACTC AAACCAAAAC AGGGCAGAAA TTGTTGCTTA TACAGACCGT AAAGAAGCCC TCCGGGATTC TGACTACGTT GTTAACGCTA TCCAGGTAGG TGGATACAAA CCCTGTACTG TTACCGATTT TGAGGTCCCG AAAAAATACG GTTTACGACA GACCATCGGG GATACCCTGG GTATCGGTGG TATTTTCAGG GCCTTAAGAA CTATACCGAT CCTCCTGGAT TTTGCCAGGG ATATTGAAGA AGTATGTCCT GACGCCTGGT TTTTAAATTA TACCAATCCC ATGGCCATTT TAACCGGGGC TATGCTCAAA GCAACCAATG TTAAGACAGT TGGCCTCTGC CACAGTGTCC AGACCTGTGT TCCTGATCTA CTTAAAGACC TGGGAATGAG TACTGAAAAC GTCCAGTGGA AGATAGCCGG TATTAACCAC ATGGCCTGGC TTTTGGAAAT AACCAGAAAC GGCCGGGACC TTTATCCTGA AATTAAAGAA AAGGCCCGGG CCCGAAAAAA ACCCCATGAT GACATGGTCA GGTATGAAAT AATGGAACGC TTTGGCTACT ATGTCACCGA ATCCAGTGAA CACAATGCTG AATATATGCC CTATTTCATC AAAAGCAACT ACCCTGAATT AATCGAAAAA TATAATATTC CTCTCGATGA GTATCCTCGC CGTTGTGAAC AACAAATAAA AGACTGGGAA GAGATGAAAG ACAAACTGGT CCACAATGAT AATCTTGAAC ACAGTCGGAC CCATGAATAT GCTTCCTATA TTATGGAAGC CATGGAAACA GATAAGCCCT ATAAAATTGG GGGGAACGTC CTGAATACAG GGCTTATTAC CAACCTCCCT GAAGATGCCT GTGTAGAGGT TCCCTGCCTG GTGGACAGGA GTGGGGTTAC CCCCTGTTAT GTGGGAGACC TGCCACCTCA ACTGGCTGCC CTGAACCGGA CAAATATCAA TGTTCAGTTG CTGACCATTG AAGCTGCCCT GACCCGGAAA AAAGAATACA TTTACCAGGC AGCCATGCTG GATCCTCATA CAGCTGCCGA ATTATCTATT GATGAGATTT ATGCCCTGGT TGATGATATG ATTGAAGCCC ATGGTGACTG GCTCCCTGAA TATAAATAA
|
Protein sequence | MPKITFLGAG STVFAKNVLG DCMLTPALRD SHIALYDIDH QRLRESEQML KSINRNSNQN RAEIVAYTDR KEALRDSDYV VNAIQVGGYK PCTVTDFEVP KKYGLRQTIG DTLGIGGIFR ALRTIPILLD FARDIEEVCP DAWFLNYTNP MAILTGAMLK ATNVKTVGLC HSVQTCVPDL LKDLGMSTEN VQWKIAGINH MAWLLEITRN GRDLYPEIKE KARARKKPHD DMVRYEIMER FGYYVTESSE HNAEYMPYFI KSNYPELIEK YNIPLDEYPR RCEQQIKDWE EMKDKLVHND NLEHSRTHEY ASYIMEAMET DKPYKIGGNV LNTGLITNLP EDACVEVPCL VDRSGVTPCY VGDLPPQLAA LNRTNINVQL LTIEAALTRK KEYIYQAAML DPHTAAELSI DEIYALVDDM IEAHGDWLPE YK
|
| |