Gene Hore_01590 details

Gene Information

Locus tag: Hore_01590
Symbol:
ID: 7313287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 164939
End bp: 166507
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 37%
IMG OID: 643610582
Product: uncharacterized protein, probably involved in trehalose biosynthesis
Protein accession: YP_002507916
Protein GI: 220931008
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02457] trehalose synthase-fused probable maltokinase
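
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(164939, 166507)
print(length)                  # 1569
print(protein_length(length))  # 522
```

Both values agree with the Gene Length and Protein Length fields of the record.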


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTTAG TAACGGCATA TTCTTTTGGT GAAATAATAA AAGAACTAGA TAATATTATT 
TCATCAATAG AACCGAAAAA ATTTAAGGAG ACCCGGTGGT TCAGGAAAAA GGCCGATGAC
TTAATAGGTG TAGAGCTAGT AGACTATGGT GTATTAGAGT ATAAACCTGA TGAAAGACTG
GTAATTCCAG CAGTTGTCTC CTTTCATATG CAGGATCGGG TCAACAATAC AGGGTATGAA
GAACTATATT ATTTACCACT GGTAATAAAT AAAAATTATG ATGAAAAATG CCTTTTGTTA
CTGGAGATAG AAAACACCTC ATATACAGCT TATGTATATG AGGGGATATA TTCGCTGGAA
TATAACAGGG TTCTCGATGA GTTTGTTAAA AGAGAAGGTA GTCTTGAAAT GGAAAAAGGA
GGCTATCTTC AGGCTGAATT ATTAAATTAT TATAATAATG TTACCAGGTC TTATTCGTTG
ACAGAGGTTT CAAGTAATAG TCTGGCTTTA ATCGAAAGAA GCAGTATTAT AAAGACTTAT
CGAAAAATGG TACCCGGGGT CAATCCTGAT CTGGAAATAG GATTATCACT TATCAGAGAA
ACCGGTTTCA GAAATTTTCC CGAAATAAAA GGATATGTCT CCTATCAAAA GAATAATATC
AATTATGACA TCTGTCTTAT CGAACAATAT GTAAATAACG AGGGGGATGT CTGGGCTTAT
ACCCAGCGTT TCCTCTCGCG TTATTTGCAT TATGTAAGTG ATAACCCGGG GAAGGACTCC
CTGTTTACAT ACCTTGACCC CTTTATTGAA GAGATGGAAT ACCTGGGTGA GGTCATTGGT
GAACTACATC TATCCCTGTC TGGAATAGAG CAGGATAATT TTAAACCACG GAAGCCAACC
CTGGCAGAAA TTGAAGCCTG GCATGAAGAA GTACAACAGA ATACAGAGTT ACTCTTTAAA
TTACTTAAGA AAAACAAGAA TAATATCAAT AATGATTATA ACTATTACAG GGTGCTAAAC
ATGGTCCTTG ATAAGGAAAA AACAATATTT GGAGCAGTAA AGCGCATTTT TGGATTAAAG
GATTCACTGG GTAAATATAT GAGGGTTCAC GGGGACCTTC ATTTAGAGCA AATATTAAAG
ACTGAAGATG ATTTTATGGT CCTTGACTTT GAGGGCGAGC CTTTAAAGAG TATTCAGCAT
CGCCGCATGA AGTATTCTCC ATTAAAAGAT ATAGCCGGAA TGATGAGATC CTTTAACTAT
GCTGGTTATG CGGGCTACTT TGATTTTGTT AAAAAGAATC AGGGTATAAA TGACCGGGGA
AGATTTATTA AAGCAATCTC ACTATGGGAG GAAGAGGCCC GGACATCTTT CTTGACAGGG
TATCAGAATA AGATAAGAGA AAATAATGGA GACTTTTTAC CACCTGAAGA TAAATTTGAC
CAGGTGCTGG CCCTGTTTAA GATTGAAAAA GCTCTGTATG AAGGGATATA TGAGATCAAT
AATAGACCAG ACTGGTTGCA CATTCCGCTC CAGGGGATAC TGGACTGTGT GGAAGAAATC
GCTGACTGA
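
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any calculation. As a minimal sketch, the snippet below strips the whitespace and computes GC content as the percentage of G and C bases. It is applied only to the first thirty bases of the sequence; the helper names are illustrative, and this is not necessarily how the database derives its GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGGCTTTAG TAACGGCATA TTCTTTTGGT"
seq = clean_sequence(first_blocks)
print(len(seq))                   # 30
print(round(gc_percent(seq), 1))  # 36.7
```

Passing the full sequence through `clean_sequence` and `gc_percent` gives the gene-wide figure to compare against the GC content field.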
 
Protein sequence
MALVTAYSFG EIIKELDNII SSIEPKKFKE TRWFRKKADD LIGVELVDYG VLEYKPDERL 
VIPAVVSFHM QDRVNNTGYE ELYYLPLVIN KNYDEKCLLL LEIENTSYTA YVYEGIYSLE
YNRVLDEFVK REGSLEMEKG GYLQAELLNY YNNVTRSYSL TEVSSNSLAL IERSSIIKTY
RKMVPGVNPD LEIGLSLIRE TGFRNFPEIK GYVSYQKNNI NYDICLIEQY VNNEGDVWAY
TQRFLSRYLH YVSDNPGKDS LFTYLDPFIE EMEYLGEVIG ELHLSLSGIE QDNFKPRKPT
LAEIEAWHEE VQQNTELLFK LLKKNKNNIN NDYNYYRVLN MVLDKEKTIF GAVKRIFGLK
DSLGKYMRVH GDLHLEQILK TEDDFMVLDF EGEPLKSIQH RRMKYSPLKD IAGMMRSFNY
AGYAGYFDFV KKNQGINDRG RFIKAISLWE EEARTSFLTG YQNKIRENNG DFLPPEDKFD
QVLALFKIEK ALYEGIYEIN NRPDWLHIPL QGILDCVEEI AD
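
The protein sequence follows from the gene sequence through translation table 11. A rough sketch of that step is shown below, using only the Python standard library. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Because this gene starts with ATG, alternative start codons are not handled here. The table construction and the `translate` helper are illustrative and are not taken from the database.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acids of translation table 11; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First thirty bases and last nine bases of the gene sequence above.
print(translate("ATGGCTTTAG TAACGGCATA TTCTTTTGGT"))  # MALVTAYSFG
print(translate("GCTGACTGA"))                         # AD
```

The two outputs match the first ten and the last two residues of the protein sequence in the record.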