Gene Hore_18730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_18730 
Symbol 
ID7312687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2001024 
End bp2002352 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content38% 
IMG OID643612320 
Productcytoplasmic alpha-amylase 
Protein accessionYP_002509617 
Protein GI220932709 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTTT TTAGAAGGCG TCTTTTCATT GTTTTGTTCA GTCTGCTTCT GTTGATTTCA 
GTTATTACTT CTGCAAGGGC CGGGGTGTTG ATGCAAGGGT TTTACTGGGA TACTCCATAT
CAGGGAGAGT GGTATGACCA TATAGCCTCC AAAGCTGAAG AACTTTCAAA TGCTGGATTT
ACTGCAATAT GGTTTCCTTC GCCATGTAAA GGTGATAGTG GCGGTTATTC CATGGGTTAT
GACGTTTTTG ATCATTATGA CCTGGGGAAT TATTACCAGC AGGGGACTAC TGAAACCCGA
TTTGGGAGTA AAAATGAACT GTTAAATGCA ATCAATGCTT ACCACAGTGA AGGAATGCAG
GTGTATGTTG ATACTGTCAT GAACCACATG ATGGGTGGTG AACAGGAATG GAATCCGAAT
ACAAATTCGT ATACATATAC CAGGTTTGAT TACCCCCACG ATACTTTTGA AAAGAATTAT
AAACATTTTC ATCCAAATTA TACCCACCCA GATAATGACC CCCCTTATCA TAGTAAAGAA
TTTGGCGAAG ATGTCTGTTA TTATAATGAC TATAACTATA TGGGGAATGG GTTAAAAAAT
TGGGCAGCCT GGTTAAAGAA TAATATTGGA TTTGATGGGT ATAGATTAGA TTTTGTTAAA
GGTATAGAAC CTGATTATAT TAAATCCTGG AAACAAACTT CTCCAATGAG TAGTAGTTTT
GTCGTGGGTG AATACTGGGA TGGTAACAGG GATACCCTGG ATTGGTGGGC AAATTATACT
GGTTGTCATG TTTTTGACTT TGCATTATTT TACACATTAA AAGATATGTG TAATAGCGAC
GGCTACTATG ATATGAGAGG GCTACAGGAT GCAGGGTTGG TGGAAATAAA CCCTTACAGG
GCGGTAACAT TTGTAGAAAA CCATGATACA GATGAACATG ACCCGGTAAC AAAAAATAAA
TTAATGGCCT ATGCTTATAT TTTAACCCAT GAGGGTTATC CTACAGTATT TTGGAAAGAT
TATTATGTAT ATGATTTAAA GGATGAAATA AATAACCTGG TCTGGATACA TGAGAACCTG
GCCTCAGGAA CTACCAGTAA TCTTTACGCT GATGATAGTT TGTATATTGC CCAACGAAAT
GGTAATCCCG GACTTGTGGT CGGGCTCAAC GATAGTTCCA GTTGGAAGAG TAAATGGGTT
CAAACTAAAT GGAGTAATGT TACTTTACAT GACTATACCG GACAGGCCGG AGATGTATAT
GTGGATAGTA ACGGCTGGGT AGAAATTTCA ATACCACCAA AAGGATATAG TGTCTACTCT
CCATATTAA
 
Protein sequence
MKVFRRRLFI VLFSLLLLIS VITSARAGVL MQGFYWDTPY QGEWYDHIAS KAEELSNAGF 
TAIWFPSPCK GDSGGYSMGY DVFDHYDLGN YYQQGTTETR FGSKNELLNA INAYHSEGMQ
VYVDTVMNHM MGGEQEWNPN TNSYTYTRFD YPHDTFEKNY KHFHPNYTHP DNDPPYHSKE
FGEDVCYYND YNYMGNGLKN WAAWLKNNIG FDGYRLDFVK GIEPDYIKSW KQTSPMSSSF
VVGEYWDGNR DTLDWWANYT GCHVFDFALF YTLKDMCNSD GYYDMRGLQD AGLVEINPYR
AVTFVENHDT DEHDPVTKNK LMAYAYILTH EGYPTVFWKD YYVYDLKDEI NNLVWIHENL
ASGTTSNLYA DDSLYIAQRN GNPGLVVGLN DSSSWKSKWV QTKWSNVTLH DYTGQAGDVY
VDSNGWVEIS IPPKGYSVYS PY