Gene Hore_18210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_18210 
Symbol 
ID7313819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1943539 
End bp1945029 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content45% 
IMG OID643612268 
ProductSucrose-phosphate synthase 
Protein accessionYP_002509565 
Protein GI220932657 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR02472] sucrose-phosphate synthase, putative, glycosyltransferase domain 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.338675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGTA TTAAACATGT AGCTTTTTTA AATCCCCAGG GTAATTTTGA CCCCGCTGAC 
AGTTACTGGA CAGAACACCC TGATTTCGGT GGGCAGCTGG TTTATGTCAA GGAAGTATCG
TTAGCCCTGG CCGAGATGGG AGTCCAGGTT GATATAATAA CCCGGCGTAT TAAGGATGAA
AACTGGCCTG AATTTTCCGG AGAAATCGAT TATTATCAGG AAACTAATAA AGTAAGGATT
GTCAGAATAC CCTTTGGTGG GGATAAATTC CTGCCCAAGG AGGAGCTCTG GCCCTATTTA
CATGAGTATG TGAATAAGAT AATTAATTTT TACCGGGAAG AAGGAAAGTT TCCCCAGGTG
GTAACAACCC ATTACGGTGA TGGGGGACTG GCCGGTGTTT TATTAAAGAA TATTAAAGGA
CTTCCCTTTA CCTTTACCGG CCACTCACTG GGGGCCCAGA AGATGGAGAA ACTCAATGTT
AATACTTCCA ACTTTAAGGA AATGGATGAA CGCTTTAAAT TTCACCGGAG GATTATAGCC
GAGCGGCTGA CCATGTCCTA TGCGGACAAA ATTATTGTTA GCACCTCCCA GGAACGATTC
GGTCAATACA GTCATGACCT TTATCGGGGG GCAGTTAATG TAGAGGATGA TGATAAATTC
TCAGTCATTC CCCCCGGTGT AAATACCAGG GTCTTCGATG GAGAATATGG AGATAAGATT
AAAGCAAAGA TCACCAAGTA CTTAGAGCGA GATCTCGGTT CAGAACGGAT GGAATTACCG
GCCATAATAG CTTCAAGTCG CCTTGATCAA AAGAAAAACC ATTACGGTCT GGTCGAGGCC
TATGTCCAAA ATAAAGAACT CCAGGATAAA GCCAATCTGG TTCTAACCCT GCGCGGTATT
GAAAACCCCT TTGAAGATTA TTCCAGAGCT GGACAAGAAG AGAAGGAGAT TCTCGGTAAG
ATAATTGAGT TGATTGATAA CAATGACTGT CGCGGTAAGG TCAGTATGTT CCCCTTAAAC
AGTCAGCAGG AGCTGGCCGG ATGTTATGCC TACCTGGCCT CAAAGGGATC TGTATTTGCC
CTGACTTCCT TTTATGAACC CTTTGGCCTG GCCCCGGTTG AGGCCATGGC TTCAGGCCTA
CCGGCTGTTG TAACCAGAAA TGGTGGACCG GCTGAAATTC TGGATGGAGG AAAATATGGT
GTTCTGGTTG ACCCTGAAGA TCCTGAAGAT ATTGCCCGGG GCCTGTTAAA AGCCTTTGAG
AGTGAAGAGA CATGGTCCGC CTATCAGGAA AAAGGCAAGC AACGGGTTGA GGAACGTTAC
ACGTGGCAGG AGACAGCCCG GGGTTATCTG GAGGTTATTC AGGAAATCGC TGATCGTAAG
GATGAAGAGG ATGAAGGCGG AAGTCTGAAT ATACCGGATT ATTTTACTAA CCCCGGGGCC
AGTAATGATG AAAAATTGCT TGACACTTTT AACAAACTCT GGAAGGAGTA A
 
Protein sequence
MTRIKHVAFL NPQGNFDPAD SYWTEHPDFG GQLVYVKEVS LALAEMGVQV DIITRRIKDE 
NWPEFSGEID YYQETNKVRI VRIPFGGDKF LPKEELWPYL HEYVNKIINF YREEGKFPQV
VTTHYGDGGL AGVLLKNIKG LPFTFTGHSL GAQKMEKLNV NTSNFKEMDE RFKFHRRIIA
ERLTMSYADK IIVSTSQERF GQYSHDLYRG AVNVEDDDKF SVIPPGVNTR VFDGEYGDKI
KAKITKYLER DLGSERMELP AIIASSRLDQ KKNHYGLVEA YVQNKELQDK ANLVLTLRGI
ENPFEDYSRA GQEEKEILGK IIELIDNNDC RGKVSMFPLN SQQELAGCYA YLASKGSVFA
LTSFYEPFGL APVEAMASGL PAVVTRNGGP AEILDGGKYG VLVDPEDPED IARGLLKAFE
SEETWSAYQE KGKQRVEERY TWQETARGYL EVIQEIADRK DEEDEGGSLN IPDYFTNPGA
SNDEKLLDTF NKLWKE