Gene Hhal_1120 details

Gene Information

Locus tag: Hhal_1120
Symbol:
ID: 4710074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1216041
End bp: 1217411
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 66%
IMG OID: 639855592
Product: Alpha,alpha-trehalose-phosphate synthase (UDP-forming)
Protein accession: YP_001002698
Protein GI: 121997911
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0380] Trehalose-6-phosphate synthase
TIGRFAM ID: [TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming]
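
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; variable names are hypothetical.
start_bp = 1216041
end_bp = 1217411

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1371
print(protein_length)  # 456
```

Both results agree with the Gene Length and Protein Length fields of the record.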


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.541755
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTCGAC TGGTAACCGT TTCCAATCGT GTGGCCTTGC CCAGCCAGCT GCAGGCCGCG 
CAGGGTGGTC TGGCCGTCGG CCTGCGTTCG GCGCTGGAGG AGTCCGGTGG CATGTGGTTC
GGTTGGGACG GCGGCGTGGA CGAACGCATC GACGGGTTGC GCCAGCCCCG GGTCCAGACC
GCCAACGGGG TGCGCTACGC CACCCTGCGC CTGAGTCGCC TCGAGTACGA TCGCTACTAC
CTGGGCTACG CCAACCAGGT GCTCTGGCCG CTGTTCCACT ACCGCATGTC CTTCGTCCAC
TGCCGGCGCG AGCGCATCGA GGGCTACTGG GAGGTCAACC GGCTGTTTGC CGAGCATCTG
CCACCGTTGC TCGAGGGCGA CGAGATCATC TGGGTCCACG ACTACCACTT CATTCCTCTC
GGGCAACTGT TGCGCGAGCA GGGTGTCGAG GCCCCCATTG GTTTCTTCCT GCACACCCCC
TTCCCGCCCT GGGACGTCTT CCGCGCCCTG CCCGGTCACG AGCCACTGCT CGAGGCGCTG
TGCCGGTACG ATCTGGTCGG GTTCCAGACG CGCATCGACC GGGACAACTT CCTCGATTGC
CTGACCCACT ACCGCCCGCA GCTGCAGCGC CCACGGGCCG AGGTGTTCCC CATCAGCATC
GATGTCGATC AGGTGGCCCG GGAGGCGCAG CGGGGCTACA ACTCCCAGCA GGGGCGGCGG
CTGCAGCAGA GCCTGCGCGA CCGTCGGTTG ATGATCGGCG TCGACCGGCT CGATTACAGC
AAGGGCCTGC GCAACCGGTT CGAGGCCTAC GAGGCGCTGC TCGAGCAGCA CAGTGAGCAC
CGCGGGGACG TGGTCTTCCT GCAGATCGCC CCGGTCTCCC GTGGCGATGT ACCCGAGTAC
GAGGAGATCC GCCAATACCT GGAGTACCTG GCTGGCCACA TCAACGGTCG TTTCGCCGAG
TACGACTGGG TGCCGCTGCG TTACCTCAAT CGCGGTTTCC ACCGTTCGAA TATCCTCGGC
TTCCTGGCGC GTAGCGACGT CGGGCTGATC ACCCCCATGC GTGACGGCAT GAATCTGGTG
GCCAAGGAGT TTGTCGCCGC CCAGGATCCC GGCGATCCGG GGGCGCTGGT GCTGTCGCGC
TACGCTGGCG CTGCCGAAGA GCTCGATGGC GCGGTGCTGG TCAATCCCTA CGACGTGGAT
CAGATGGTTG ATGCCATGCA CCAGGCGCTG ACCATGCCGC TGGGGGAGCG GCGCGAGCGC
TGGCAGCAGA TGATGGACGC GCTACGCCGA CAGGACGTGC ATCGCTGGCG GAAGGATTTC
ATCCAGGCCC TGCACGATGC CCACCGCGCA CGGGGTTCGG AGGCGCTGTG A
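
The GC content field of the record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases in the sequence; the function name `gc_content` is hypothetical, and how the database rounds its reported value is not stated in the record.

```python
def gc_content(dna):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base block spacing and line breaks used in the
    record) is ignored. Assumes the input contains at least one base.
    """
    seq = "".join(dna.split()).upper()
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First 10-base block of the gene sequence, as a small worked example.
print(gc_content("GTGAGTCGAC"))  # 60.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field.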
 
Protein sequence
MSRLVTVSNR VALPSQLQAA QGGLAVGLRS ALEESGGMWF GWDGGVDERI DGLRQPRVQT 
ANGVRYATLR LSRLEYDRYY LGYANQVLWP LFHYRMSFVH CRRERIEGYW EVNRLFAEHL
PPLLEGDEII WVHDYHFIPL GQLLREQGVE APIGFFLHTP FPPWDVFRAL PGHEPLLEAL
CRYDLVGFQT RIDRDNFLDC LTHYRPQLQR PRAEVFPISI DVDQVAREAQ RGYNSQQGRR
LQQSLRDRRL MIGVDRLDYS KGLRNRFEAY EALLEQHSEH RGDVVFLQIA PVSRGDVPEY
EEIRQYLEYL AGHINGRFAE YDWVPLRYLN RGFHRSNILG FLARSDVGLI TPMRDGMNLV
AKEFVAAQDP GDPGALVLSR YAGAAEELDG AVLVNPYDVD QMVDAMHQAL TMPLGERRER
WQQMMDALRR QDVHRWRKDF IQALHDAHRA RGSEAL
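
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes NCBI translation table 11 (as given in the record), in which codon assignments match the standard code and several alternative start codons, including the GTG that opens this gene, are read as methionine at the first position. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical helpers, not part of any database API.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in T, C, A, G order per position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons allowed by table 11; at position 0 they encode Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codon read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
print(translate_cds("GTGAGTCGAC TGGTAACCGT TTCCAATCGT"))  # MSRLVTVSNR
```

The output matches the first ten residues of the protein sequence listed in the record. The sketch handles only unambiguous A, C, G and T bases.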