Gene Hhal_1108 details


Gene Information

Locus tag: Hhal_1108
Symbol:
ID: 4710054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1204329
End bp: 1205822
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 71%
IMG OID: 639855580
Product: 4-alpha-glucanotransferase
Protein accession: YP_001002686
Protein GI: 121997899
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1640] 4-alpha-glucanotransferase
TIGRFAM ID: [TIGR00217] 4-alpha-glucanotransferase
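
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes the start and end positions are inclusive, 1-based coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1204329, 1205822)
print(length, protein_length(length))  # 1494 497
```

Both values match the Gene Length and Protein Length fields of this record.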


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.204434
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGGCG CGGGACTGTC CGAGCGGCGC CGCGCCGGCG TGCTGCTGCA CGTCAGCGCA 
CTGCCCGGGC CCGGGGGCAA CGGCGACCTG GGGCACCACG CCTACCGCTT CGTGGACTGG
CTCGCCGAGG CCGGCTTCAC GATCTGGCAG ATGCTGCCGC TGGGCCCCAC CCACGACGAC
CTCAGCCCCT ATCAAGGCCT GTCGGTGCAC GCCGGTGAGC CGTGCTACAT CGACCTGCAC
ACGCTGGTCG AACTCGGCTG GTTGACCCCC GAGGAGATCC AGCCCCCGGA GACCGGCGAC
GACCCCGTCG CGCTGCGGGC GTGGCGCCGC TCCTGCCTTG GCCGCGCCCG ACAACGCCTG
CGCGATCGCA ACGACGCAAC GACCGAGGCT CAGATCGCCG CCTTCCGCCA GGCGCACGGC
CACTGGCTGG AAGACTACGC CCTCTACGCG GCGCTGCGCG AGGAGCATGA CCTCCTGCCC
TGGTGGCAGT GGCCCACCGC CGAACGCGAT CGGGAACCGG CGGCGCTGGA GGCGGCAGCC
ACCCGCCTGG CCGACCGGAT CGACCAGCAG GTCTTCGAGC AGTACCTGTT CTTCACCCAG
TGGCAGGCGC TGCGCCACTA CGCCGCGGAA CGCGGCGTCC GGTTCTTCGG CGACATCCCC
ATCTTCGTCG CCCACGACAG CGCCGACACC TGGGCCCGGC GGGCCTGCTT CCGACTCGAC
AGCGAGGGGC AGGCAGCGGT GGTGGCCGGC GTCCCACCGG ACTACTTCTC GGCCGAGGGG
CAGCGCTGGG GCAACCCCCT CTACGACTGG CAGCAGCTGC AGGCCGATGG CTTCGGCTGG
TGGCTCGAGC GCCTGGCCAC ACAGTTGGCA CTGTTCGACT TCGTGCGCAT CGACCACTTC
CGTGGCCTGA GCGCCTGCTG GACCATCCCC GCCGAGGCCC CCACGGCCCG GGACGGGTAC
TGGGAAGCGA CCCCCGGAGA CGCTCTGCTC GAGGCCGTTC AGGAACGCTT CGGGCGGGTC
CCGCTGGTCG CCGAGGACCT AGGGGTGATC ACCGAGGACG TGGAGCGCCT GCGCGACCGC
TTCGCCCTGC CGGGGATGAA GGTCCTCCAC TTCGCCTTCG ACAGCGACGC CGCCAACCCC
TACCTGCCGC ACCACCACCA CCGCCACAGT GTGGTCTACA CCGGTACCCA CGATAACAAC
ACCACCGTGG GATGGTACGC CGGTCTCGCG CCGGAGACCG TGGAGCGGGT CCACGCGTAC
CTGGGCTACC CGACCGAGCC GATGCCCTGG CCCCTGACTC GAGCCGCCCT GGCCTCGGTG
GCGAGCGTGG CCGTCATCCC CCTACAGGAC CTGCTCGAGC TGGACGGCGA ACACCGCATG
AACGTCCCCG GCACCACCGA AGGCAACTGG CGCTGGCGCT TTGCTTGGGA GTGGCTGCCC
GACTCCCTGG CCGGGCAGCT GTACGACCTC AACCGGCTCT ACGGCCGGCT CTAG
 
Protein sequence
MSGAGLSERR RAGVLLHVSA LPGPGGNGDL GHHAYRFVDW LAEAGFTIWQ MLPLGPTHDD 
LSPYQGLSVH AGEPCYIDLH TLVELGWLTP EEIQPPETGD DPVALRAWRR SCLGRARQRL
RDRNDATTEA QIAAFRQAHG HWLEDYALYA ALREEHDLLP WWQWPTAERD REPAALEAAA
TRLADRIDQQ VFEQYLFFTQ WQALRHYAAE RGVRFFGDIP IFVAHDSADT WARRACFRLD
SEGQAAVVAG VPPDYFSAEG QRWGNPLYDW QQLQADGFGW WLERLATQLA LFDFVRIDHF
RGLSACWTIP AEAPTARDGY WEATPGDALL EAVQERFGRV PLVAEDLGVI TEDVERLRDR
FALPGMKVLH FAFDSDAANP YLPHHHHRHS VVYTGTHDNN TTVGWYAGLA PETVERVHAY
LGYPTEPMPW PLTRAALASV ASVAVIPLQD LLELDGEHRM NVPGTTEGNW RWRFAWEWLP
DSLAGQLYDL NRLYGRL