Gene Haur_2545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2545 
Symbol 
ID5734423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3270955 
End bp3273312 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content48% 
IMG OID641279685 
ProductAlpha-glucosidase 
Protein accessionYP_001545311 
Protein GI159899064 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000167108 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACATT CATCAATGTT TACACTTGAT CAACGTGATG CCCGTAGCTG TCTGTTTAAA 
CGAGCAAATC AGCAAATTCG CATCAGCATC TGTAGCGAAC AGCTTATTCG GGTTTGTATT
AGTAATGATG CACCATTAGC ACGGCGTTCA TGGGCGGTTA ATCGCGCTGA TGATGCATGG
GAGGCACTTG CATGGTCGGT TTCAGAAAAT TCAGATTCAA CGAGGATTGA GACGACAAAG
CTACAAGTAA ATGTTGCACA TGCTGATGGC CGAATTACAT TTAACAATCT GGAGCAGCAG
CCATTTTTTA GTGACGTTAC GCCAGCAAGC TACAACGCCG ATGGATGGGT TGTTCGTAAG
CAAATTTATA ATTCTGAGCA TTTTTATGGT TTTGGTGAAC GCACTGGCTG GCTTGAAAAA
ACAGGTCAGC ATTTTTTAAA TTGGACGCTC GATCCTGAAC CACATCATAG CCCGCGTATT
GATAATATGT ATGCAACAAT GCCAGTTTTT ATGGGATTGC AGCCTAATCT CTGCTATGGC
GTGTTTTTCA ATACATCATT TCGCTCAAGT ATTGATGTTG GGGCGGCTGA TGCTGCATTG
TTGAGCTTAA AAACCCAAGG CCCAGATCTC GATTATTATG TGGTTTTGGG TACAACACCT
GCCGAAATTA CCGCTACTTG GCGTGAATTA TTGGGAGCAA TGCCTTTACC GGCCTATTGG
GCGCTTGGTT ATCATCAAAG TCGCTGGGGC TACGATTCAA GCATGACGAT GCAGGCAATT
GCTGATGAAT TACGTGCTCG CAATATTCCG TGCGATGCGA TTCATTTTGA TATTGATTAT
ATGGATGGTT ATCGGGTTTT TACTTGGCAT CCTGAACGTT TTGCCCAGCC AGCTCAATTG
TTGCAAAATT TGGCTCGTGA TGGCTTCAAT GTGGTAACAA TCATTGATCC TGGGGTCAAA
ACTGACCCAA ATTATGCAGT ATTTGCCGAA GGAATCGCCA ACGATTATTT TATCAAGCGG
GCTGATGGAA CGTTATTCAG TGGTTATGTT TGGCCTGATG ATAGCGCATT TGCTGATTTT
ACCCGTGCTG ATGTACGTGA ATGGTGGGGA AATTTACATA AGAAATTGAT CGATGCTGGG
GTACGCGGCA TTTGGGATGA TATGAATGAA CCAACCGTGT TTGACCGACC TTTTAGCGAA
GGTGGTGGCA ATGGTGGTAC GATCGATCTG AATGCGCCGC AAGGATCTGC CGATGAGCGT
ACAACTCACG CCGAAGTACA TAATTTGTAT GGTTTGTTGA TGGCTCGCTC AACTTATGAA
GGCTTGCGAC AATTGCGCCC TAATGAACGA CCATTTGTAT TAACTCGCTC AGGTTTTGCT
GGTTTATCAC GATGGGCGAC TCTCTGGACT GGTGATAATT CGGCGTTGTG GGAACATTTA
GAAATGATGT TGCCGCAAAT TGCTAACTTG GGGCTTTCAG GAATTCCCTT TGTTGGCGTG
GATATTGGTG GATTTTTTGG CAATGCGTCG CCAGAATTAT GGGCACGTTG GGTTCAAGTT
GGGGCATTTC TGCCGTTCTG TCGTGGGCAC TCGTGTTCGG GCACACGTCC GGCTGAGCCG
TGGGCGTTTG GCGAACGCAC CGAAGCAATT GCGCGGGCCT ACCTTAGTCT GCGCTATCGT
TTATTGCCCT ACTTGTATAC GTTGTTTTAT CAAGCTTCAA CCACAGGTGC GCCAATTATT
CGTCCATTGG TGTATGAATT TGCGGCTGAT CCGACCACTC ACGCCTTGCA CGATCAGGTG
TTGTGTGGCT CGCAATTAAT GCTTGCGCCG ATTGTACGGC CTGGGACTGA ATATCGTTCG
GTTTATTTGC CCGCTGGCGA GTGGTACGAT TGGTGGACGG GTGAGCGGAT CAAGGGTTCG
CAGCATATTT TGGTGCATGC GCCGCTTGAA CGGTTACCGC TGTATGTGCG TGGTGGGGCG
ATTTTGACTC TCGGCCCAGT ACTCAACTAC ACCAGCGAAG CCCCACTTGA TCCTTTAACC
CTCGATGTTT ACCCCAGTGG CACAAGCGAA TGGACGCTCT ACGAGGACGA TGGCATCTCG
TTCGATTACG AACAGGGCCA AGCAGCGACC ACAACGTTTA GCTGTGTTGA AACTGAGCAA
ACAATTACGT TGATGATTGC CGCCCGCCAA GGTAGTTGGC AACCTGCCCT GCGCACAATC
GTGGTCAATC TGCATTCGCT GCCGCCCAAA GCGGTTTTGT TTGATACAAA TGCAATCGAA
TGGGTCTACG CTGAAGGCGC AACGACCGTG AGTTTTGCTG ATGATGGCTT GGCACACACG
CTTGAGGTGC AGTTGTAA
 
Protein sequence
MEHSSMFTLD QRDARSCLFK RANQQIRISI CSEQLIRVCI SNDAPLARRS WAVNRADDAW 
EALAWSVSEN SDSTRIETTK LQVNVAHADG RITFNNLEQQ PFFSDVTPAS YNADGWVVRK
QIYNSEHFYG FGERTGWLEK TGQHFLNWTL DPEPHHSPRI DNMYATMPVF MGLQPNLCYG
VFFNTSFRSS IDVGAADAAL LSLKTQGPDL DYYVVLGTTP AEITATWREL LGAMPLPAYW
ALGYHQSRWG YDSSMTMQAI ADELRARNIP CDAIHFDIDY MDGYRVFTWH PERFAQPAQL
LQNLARDGFN VVTIIDPGVK TDPNYAVFAE GIANDYFIKR ADGTLFSGYV WPDDSAFADF
TRADVREWWG NLHKKLIDAG VRGIWDDMNE PTVFDRPFSE GGGNGGTIDL NAPQGSADER
TTHAEVHNLY GLLMARSTYE GLRQLRPNER PFVLTRSGFA GLSRWATLWT GDNSALWEHL
EMMLPQIANL GLSGIPFVGV DIGGFFGNAS PELWARWVQV GAFLPFCRGH SCSGTRPAEP
WAFGERTEAI ARAYLSLRYR LLPYLYTLFY QASTTGAPII RPLVYEFAAD PTTHALHDQV
LCGSQLMLAP IVRPGTEYRS VYLPAGEWYD WWTGERIKGS QHILVHAPLE RLPLYVRGGA
ILTLGPVLNY TSEAPLDPLT LDVYPSGTSE WTLYEDDGIS FDYEQGQAAT TTFSCVETEQ
TITLMIAARQ GSWQPALRTI VVNLHSLPPK AVLFDTNAIE WVYAEGATTV SFADDGLAHT
LEVQL