Gene Haur_1246 details

Gene Information

Locus tag: Haur_1246
Symbol:
ID: 5733124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1452499
End bp: 1454823
Gene Length: 2325 bp
Protein Length: 774 aa
Translation table: 11
GC content: 52%
IMG OID: 641278386
Product: alpha-xylosidase YicI
Protein accession: YP_001544022
Protein GI: 159897775
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
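
The coordinates and lengths listed in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1452499, 1454823)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2325 774
```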


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.758952
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTA CTGATGGATA TTGGATGATG CGTGAGGGCG TTAATGCGCT CTTTGCATCA 
CAGGCCTACG ATGGCCAAAT AAACGACACA ACTCTGACGG TCTATGCGCC AGGCAAGCGG
ATCAATCATC GAGGCGATAC ACTGAACCTA GGCACAATTA CCGCCCGTTT TTCCTCGCCC
ATGCCCGATG TGGTGCGGGT CAAACTTACC CATTTTGAAG GCCAACGTTC ACTTGGCCCC
AATTTTGCAA TTGCTGAATC AGCCAATGAG AGCGTGAGCA CCAGCGAAGA TGAGCAAGAA
TTTCGCTTGA CCAGCGGCAA ATTGAGCGTG CAAATCCCCA AAACTGGCGA TTGGAGTATC
AATTTTTGGG CTGAGAATCG CCGCATTACC AGCAGTGGCC GCAAAGGCAT TGGCTACATT
AGCATGGCCG ATGCTGGCGA ATTTATGCAC GAGCAATTAT CGCTTGGCGT TGGCGAGCAA
GTGTATGGCT TGGGCGAACG CTTTACCGCC TTCGTCAAAA ATGGCCAATC AGTCGATGTT
TGGAATCAAG ATGGTGGCAC GGGCAGCGAG CAAGCCTACA AAAACGTACC ATTCTATCTG
ACCAATCGTG GCTATGGCGT GTTCGTCAAT CAGCCCGAAA ACGTGGCCTT TGAAATTGCC
TCGGAAAAAG TTTCGCGGGT GCAATTTAGT GTGCCAGGCC AAAGCCTCGA ATATTTTGTG
ATCTATGGCC CAACGCCCAA GGAAATTCTG GAAAAACTGA CTGCTTTGAC AGGCCGCCCA
GCTTTGCCGC CAGCATGGTC GTTTGGTTTA TGGCTCACCA CCTCGTTTAC CACCTCGTAT
GATGAGCAAA CTGTTACCAG CTTTATCCAA GGCATGGCCG ACCGCGATTT ACCGTTGCAT
GTCTTCCATT TCGATTGTTT TTGGATGCGC GAATTTCACT GGTGCGATCT TGAATGGGAT
TCACGCACCT TCCCCGATCC TGAGGGCATG CTCAAGCGGC TCAAAGATCG CGGCTTGAAA
ATTTGTGTTT GGATCAATCC ATATATTGCC CAGCGTTCGG CGATGTTCCG CGAAGGTATG
GAGCATGGCT ACTTGGTCAA GAAGCCCAAC GGCGATGTCT GGCAATCGGA TATGTGGCAA
TCGGGCATGG GCTTGGTCGA TTTCACCAAC CCCGCTGCTT GCGCTTGGTA TGCAGCCAAA
CTCAAAGGCC TGCTCGATAT GGGCGTAGAT TGCTTCAAAA CCGACTTTGG CGAACGCATT
CCCACCGATG TAGCCTATTT CGACGGTTCC GACCCCCAGC GGATGCACAA CTACTACACC
CATCTTTACA ACAAAACCGT GTTTGATTTG TTGAAAACTG AGCGCGGCGA AAACGATGCA
GTGGTGTTTG CCCGATCGGC AACTGCTGGC GGCCAACAAT TCCCAGTGCA CTGGGGCGGC
GACTGCGAAT CGACCTTCGA ATCGATGGCT GAAAGTTTAC GCGGCGGTTT ATCGTTAGGG
CTTTCAGGTT TTGGCTTCTG GAGCCACGAT ATTGGCGGCT TCGAGGGCAT GCCACCAGTT
GAAATTTACA AACGCTGGAT TGCCTTTGGC ATGCTTTCAT CGCACAGTCG TTTGCATGGC
AACCATACTT ATCGCGTGCC ATGGATTTAC GATGAAGAAG CCGTCGATGT GCTGCGCTAC
TTCACCAAAC TTAAATCACG TTTGATGCCC TATCTCTATG GGGCTGCTGT GACCGCTTCC
ACCAGTGGCA TTCCAGTGAT GCGGGCCATG TTGCTAGAAT TCCCCAACGA TCCGACCTGT
GATTTCCTTG ATCGTCAATA TATGTTGGGC GATTCGTTGT TGGTTGCGCC AGTATTCGCC
TACGACAACA CGGTGACCTA CTATGTGCCT GCTGGCCGCT GGACGCACAT TACGACTGGT
GCGGTGGTTG AAGGCCCACG TTGGGTCACT GAAACCCACG ACATGCTGAG TTTGCCATTA
TTGGCTCGCC CCAACAGCTT GATTGCGATT GGTAACAACA GCGAGCGCCC CGATTACGAC
TATAGCAGCG GTGTGACCCT GAATTTATAC CAATTGGGCG ATGGTCAAGC GGCCTACACC
ATGGTTCCGG CAACCAATGG CGATATTGCC GCCTCGTGGA GTGCCCGCCG CGATGGCGAC
ACGATCAGAA TCGTGCAAGA AGGCCAAGCT AACGATTGGC AAGTGGTGTT GGTTGGCGTG
CAACAAGTTG CCAGCGCCGA TGGCGCTTTA GTCGAGCAAC ATCCGCTCGG TGTCCAACTC
ACCGCACTTG ATCAAGCCAC CAAGTTGGTG GTCAAGTTGA AGTAG
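
The GC content reported for this gene can be recomputed from a sequence pasted in the layout used here. This is a minimal sketch, assuming the blocks of ten bases are separated only by spaces and line breaks. `gc_percent` is a hypothetical helper, not a function of the source database.

```python
def gc_percent(raw_sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(raw_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First and last blocks of the gene sequence, used only as a small demo;
# paste the full sequence to compare against the reported value.
demo = "ATGAAATTTA AGTAG"
print(f"{gc_percent(demo):.1f}% GC in the demo fragment")
```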
 
Protein sequence
MKFTDGYWMM REGVNALFAS QAYDGQINDT TLTVYAPGKR INHRGDTLNL GTITARFSSP 
MPDVVRVKLT HFEGQRSLGP NFAIAESANE SVSTSEDEQE FRLTSGKLSV QIPKTGDWSI
NFWAENRRIT SSGRKGIGYI SMADAGEFMH EQLSLGVGEQ VYGLGERFTA FVKNGQSVDV
WNQDGGTGSE QAYKNVPFYL TNRGYGVFVN QPENVAFEIA SEKVSRVQFS VPGQSLEYFV
IYGPTPKEIL EKLTALTGRP ALPPAWSFGL WLTTSFTTSY DEQTVTSFIQ GMADRDLPLH
VFHFDCFWMR EFHWCDLEWD SRTFPDPEGM LKRLKDRGLK ICVWINPYIA QRSAMFREGM
EHGYLVKKPN GDVWQSDMWQ SGMGLVDFTN PAACAWYAAK LKGLLDMGVD CFKTDFGERI
PTDVAYFDGS DPQRMHNYYT HLYNKTVFDL LKTERGENDA VVFARSATAG GQQFPVHWGG
DCESTFESMA ESLRGGLSLG LSGFGFWSHD IGGFEGMPPV EIYKRWIAFG MLSSHSRLHG
NHTYRVPWIY DEEAVDVLRY FTKLKSRLMP YLYGAAVTAS TSGIPVMRAM LLEFPNDPTC
DFLDRQYMLG DSLLVAPVFA YDNTVTYYVP AGRWTHITTG AVVEGPRWVT ETHDMLSLPL
LARPNSLIAI GNNSERPDYD YSSGVTLNLY QLGDGQAAYT MVPATNGDIA ASWSARRDGD
TIRIVQEGQA NDWQVVLVGV QQVASADGAL VEQHPLGVQL TALDQATKLV VKLK
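
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this, assuming that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The compact table string follows the NCBI genetic-code layout with bases ordered T, C, A, G, and `translate` is an illustrative helper rather than anything provided by the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(raw_sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAATTTA CTGATGGATA TTGGATGATG"))  # MKFTDGYWMM
```

Passing the full gene sequence should give a string that can be compared directly against the protein sequence once the spaces are removed from both.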