Gene Haur_2401 details

Gene Information

Locus tag: Haur_2401
Symbol:
ID: 5734282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3059378
End bp: 3060736
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 52%
IMG OID: 641279542
Product: Beta-glucosidase
Protein accession: YP_001545169
Protein GI: 159898922
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
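
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(3059378, 3060736)
print(length)                     # 1359
print(protein_length_aa(length))  # 452
```

Both results agree with the Gene Length and Protein Length fields of the record.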


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00110187
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCAGC GTGCCTTTCC TTCCGATTTC CTCTGGGGTT CCGCCACATC GTCGTATCAA 
ATTGAAGGCG CTGCGTTTGC CGATGGCCGT AGCGAATCGA TTTGGGATCG TTTTTGTAAA
CAACCAGGCG CTATTTTAGA TCAATCCAAT GGTGATATTG CTTGCGATCA TTACAATCGC
TATCGTGATG ATGTTGCTTT GATGGCGCGT TTGGGTCTAC AAGCCTATCG ATTTTCGGTG
GCTTGGCCAC GCGTGTTGCC CAATGGACGT GGTGCGGTCA ATCAGGCTGG CCTTGATTTT
TATCGGCGCT TGGTCGATGA ACTACTGCAA CACAACATTC GGCCATTTGT AACGCTCTAC
CACTGGGATT TGCCGCAAAT TCTCGAAGAT GCTGGCGGTT GGCCTGAGCG GGCTACGGCT
GAAGCTTTTG TTGAATATGC CGATGCGGTC AGCCGTGCCT TGGGCGATAC TGTCAAAGAT
TGGATTACTC ACAACGAGCC ATGGTGTGCA GGGTTGCTCG GCTACCAAAT TGGCGAACAT
GCTCCAGGCC GAAAAAATTG GAATGACGGC TTGAAGGCCA GCCATCATCT TTTGCTTTCG
CACGGCTGGG CGGTTGATGT AATTCGGCGC AACGTACCAC AGGCTAGCGT TGGGATCACG
CTGAACTTTA CGCCAGCGAT GCCTGCTTCA CGTAGTACCG AAGATTTGAA TGCAACCCGC
CACTTTGATG GTTTCTTCAA TCGCTGGTTC CTCGACCCAG TGTATGGCCG CGAATATCCT
GCCGATATGG TGCGCGATTA CACTGAACTT GGCTACTTGC CTAATGGCCT CGATTTTGTT
CACGATGGCG ACTTCAAGGC CATGGCCGCA ACTACCGATT TCTTGGGTGT AAACTACTAC
AGTCGGGCGG TGATTCACGA TCCCAAAACT GGAACCGCGC CCAAACTAGA TTCCGAATAT
ACCGATATTG GCTGGGAAGT GTACCCTCAA GGCTTAGGCG ATTTGCTCAA GCGCTTGGCC
TTTGCCTACA ACCCAGGCAA AATTTATGTG ACTGAAAATG GTGCTAGCTA CAACGATGGC
CCTGACGCGC ATGGCGAAGT CAATGATACC CGTCGCACCC AATATTTGCA CGATCACCTG
AGCGTTTGCT CCGATGCAAT TGCTGCTGGG GTGCCATTGG CGGGCTACTT TGTCTGGTCA
TTGATGGATA ACTTTGAGTG GGCCAAGGGC TATAGCCAAC GCTTTGGCGT GATTTGGGTC
GATTACGAAA CCCAACAACG CATCCCCAAG GCTAGCGCTC ATTGGTATAG CCGTGTGGTC
AAGGCCAACG CCGTCCAACC ATTGGAAGTT TTAGCCTAA
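
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any computation such as the GC content reported in the record. The following is a minimal Python sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch assumes the sequence contains only the letters A, C, G, and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, as printed in the record.
first_line = "ATGACTCAGC GTGCCTTTCC TTCCGATTTC CTCTGGGGTT CCGCCACATC GTCGTATCAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Pasting all lines of the gene sequence into `clean_sequence` gives the full-length input for comparison against the GC content field.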
 
Protein sequence
MTQRAFPSDF LWGSATSSYQ IEGAAFADGR SESIWDRFCK QPGAILDQSN GDIACDHYNR 
YRDDVALMAR LGLQAYRFSV AWPRVLPNGR GAVNQAGLDF YRRLVDELLQ HNIRPFVTLY
HWDLPQILED AGGWPERATA EAFVEYADAV SRALGDTVKD WITHNEPWCA GLLGYQIGEH
APGRKNWNDG LKASHHLLLS HGWAVDVIRR NVPQASVGIT LNFTPAMPAS RSTEDLNATR
HFDGFFNRWF LDPVYGREYP ADMVRDYTEL GYLPNGLDFV HDGDFKAMAA TTDFLGVNYY
SRAVIHDPKT GTAPKLDSEY TDIGWEVYPQ GLGDLLKRLA FAYNPGKIYV TENGASYNDG
PDAHGEVNDT RRTQYLHDHL SVCSDAIAAG VPLAGYFVWS LMDNFEWAKG YSQRFGVIWV
DYETQQRIPK ASAHWYSRVV KANAVQPLEV LA
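
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal Python sketch of that translation, assuming the amino acid assignments of table 11 match the standard genetic code (the two tables differ in their permitted start codons, which this sketch ignores). The names `CODON_TABLE` and `translate` are hypothetical, and no external library is used.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the matching amino acid string
# (assumed here to be the table 11 / standard code assignments; '*' is a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First thirty bases and last nine bases of the gene sequence in this record.
print(translate("ATGACTCAGC GTGCCTTTCC TTCCGATTTC"))  # MTQRAFPSDF
print(translate("TTAGCCTAA"))                         # LA
```

The two outputs match the first ten and last two residues of the protein sequence listed in the record.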