Gene Haur_2422 details


Gene Information

Locus tag: Haur_2422
Symbol:
ID: 5734303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3105018
End bp: 3106442
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 51%
IMG OID: 641279563
Product: Beta-glucosidase
Protein accession: YP_001545190
Protein GI: 159898943
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
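
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
span = gene_span(3105018, 3106442)
print(span)                  # compare with "Gene Length"
print(protein_length(span))  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the reported gene length (1425 bp) and protein length (474 aa).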


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000277167
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACTG TGGAGCAACA TTTTCCTGCT GATTTTATGT GGGGCACAGC CACCTCATCG 
TACCAAATTG AAGGCGCGGT GCATGAAGAT GGCCGAGGCG AATCAATTTG GGATCGATTT
AGCCATACGC CAGGCAAGAC CAAATTTGGC CAAACTGGCG ATATTGCCTG CGATCACTAT
CATCGTTACC CTGAAGATTT AGATTTAATG CGTGAGTTAG GCTTGGGCAG CTATCGTTTT
TCGCTTGCTT GGCCACGACT TTTCCCCGAA GGCAAGGGCA AAATCAACCA AGCTGGGCTA
GATTTTTACA AACGGATTAT CGAGGGCTTG CACCAGCGGC ATCTCACGCC GATGGCCACA
CTGTATCACT GGGATTTGCC CCAAGCCTTA CAAGACAAGG GCGGCTGGAT GAATCGTGAT
ACAGCTTTGC GTTTTGCTGA ATATGCCGAG GCCATGTATC GCCAATTAGG CGAGAGTGTA
CCATTTTGGA TCACCCATAA CGAGCCTTGG GTTGCAGCAT TTGTTGGGCA CTTCCAAGGT
CGTCACGCCC CAGGCATCAA AGATTTGCCA AGCGCAGTCA AAGCCTCGCA CCATCTGCTG
TATTCGCATG GCTTGGCAAC CCAATTGTTC CGCGAAAGCA AGTTAGCGGG CCAAATTGGC
ATCACACTGA ATTTAACCCC AGCCTACCCA ACCCACGACA CCCCCGACGA TCATGCAGCA
GCTTGGCGCA ACGATGGCTA TGGCAATCGC TGGTTTCTCG ACCCCATTTT CCGTGGTAGC
TATCCAGCTG ATACGGTTGA GTGGTTCCAA CAACACCATC AAATTGAAAT GGATTATGTG
CAGACTGGCG ATTTGGCCGT CATTCAACAA CCGATTGATT TCTTAGGCAT CAACTATTAT
TTCCCGAATC GGATTTCGGC TGCCGATGAA AGCAAATTTT TGGCACTCGT TAATAGCCCG
GCAATTGGCG AAACCAGTTT TCGTGGCTGG GAAGTTGTGC CAGCGGCATT TGCTGATTTA
TTGAAGCGGG TGCAGCGCGA TTATGGCAAT ACGCCAATTT ATATCACCGA AAATGGTAGT
GCCTTCGCCG ACCTCAAACG GGCCGCAGAT GGTTCAGTCA ACGACGGCGA TCGCATGAGC
TATTTGCACA CCCATTTGGA AGCAGTGGCC GATGCGATTG CGGCTGGTGT GCCAGTCAAA
GGCTACTATG CTTGGTCGAT GCTCGATAAC TACGAATGGG CCGAAGGCTA CGATGAGCGC
TTTGGCATTA TCGAAGTCGA TTTTGCCACC CAAAAGCGCA CGCCCAAACG AACAGCCCGT
TGGTATCAGC AAATTGTGGC CAATAACGGC TTGCCAAGCT TGCCCGCCGA CGTGCAAGCG
CTAGCCGAAC GCTACCGTAA TTGCCCAATT GGCCCACAAG ATTAA
 
Protein sequence
MTTVEQHFPA DFMWGTATSS YQIEGAVHED GRGESIWDRF SHTPGKTKFG QTGDIACDHY 
HRYPEDLDLM RELGLGSYRF SLAWPRLFPE GKGKINQAGL DFYKRIIEGL HQRHLTPMAT
LYHWDLPQAL QDKGGWMNRD TALRFAEYAE AMYRQLGESV PFWITHNEPW VAAFVGHFQG
RHAPGIKDLP SAVKASHHLL YSHGLATQLF RESKLAGQIG ITLNLTPAYP THDTPDDHAA
AWRNDGYGNR WFLDPIFRGS YPADTVEWFQ QHHQIEMDYV QTGDLAVIQQ PIDFLGINYY
FPNRISAADE SKFLALVNSP AIGETSFRGW EVVPAAFADL LKRVQRDYGN TPIYITENGS
AFADLKRAAD GSVNDGDRMS YLHTHLEAVA DAIAAGVPVK GYYAWSMLDN YEWAEGYDER
FGIIEVDFAT QKRTPKRTAR WYQQIVANNG LPSLPADVQA LAERYRNCPI GPQD
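
The sequences above are printed in blocks of 10 characters, so they need the whitespace stripped before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind behind the "GC content" field; the helper names are hypothetical, and how the database rounds its reported percentage is not known from this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above; paste the full sequence
# to compute the value for the whole gene.
first_line = (
    "ATGACGACTG TGGAGCAACA TTTTCCTGCT GATTTTATGT GGGGCACAGC CACCTCATCG"
)
dna = clean_sequence(first_line)
print(len(dna))
print(f"{gc_fraction(dna):.0%}")
```

Running the same functions on the complete gene sequence gives a figure that can be compared with the reported GC content.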