Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2422 |
Symbol | |
ID | 5734303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3105018 |
End bp | 3106442 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279563 |
Product | Beta-glucosidase |
Protein accession | YP_001545190 |
Protein GI | 159898943 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000277167 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACTG TGGAGCAACA TTTTCCTGCT GATTTTATGT GGGGCACAGC CACCTCATCG TACCAAATTG AAGGCGCGGT GCATGAAGAT GGCCGAGGCG AATCAATTTG GGATCGATTT AGCCATACGC CAGGCAAGAC CAAATTTGGC CAAACTGGCG ATATTGCCTG CGATCACTAT CATCGTTACC CTGAAGATTT AGATTTAATG CGTGAGTTAG GCTTGGGCAG CTATCGTTTT TCGCTTGCTT GGCCACGACT TTTCCCCGAA GGCAAGGGCA AAATCAACCA AGCTGGGCTA GATTTTTACA AACGGATTAT CGAGGGCTTG CACCAGCGGC ATCTCACGCC GATGGCCACA CTGTATCACT GGGATTTGCC CCAAGCCTTA CAAGACAAGG GCGGCTGGAT GAATCGTGAT ACAGCTTTGC GTTTTGCTGA ATATGCCGAG GCCATGTATC GCCAATTAGG CGAGAGTGTA CCATTTTGGA TCACCCATAA CGAGCCTTGG GTTGCAGCAT TTGTTGGGCA CTTCCAAGGT CGTCACGCCC CAGGCATCAA AGATTTGCCA AGCGCAGTCA AAGCCTCGCA CCATCTGCTG TATTCGCATG GCTTGGCAAC CCAATTGTTC CGCGAAAGCA AGTTAGCGGG CCAAATTGGC ATCACACTGA ATTTAACCCC AGCCTACCCA ACCCACGACA CCCCCGACGA TCATGCAGCA GCTTGGCGCA ACGATGGCTA TGGCAATCGC TGGTTTCTCG ACCCCATTTT CCGTGGTAGC TATCCAGCTG ATACGGTTGA GTGGTTCCAA CAACACCATC AAATTGAAAT GGATTATGTG CAGACTGGCG ATTTGGCCGT CATTCAACAA CCGATTGATT TCTTAGGCAT CAACTATTAT TTCCCGAATC GGATTTCGGC TGCCGATGAA AGCAAATTTT TGGCACTCGT TAATAGCCCG GCAATTGGCG AAACCAGTTT TCGTGGCTGG GAAGTTGTGC CAGCGGCATT TGCTGATTTA TTGAAGCGGG TGCAGCGCGA TTATGGCAAT ACGCCAATTT ATATCACCGA AAATGGTAGT GCCTTCGCCG ACCTCAAACG GGCCGCAGAT GGTTCAGTCA ACGACGGCGA TCGCATGAGC TATTTGCACA CCCATTTGGA AGCAGTGGCC GATGCGATTG CGGCTGGTGT GCCAGTCAAA GGCTACTATG CTTGGTCGAT GCTCGATAAC TACGAATGGG CCGAAGGCTA CGATGAGCGC TTTGGCATTA TCGAAGTCGA TTTTGCCACC CAAAAGCGCA CGCCCAAACG AACAGCCCGT TGGTATCAGC AAATTGTGGC CAATAACGGC TTGCCAAGCT TGCCCGCCGA CGTGCAAGCG CTAGCCGAAC GCTACCGTAA TTGCCCAATT GGCCCACAAG ATTAA
|
Protein sequence | MTTVEQHFPA DFMWGTATSS YQIEGAVHED GRGESIWDRF SHTPGKTKFG QTGDIACDHY HRYPEDLDLM RELGLGSYRF SLAWPRLFPE GKGKINQAGL DFYKRIIEGL HQRHLTPMAT LYHWDLPQAL QDKGGWMNRD TALRFAEYAE AMYRQLGESV PFWITHNEPW VAAFVGHFQG RHAPGIKDLP SAVKASHHLL YSHGLATQLF RESKLAGQIG ITLNLTPAYP THDTPDDHAA AWRNDGYGNR WFLDPIFRGS YPADTVEWFQ QHHQIEMDYV QTGDLAVIQQ PIDFLGINYY FPNRISAADE SKFLALVNSP AIGETSFRGW EVVPAAFADL LKRVQRDYGN TPIYITENGS AFADLKRAAD GSVNDGDRMS YLHTHLEAVA DAIAAGVPVK GYYAWSMLDN YEWAEGYDER FGIIEVDFAT QKRTPKRTAR WYQQIVANNG LPSLPADVQA LAERYRNCPI GPQD
|
| |