Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1431 |
Symbol | |
ID | 5733339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1657174 |
End bp | 1659423 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278569 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001544203 |
Protein GI | 159897956 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0291974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCGA GCGATCAACA AATCAACGAC TTGTTGACTC AAATGACGTT GGAAGAAAAA ATTTCGCTGA CGATCGGCCA AGATATGTGG AGCACCCACC CCGTCGAACG CTTGGGGCTT GGCTCGATTA ACATGAACGA TGGCCCACAT GGCTTGCGCA AACCCCCCGA AAATTCCTCA ATTGGCATTA TCGATGCGAT ACCGGCAACC TGTTTTCCCA CCGCTGCTGC TGTTGCCTCA ACGTGGGATG TTGATTTGAC CAAAGCGATT GGCGAGGCGA TTGCCCAAGA ATGCTTAGCC AACAATGTGC AAATTGTACT TGGGCCTGGC ATCAACCTCA AACGCACGCC CTTGGGTGGC CGCAATTTCG AATATTATTC CGAAGATCCG GTTTTGGCTG GCGAGTTGGG CACGGCATTT GTCGAAGGTG TGCAAAGCCA TGGCGTTGGC ACATCGCTCA AACATTATGC CTGCAACAAC CAAGAATTCG AGCGCATGAC GATTAGCTCG GAAGTCGATC AACGTACTTT GCGCGAGTTG TATTTAGCAG CCTTTGAACG TGTGGTCAAA CGAGCGCAAC CTTGGACGAT CATGGCGGCC TATAACAAAA TCAATGGAAT CTATGCGACC GAACATCGCC AACTGCTAAC TGAAATTCTG CGCGAAGAAT GGGGCTTTGA AGGAATTGTC GTTTCCGACT GGGGCGCAGT TAACGATAAG GCTGCCGCAT TAACTGCTGG CCTCGATTTG GAAATGCCTG GCCCAGCGCT TAATCATGTT GAATTTTTGG CTGGTTTGGT ACGCAAGGGA GCGCTCTCAG AAACCGTTAT CGATACTGCT GCCAGTCGCA TGCTCAAGAT TATTCTGCGT GGCATTGCCC AACGCCAGCC CGCAGCCAGC TACGACAAAG CTGCCCATCA TGCTTTGGCT CGCCGTGCTG CCAGCGAATC GATGGTGCTG CTCAAGAACG ATGGTATTTT GCCGTTGCAG CCAACTGCTG GGAGCACAGT CGCGGTGATT GGCAATTTTG CCCAAAAGCC ACGCTATCAG GGTGCTGGTA GCTCGGAAAT TAATGCAACT CAGGTTGATA CGCCACTCGA GGCCTTGCAA ACGTGGCTAA AAAACCAATC GGTTGAAGTT AATTTTGCTG CTGGCTACGA TCACGATGGC AATACCAACG ATCAATTAAT TGCTGAAGCG GTGGCAGCGG CCAAAAACGC TAGCCTGAGC TTAGTTTTGG TTGGTCTACC CGATGCCTAC GAAACTGAAG GCGCTGATCG GGCACACATG AACATGCCAA CTGGACATAA TCAATTGCTT GAGGCAGTAG CTGCGGTTCA AGCCAACACC GTGGCAATTT TGATCAATGG CTCAGCCGTG ACGATTCCAT GGCTTGATCA AGTGCGTGCG GTGCTTGAAG CAGGTTTGGC AGGTCAGGCT GTCGGCAGCG CTTTGGTCGA TGTGCTTTCG GGCGCGGTCA ATCCCAGTGG CAAATTGGCC GAAACCTTCC CTTACGATCT TGCTGATACT CCAGCCTTTT TGAATTATCC AGGCGAGGCG GGAGTGGTGC GCTATGGTGA AAGCCTGTTT ATTGGCTATC GCTACTACGA TGTGCGCAAG GTCAAGCCAT TATTCCCGTT TGGCTATGGC TTATCCTACA CCAGTTTCCG CTATGATCAG ATTGCGCTGA GTGCTGCCAG CATCGATGAA GCTACGCCCT TGACTGTCAG TGTTACCCTG ACCAATACTG GCGAACGGGT TGGCAAAGAA GTTGTGCAAG TGTACGTCAA ACCGAGCAAT TCGGCCTATC TGCGTCCAGT TAAAGAACTA CGGGCGTTTG CTAAAGTTGA ATTGGCTGCT GGCGAGACGA AAACCGTTGA ATTGACCCTC GTTGCCCGCG ATTTCAGCCT GTATGATCAA CAACGAGCGG CTTGGCGTAT GGAAGGCGGC AGCTATCAGA TTTTGGCTGG TGGTTGCAGC GCCGATCTGC CATTAGTGGC TGATCTGACG GTGAATGAAG ACCCACGTTC AGCTCGCAAA GTGCTCACGC GCATGAGTTC CATCAAGGAA TTCTTGGATG ATCCGATTGG CGCTGAAATT TTGCATGCGA CCGCTGGAGC TTTTATCGAA GGCCAAAGCG CTAGCACTCG CGCGATTTTC GAGCCAATTC CATTAGCCAA ATTTGTTAAC TTCGGCTTCT TCGAGGCCAG CCAAGTTGAC GAAATTGTAG CCAAGGTCAA TCAGGGCTAG
|
Protein sequence | MTASDQQIND LLTQMTLEEK ISLTIGQDMW STHPVERLGL GSINMNDGPH GLRKPPENSS IGIIDAIPAT CFPTAAAVAS TWDVDLTKAI GEAIAQECLA NNVQIVLGPG INLKRTPLGG RNFEYYSEDP VLAGELGTAF VEGVQSHGVG TSLKHYACNN QEFERMTISS EVDQRTLREL YLAAFERVVK RAQPWTIMAA YNKINGIYAT EHRQLLTEIL REEWGFEGIV VSDWGAVNDK AAALTAGLDL EMPGPALNHV EFLAGLVRKG ALSETVIDTA ASRMLKIILR GIAQRQPAAS YDKAAHHALA RRAASESMVL LKNDGILPLQ PTAGSTVAVI GNFAQKPRYQ GAGSSEINAT QVDTPLEALQ TWLKNQSVEV NFAAGYDHDG NTNDQLIAEA VAAAKNASLS LVLVGLPDAY ETEGADRAHM NMPTGHNQLL EAVAAVQANT VAILINGSAV TIPWLDQVRA VLEAGLAGQA VGSALVDVLS GAVNPSGKLA ETFPYDLADT PAFLNYPGEA GVVRYGESLF IGYRYYDVRK VKPLFPFGYG LSYTSFRYDQ IALSAASIDE ATPLTVSVTL TNTGERVGKE VVQVYVKPSN SAYLRPVKEL RAFAKVELAA GETKTVELTL VARDFSLYDQ QRAAWRMEGG SYQILAGGCS ADLPLVADLT VNEDPRSARK VLTRMSSIKE FLDDPIGAEI LHATAGAFIE GQSASTRAIF EPIPLAKFVN FGFFEASQVD EIVAKVNQG
|
| |