Gene Information |
Locus tag | Haur_0293 |
Symbol | |
ID | 5732188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 345231 |
End bp | 346766 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277417 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001543073 |
Protein GI | 159896826 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
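The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with one untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 345231
end_bp = 346766

# Assumption: coordinates are 1-based and inclusive, so the span is
# end - start + 1 rather than end - start.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is not spliced and its last codon is a stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1536 511
```

Both results agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields.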
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.240918 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAAG CATTGCACTC GCGTTGGCTT ATTTTGGTTA TGTTGGTTGG TCTCGGCCTT AGCTTGCCCC GTCAGCAATC GGCCAGCGCC AATCCGCAAG CATTTAGCTT TCCCTATAAT CAAGCTTCGG GCATTAACTA CAATGTGACC GATCTCACTC AAGCTTGGAA TGAATGGAAA AGTGCTCAAA TTACCAGCAA CAACGCTGGT GGTGGCGGAC GTTTGCGGGT ACTCGGCGGA GTTAGCAATA GCACCACGGT CTCTGAGGGC CAAGGCTATG GCTTGCTTTT CGCCTCATTA TTCGATGATC AAGCAACCTT CGACGGTTTG TGGCTTTTCA CTCGTGATTA TTTTACCAGC CGCGGCCTGA TGCATTGGCA CATCGGCAAT CCAGGCCAAA TTAATGGCAG CGGCGCAGCA ACCGATGGCG ATGAAGATAT TGCCATGGGA CTGGTCAATG CCTGTATCAA AGTGCAACAA GGAGTTTGGC CCGCTAGCAG CAATGGCCTG AATTATTGCA CGCTTGCCAG CACCATGATC AACAATATTT ATACCTATGA AGTTGATCAT GCTGGCAGCA ACCCACCCGC CGGCTTGCCC AATAACCAAG GCAACGAACT ACTGCCAGGT GATTCATGGT CAACCGCGAC CGAATATACT CAAGGTATTG TTAATCTTTC TTATTTTTCA CCTGGCTATA GCACGGTCTT TGGCAAATTC ACCAACAAAA ATACTGAATG GGCCGCCGTC AACACCCGTA ACTACGCTAT CACCAATTTG GTGCAAGCCA AAGCAGGCAA TTGCTCGAAA CTTGTGCCCA ATTGGAATCA ATATGACGGC GATGTGCAAT ATGTATCGTG GCAACCAGAA GAATCGGCGT GGTGGAGCTA CGATGCAGCG CGGTTTGCAT GGCGCATCGC GATCGACAAA GCGTGGTACA ACACCAGCAA CTCACGCGAA ACCATGAACG AAATTGGTGG TTTTTTCAGC AGCGTGGGCA TTGACAACAT TCAAGCTCGC TATCGGCTTG ACGGCACATC AGTTGATAAT TATCGTGGCG TATTCTTCGT CGCCAACGCA GCGGCGGCAA TTTGGGCGGC TCCAGCTCCG CAAGCAGTCA ATTGTGGTGC GGCAACTGCC AGCCTCAAAA CCACACCGCA ACAAGCATAC AATGCGGTGC TCGCCACCAA AGATACGCCC AACAGCTATT ATCCCAATGC TTGGCGCTTG CTGAGCATGT TATTACTCAC GGGCAATTTT CCCAATTTAT ATGAAATGGC CCAAAGTGGC ACTGGCGGCA CGGCAACCCC AACTATCACA CCGACGATTA CGCGCACCCC AACTGCGACA GCAACGCGGA CTGCAACCGC AACTCCAAGC AACACCCCAA CCCGTACCTC GACTGGCACA CCAGCTACCA TCACGATAAC GCCTAGTCGC ACCCCAACCG CGACAATAAC CCCCAGCCGT ACACCAACGG TTGTACCAAA CCTCAATTTC CGAATGTATC TGCCATTTGC CAAATCGAAT AGTTAA
|
Protein sequence | MKQALHSRWL ILVMLVGLGL SLPRQQSASA NPQAFSFPYN QASGINYNVT DLTQAWNEWK SAQITSNNAG GGGRLRVLGG VSNSTTVSEG QGYGLLFASL FDDQATFDGL WLFTRDYFTS RGLMHWHIGN PGQINGSGAA TDGDEDIAMG LVNACIKVQQ GVWPASSNGL NYCTLASTMI NNIYTYEVDH AGSNPPAGLP NNQGNELLPG DSWSTATEYT QGIVNLSYFS PGYSTVFGKF TNKNTEWAAV NTRNYAITNL VQAKAGNCSK LVPNWNQYDG DVQYVSWQPE ESAWWSYDAA RFAWRIAIDK AWYNTSNSRE TMNEIGGFFS SVGIDNIQAR YRLDGTSVDN YRGVFFVANA AAAIWAAPAP QAVNCGAATA SLKTTPQQAY NAVLATKDTP NSYYPNAWRL LSMLLLTGNF PNLYEMAQSG TGGTATPTIT PTITRTPTAT ATRTATATPS NTPTRTSTGT PATITITPSR TPTATITPSR TPTVVPNLNF RMYLPFAKSN S
|
| |
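The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how the gene sequence could be cleaned, translated, and summarized by GC content follows. It assumes the spacing is display formatting only; the helper names (clean, gc_percent, translate) are hypothetical, and alternative start codons permitted by translation table 11 are not handled, since this gene begins with ATG.

```python
# Codon assignments in TCAG order. Translation table 11 uses the same
# amino-acid assignments as the standard code and differs only in which
# codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Drop the display spacing and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
prefix = "ATGAAGCAAG CATTGCACTC GCGTTGGCTT"
print(translate(prefix))  # MKQALHSRWL
print(gc_percent(prefix))
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions should allow the Protein Length and GC content fields to be checked in the same way.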