Gene Haur_0293 details


Gene Information

Locus tag: Haur_0293
Symbol:
ID: 5732188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 345231
End bp: 346766
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 51%
IMG OID: 641277417
Product: glycoside hydrolase family protein
Protein accession: YP_001543073
Protein GI: 159896826
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3405] Endoglucanase Y
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.240918
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCAAG CATTGCACTC GCGTTGGCTT ATTTTGGTTA TGTTGGTTGG TCTCGGCCTT 
AGCTTGCCCC GTCAGCAATC GGCCAGCGCC AATCCGCAAG CATTTAGCTT TCCCTATAAT
CAAGCTTCGG GCATTAACTA CAATGTGACC GATCTCACTC AAGCTTGGAA TGAATGGAAA
AGTGCTCAAA TTACCAGCAA CAACGCTGGT GGTGGCGGAC GTTTGCGGGT ACTCGGCGGA
GTTAGCAATA GCACCACGGT CTCTGAGGGC CAAGGCTATG GCTTGCTTTT CGCCTCATTA
TTCGATGATC AAGCAACCTT CGACGGTTTG TGGCTTTTCA CTCGTGATTA TTTTACCAGC
CGCGGCCTGA TGCATTGGCA CATCGGCAAT CCAGGCCAAA TTAATGGCAG CGGCGCAGCA
ACCGATGGCG ATGAAGATAT TGCCATGGGA CTGGTCAATG CCTGTATCAA AGTGCAACAA
GGAGTTTGGC CCGCTAGCAG CAATGGCCTG AATTATTGCA CGCTTGCCAG CACCATGATC
AACAATATTT ATACCTATGA AGTTGATCAT GCTGGCAGCA ACCCACCCGC CGGCTTGCCC
AATAACCAAG GCAACGAACT ACTGCCAGGT GATTCATGGT CAACCGCGAC CGAATATACT
CAAGGTATTG TTAATCTTTC TTATTTTTCA CCTGGCTATA GCACGGTCTT TGGCAAATTC
ACCAACAAAA ATACTGAATG GGCCGCCGTC AACACCCGTA ACTACGCTAT CACCAATTTG
GTGCAAGCCA AAGCAGGCAA TTGCTCGAAA CTTGTGCCCA ATTGGAATCA ATATGACGGC
GATGTGCAAT ATGTATCGTG GCAACCAGAA GAATCGGCGT GGTGGAGCTA CGATGCAGCG
CGGTTTGCAT GGCGCATCGC GATCGACAAA GCGTGGTACA ACACCAGCAA CTCACGCGAA
ACCATGAACG AAATTGGTGG TTTTTTCAGC AGCGTGGGCA TTGACAACAT TCAAGCTCGC
TATCGGCTTG ACGGCACATC AGTTGATAAT TATCGTGGCG TATTCTTCGT CGCCAACGCA
GCGGCGGCAA TTTGGGCGGC TCCAGCTCCG CAAGCAGTCA ATTGTGGTGC GGCAACTGCC
AGCCTCAAAA CCACACCGCA ACAAGCATAC AATGCGGTGC TCGCCACCAA AGATACGCCC
AACAGCTATT ATCCCAATGC TTGGCGCTTG CTGAGCATGT TATTACTCAC GGGCAATTTT
CCCAATTTAT ATGAAATGGC CCAAAGTGGC ACTGGCGGCA CGGCAACCCC AACTATCACA
CCGACGATTA CGCGCACCCC AACTGCGACA GCAACGCGGA CTGCAACCGC AACTCCAAGC
AACACCCCAA CCCGTACCTC GACTGGCACA CCAGCTACCA TCACGATAAC GCCTAGTCGC
ACCCCAACCG CGACAATAAC CCCCAGCCGT ACACCAACGG TTGTACCAAA CCTCAATTTC
CGAATGTATC TGCCATTTGC CAAATCGAAT AGTTAA
 
Protein sequence
MKQALHSRWL ILVMLVGLGL SLPRQQSASA NPQAFSFPYN QASGINYNVT DLTQAWNEWK 
SAQITSNNAG GGGRLRVLGG VSNSTTVSEG QGYGLLFASL FDDQATFDGL WLFTRDYFTS
RGLMHWHIGN PGQINGSGAA TDGDEDIAMG LVNACIKVQQ GVWPASSNGL NYCTLASTMI
NNIYTYEVDH AGSNPPAGLP NNQGNELLPG DSWSTATEYT QGIVNLSYFS PGYSTVFGKF
TNKNTEWAAV NTRNYAITNL VQAKAGNCSK LVPNWNQYDG DVQYVSWQPE ESAWWSYDAA
RFAWRIAIDK AWYNTSNSRE TMNEIGGFFS SVGIDNIQAR YRLDGTSVDN YRGVFFVANA
AAAIWAAPAP QAVNCGAATA SLKTTPQQAY NAVLATKDTP NSYYPNAWRL LSMLLLTGNF
PNLYEMAQSG TGGTATPTIT PTITRTPTAT ATRTATATPS NTPTRTSTGT PATITITPSR
TPTATITPSR TPTVVPNLNF RMYLPFAKSN S
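
The lengths listed in this record can be cross-checked against the coordinates and the sequences shown above. The following is a minimal Python sketch, not part of the original record. It assumes the start and end coordinates are inclusive, that the sequences are laid out in space-separated blocks of ten, and that a terminal stop codon is not translated. The function names (`clean_sequence`, `gc_percent`, `expected_protein_length`) are hypothetical helpers introduced here for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay the sequence out in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def expected_protein_length(dna: str, stop_codons=("TAA", "TAG", "TGA")) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumption: if the last codon is a stop codon, it is not translated.
    """
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = len(dna) // 3
    return codons - 1 if dna[-3:] in stop_codons else codons


# Coordinates copied from the record; assumed to be inclusive.
start_bp, end_bp = 345231, 346766
gene_length = end_bp - start_bp + 1

# First and last lines of the gene sequence, copied from the record.
first_line = clean_sequence(
    "ATGAAGCAAG CATTGCACTC GCGTTGGCTT ATTTTGGTTA TGTTGGTTGG TCTCGGCCTT"
)
last_line = clean_sequence("CGAATGTATC TGCCATTTGC CAAATCGAAT AGTTAA")

print(gene_length)           # 1536, matching "Gene Length"
print(gene_length // 3 - 1)  # 511, matching "Protein Length"
print(last_line[-3:])        # TAA, a stop codon
```

To check the reported GC content, the full gene sequence could be pasted into `clean_sequence` and the result passed to `gc_percent`.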