Gene Haur_1177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1177 
Symbol 
ID5733070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1349957 
End bp1351756 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content51% 
IMG OID641278317 
Productglycoside hydrolase family protein 
Protein accessionYP_001543953 
Protein GI159897706 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTAT TACCAAAACC AAAAAACCTT CAGCGTGCTG AGGGAAGCTA TGCTTTACCC 
AAAATCTGGC AGATTAGCCT ACCTAGTGGG CTTGATCAGC GGGTTACGGC CAGTTTGGCG
GCACTGGGCG AATATCAAAT GGTTGAGCAA GCTGCCAATT TGCTGCTAAC GCTTGATTCA
AGCATCGTTC ATGCCGATGG CTATCGTTTG CAGATTGCTG CTGATGGCGT GCAGATTGTG
GCGCAAACTG CGGCAGGCTT GTTTTATGGC CTGCAAACGC TGCGCCAAAT TCAACAACAA
TCAGCTGAGC AATTGCCGTA TCTATTAATT GATGATGCCC CTGATTTTGT GGTGCGTGGC
TTTATGCTCG ATATTAGCCG CGATAAAGTG CCAACCATGG CCACGCTCTA TGCGCTAATC
GACGAATTAG CTAGCTGGAA AATTAACCAA ATTCAGCTGT ATATTGAGCA TACGTTTGCT
TACCAAAACC ACCCCACAGT ATGGGCTGAT GCTTCGCCGT TGACTGCCGA GGAAGTACGG
GCGCTCGATG ATTTCTGTCT TGAACGCTTT ATCGAATTAG TGCCCAACCA AAATTGTTTT
GGCCATATGC GACGTTGGCT GACCAAACCA GCCTATCGCG ATTTGGCCGA ATGCCCCGAT
GGCTGCGACA CTGGCGACCC CGATTGGGGC TATTTTGAAG AACCCTTCAC CTTAGCTCCC
GAACACCCAG GCAGCATCGA ATTAGTGCGT AGCATGCTCG ACGAACTATT GCCCAACTTC
CGCACTCGCA CACTCAACGT TGGCTGCGAC GAAACGGTCG AATTGGGCTT GGGGGCAAGT
GCAGCAGCAG TTGCTGAGCG CGGCAAAGGT CGGGTTTATC TCGAATTTTT GCAAAAGCTC
TACCACGAAG CCCATTCACG CGGCTATGTG ATGCAATTCT GGAGCGATAT TATTTTGCAT
TATCCAGAAT TGGTCAGCGA ATTGCCCCGC GATGCCGAGG TGTTGATTTG GGGCTACGAG
TCGCATCATC CCTTTGAAGA ACAAGCCGCC ACCATTGCCA AAGCGGGCTT GCCCTTCTAC
GTTTGCCCAG GCACGTCAAG CTGGAACACC GTCGCAGGCC GCACCACCAA CGCCTTGGAA
AATATCGCGA GCGCCGCCAA ACATGGCTTG AACCATGGTG CAAAAGGCTT TTTGATGACC
GATTGGGGCG ATAACGGCCA CTGGCAACCA CTCGTGACCA GCTATCTTGG CTTGGCAATC
GGCGCTGCTC GGGCTTGGAA TGCGCTCGCT GAGCTTGATG TGGTTGCCTT GCTCGATACG
GTGGTTTTTG CCGATAGCAA CAAGGTTTTG GGCAATTTGG TCTATCAATT GGGCGATGTC
TATCGCGCTG TACCACCGTT ATTGCACAAT ACCTCGTCGT TATTCCGCAT TTTGCAGGCC
AACCCAGCCG CAATTGCCGA ATTAAACCTT GATCGCAAAA ATTTGCATAA TGCCGATGAG
ATTTTGCTGC AACTGCGCAA CGACTTGCAA ACCATCAAAC CCCAACGCGC TGATGGCGAG
CTTTGCCAAG TTGAATTAGC TTGGGCGATC GATTTATTGC GCCATGCGGT GCAACGCGGT
TTGTGGGTGC TTGATCAACA GCCAAGCGAA ACCGCTAGCA AATTGCAACC AGAAATCGAT
GCCTTGATCG AACGCTTCCA ACAAGTTTGG CTGAGCCGCA ATCGCCCCGG TGGCTTGCAA
GATAGCCTTA AACATTTTGC AAGCTTGCGC GATAGCTATG GCGAGGTGCG TTATGCCTGA
 
Protein sequence
MQLLPKPKNL QRAEGSYALP KIWQISLPSG LDQRVTASLA ALGEYQMVEQ AANLLLTLDS 
SIVHADGYRL QIAADGVQIV AQTAAGLFYG LQTLRQIQQQ SAEQLPYLLI DDAPDFVVRG
FMLDISRDKV PTMATLYALI DELASWKINQ IQLYIEHTFA YQNHPTVWAD ASPLTAEEVR
ALDDFCLERF IELVPNQNCF GHMRRWLTKP AYRDLAECPD GCDTGDPDWG YFEEPFTLAP
EHPGSIELVR SMLDELLPNF RTRTLNVGCD ETVELGLGAS AAAVAERGKG RVYLEFLQKL
YHEAHSRGYV MQFWSDIILH YPELVSELPR DAEVLIWGYE SHHPFEEQAA TIAKAGLPFY
VCPGTSSWNT VAGRTTNALE NIASAAKHGL NHGAKGFLMT DWGDNGHWQP LVTSYLGLAI
GAARAWNALA ELDVVALLDT VVFADSNKVL GNLVYQLGDV YRAVPPLLHN TSSLFRILQA
NPAAIAELNL DRKNLHNADE ILLQLRNDLQ TIKPQRADGE LCQVELAWAI DLLRHAVQRG
LWVLDQQPSE TASKLQPEID ALIERFQQVW LSRNRPGGLQ DSLKHFASLR DSYGEVRYA