Gene Haur_1626 details

Gene Information

Locus tag: Haur_1626
Symbol:
ID: 5733498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1884988
End bp: 1886508
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 49%
IMG OID: 641278765
Product: glycoside hydrolase family protein
Protein accession: YP_001544397
Protein GI: 159898150
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4124] Beta-mannanase
TIGRFAM ID:
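
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (this matches the listed gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1884988, 1886508)
print(length)                     # 1521
print(protein_length_aa(length))  # 506
```

Both values agree with the Gene Length and Protein Length fields of the record.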


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000010667
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCAAG TTCAACGAGT TCCATTGCTG CTGCTTGGCA TGCTGGCCTT GCTGATCAGT 
ACAATCCCTT CGACCTCAAC GAGCGGCGCA ACCAATGCCT ACGCCGCCGC CGACCCAGCA
ATTGCTAGCT ACGCCCAAAA CGTGATTAAT CGCATGGCCC AATATCAAGG CCACTCTATC
ATCAGCGGCC AGCAAGAAGT CCATTGGGAT TCCTCGCGCA AAGACGAAAT GTCCAACGCC
ATCTACAATC GCACAAATCC ACGCCAATAT CCAGGTTTGC GGGGCTGGGA TTTCCCCTTC
GGCGGAGCTT ACGCCAACGA TGCCCAATGG ATGATCGACA CCATGATTAG CGATTGGAAT
ACTGCCAAAG TGCTGCCAAC AATCAGCCAA CATTGGACAC CCTACGGCAG CCAAGGCACA
AATCACGATG ATATGTTTGT GCAAGTCAAT ATCAATAATA TATTTGTTGA TGGCACAACT
GAGCGTAGCC GCTACCTTAC CTGGCGCAGC AACATCGCTG ACGATTTGCA AAAACTGGAA
AATGCCAGTG TACCAGTCTT ATGGCGGCCC TATCACGAGG CGGGCGGTGG CTGGTTCTGG
TGGGATAAAA TTGGCTCAAG TAATTACAAG CGTTTGTGGA ATGACCTCTG GGATTATCTG
ACCAACAGCC GTGGCTTGCA CAATTTGATC TGGGTTTGGT CGGCAGGAAC CAAGGGAGTC
GGTACTGATT GGTATCCGAG CGGCAAGGTC GATATTCTTG GCCACGATAT TTATAACAAT
AGCTCGGCTG ACTATAGCAG TTGGTACACT GATTTAGCTC GTTTCGATAG TAGCAAATTA
CGAGCATTAA CCGAAGTCGA TTATATGCTT GATCCAGCAG CATTGAACAA TGCACCATTT
GCCTTCTTTA TGACGTGGCA TACCGACATG TTTTATCGCA ATAGCGATAG CAAAATCCAA
AGCGTTTATC AACATAACAA AACCGTCAAT CGCAGCCGCA TCAGCCAATA TCTGAATGGG
AGATTGGGTA GTAGTGGAAC CAACCCAACC GCTACGCCTA TCAGTAGTGG CGCAACGCTC
TATAATTTTG AGGGCAGCAC CCAAGGCTGG AGCGCCGCCA ATGTCAATGC AGGGCCATGG
TCGGTCAACG AATGGGCCGC CAATGGTAGC TACAGTCTCA AAGCCGATGT GAGTTTGGGC
AATCGTAGTT ACGACTTGAA ATTAACTCAA GCCCACAATT TTAGTGGCAA AAGCCAACTC
CAAGCCCGCG TGCGCCATGC CACTTGGGGC AATGTTGGCA GCGGAATTAG CGCCAAACTC
TATATCAAGG TTGGCTCAAA TTGGGCTTGG TACGACGGCG GAGCAATCAC AATTAATAGT
GGGGGCACAA CTACATTAAC CCTCAACTTG AGTGGCATTG CCAATCTTGG TATCGTCAAC
GAAATTGGGG TTAGTTTTAG TTCGCCCGCT AATAGCAGCG GCACTAGCGC AATTTATCTC
GATTATGTCA CCTTGCAATA A
 
Protein sequence
MSQVQRVPLL LLGMLALLIS TIPSTSTSGA TNAYAAADPA IASYAQNVIN RMAQYQGHSI 
ISGQQEVHWD SSRKDEMSNA IYNRTNPRQY PGLRGWDFPF GGAYANDAQW MIDTMISDWN
TAKVLPTISQ HWTPYGSQGT NHDDMFVQVN INNIFVDGTT ERSRYLTWRS NIADDLQKLE
NASVPVLWRP YHEAGGGWFW WDKIGSSNYK RLWNDLWDYL TNSRGLHNLI WVWSAGTKGV
GTDWYPSGKV DILGHDIYNN SSADYSSWYT DLARFDSSKL RALTEVDYML DPAALNNAPF
AFFMTWHTDM FYRNSDSKIQ SVYQHNKTVN RSRISQYLNG RLGSSGTNPT ATPISSGATL
YNFEGSTQGW SAANVNAGPW SVNEWAANGS YSLKADVSLG NRSYDLKLTQ AHNFSGKSQL
QARVRHATWG NVGSGISAKL YIKVGSNWAW YDGGAITINS GGTTTLTLNL SGIANLGIVN
EIGVSFSSPA NSSGTSAIYL DYVTLQ
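
The sequence blocks above are printed in 10-character groups, 60 characters per line, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = clean_sequence(
    "ATGTCCCAAG TTCAACGAGT TCCATTGCTG CTGCTTGGCA TGCTGGCCTT GCTGATCAGT"
)
print(len(first_line))  # 60
print(round(gc_percent(first_line), 1))
```

Applying the same two functions to the full gene sequence block would give values comparable to the Gene Length and GC content fields of the record.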