Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2181 |
Symbol | |
ID | 5734068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2763374 |
End bp | 2765080 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279322 |
Product | putative glycosyl hydrolase |
Protein accession | YP_001544949 |
Protein GI | 159898702 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3934] Endo-beta-mannanase |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGTTCA TGAGTCAGCG GCGCTTCAAA ATTATTATAT GGCTCCTCTT TTTGCTCATC CTTGGTCAAT GTGGCTGGTG GGGAGTTGGT CGGGTTGCCT GGCTTGTCGC TAGCTTCCAG CAAGGTGCCG ACCCTGCTAG TGCCCTAAAA TTGCCACCCA TGTTGCCGCC CGAAGCCGCA ATCGCAACAC AGTGGCTGCC TGACGACCCC GACACTGGTC GTGTGATGGA AGATTTTACT CGCCAACAAC TGACCAGCGA TTATGGTCGA GCGTGGCTCC AATGGAATCT TTCCTTGAAG ATTGGCCGTC CCTACAATTT ACACACCAGT TTTAGTGGGC CAGCCTTGCA TTATCTCACG ACTAGTATCA GCACGACTGA GAATATAAAG ATCGATCAAA TTGATACAGC CCACCAATTA CAACTGCATT TCTATGCAGC AGATGGCTCA ATTGTAGCTC TGACCGATAC CCAAGCCGAG ATCTATCAGA TGATCTATGA TGCTGACGGG GAAATAGTAT CGGCTCAATG GTTGTATGCA CGCTATGAGA TTGTGATGCT GTTGCAAGAT GGCACATGGC GAATTCGGCA TTGGGTTCGC CACGAACTGC CAACCCCGAT AGAACCCTCG TATCAACCAA GGCCTCAACT GTTGCAGGTT AATCAGCAAC AGTTACAACT TGCCGATCAG CCTTTTATCG CCCGAGGCGT TAATTATTAT CCTTCTAAAA CTCCATGGCT ACAATTTTGG CCTGCCTATC AACCCCAACA AACGACAAAA GATCTCGCGT TAGTTCAACA ACTAAATTTG AATAGTGTGC GGATTTTCAT TCCGTTTGAG CAATTTACCG AACCCTATAG CACAAGTCTC TATCTACGTT CTGTGACCGA TTTTCTGGAT CAGGCTGATC AAGCGAACAT CAAGGTCATT GTCACCTTAT TCGATTTTTT AGGCGATTAT CAGCTGGCAC GCTGGCAAGC AACCGACAGC TATTTAACAA CCGTGGTAAC TGCATTACGA GAACATCCGG CGATTATGGC ATGGGATGTT AAAAACGAGC CAGATCGTGA TTATGCAACT GCTAGTCAGG TAGTGGTTGA GGCATGGCTC CAGCATAGCA TCCGCCAGCT TCGTCGGCTT GATCCCCATC ATCTGATTAC GATTGGCTGG TCTACACCTG AAGCTGCTGA ACGGTTATAT CAAGATGTCG ATTTGGTCTC GTTTCATTAC TATGGATCAA CTGAACTATT GGCAACCCAC TATCAACGCC TCAGACAGGT AGTTGGCGAG AAACCGCTCC TGTTAACTGA GATTGGGATG CCAACCTGGA ATAGCCCATT TTTTCCTCAT GGCCACAACC AGCGTGAGCA AGCAAACTAC TTAACGGGAT TAATGCAACA AGCCGAAGCC TATAATGGTT TTTTGATCTG GACACTCTTC GATATTCCTG AGGTTCCAAA GGCTGTTGCT GGGTTTTGGC CATGGCAACG TGGCCCGCAA ACCCAATTAG GACTCTATAA CGAGACCTAT GAACCAAAAT TAATCAGCAA AGTGTTGTTG GGGCTTGAGC CTAGCCCGTT TATTCGCTGG TGGGAGGCGT GGCTCAAGCC GTTTCGAATA ACGCTTATCG TGCTAAGCAT GCTTGGATTA ATCGGAGGAT GGCAGGGTTA TCGCCGCTGG AAACGCAAAA AACACCCAAA GATTTAA
|
Protein sequence | MWFMSQRRFK IIIWLLFLLI LGQCGWWGVG RVAWLVASFQ QGADPASALK LPPMLPPEAA IATQWLPDDP DTGRVMEDFT RQQLTSDYGR AWLQWNLSLK IGRPYNLHTS FSGPALHYLT TSISTTENIK IDQIDTAHQL QLHFYAADGS IVALTDTQAE IYQMIYDADG EIVSAQWLYA RYEIVMLLQD GTWRIRHWVR HELPTPIEPS YQPRPQLLQV NQQQLQLADQ PFIARGVNYY PSKTPWLQFW PAYQPQQTTK DLALVQQLNL NSVRIFIPFE QFTEPYSTSL YLRSVTDFLD QADQANIKVI VTLFDFLGDY QLARWQATDS YLTTVVTALR EHPAIMAWDV KNEPDRDYAT ASQVVVEAWL QHSIRQLRRL DPHHLITIGW STPEAAERLY QDVDLVSFHY YGSTELLATH YQRLRQVVGE KPLLLTEIGM PTWNSPFFPH GHNQREQANY LTGLMQQAEA YNGFLIWTLF DIPEVPKAVA GFWPWQRGPQ TQLGLYNETY EPKLISKVLL GLEPSPFIRW WEAWLKPFRI TLIVLSMLGL IGGWQGYRRW KRKKHPKI
|
| |