Gene Haur_2181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2181 
Symbol 
ID5734068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2763374 
End bp2765080 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content48% 
IMG OID641279322 
Productputative glycosyl hydrolase 
Protein accessionYP_001544949 
Protein GI159898702 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3934] Endo-beta-mannanase 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGTTCA TGAGTCAGCG GCGCTTCAAA ATTATTATAT GGCTCCTCTT TTTGCTCATC 
CTTGGTCAAT GTGGCTGGTG GGGAGTTGGT CGGGTTGCCT GGCTTGTCGC TAGCTTCCAG
CAAGGTGCCG ACCCTGCTAG TGCCCTAAAA TTGCCACCCA TGTTGCCGCC CGAAGCCGCA
ATCGCAACAC AGTGGCTGCC TGACGACCCC GACACTGGTC GTGTGATGGA AGATTTTACT
CGCCAACAAC TGACCAGCGA TTATGGTCGA GCGTGGCTCC AATGGAATCT TTCCTTGAAG
ATTGGCCGTC CCTACAATTT ACACACCAGT TTTAGTGGGC CAGCCTTGCA TTATCTCACG
ACTAGTATCA GCACGACTGA GAATATAAAG ATCGATCAAA TTGATACAGC CCACCAATTA
CAACTGCATT TCTATGCAGC AGATGGCTCA ATTGTAGCTC TGACCGATAC CCAAGCCGAG
ATCTATCAGA TGATCTATGA TGCTGACGGG GAAATAGTAT CGGCTCAATG GTTGTATGCA
CGCTATGAGA TTGTGATGCT GTTGCAAGAT GGCACATGGC GAATTCGGCA TTGGGTTCGC
CACGAACTGC CAACCCCGAT AGAACCCTCG TATCAACCAA GGCCTCAACT GTTGCAGGTT
AATCAGCAAC AGTTACAACT TGCCGATCAG CCTTTTATCG CCCGAGGCGT TAATTATTAT
CCTTCTAAAA CTCCATGGCT ACAATTTTGG CCTGCCTATC AACCCCAACA AACGACAAAA
GATCTCGCGT TAGTTCAACA ACTAAATTTG AATAGTGTGC GGATTTTCAT TCCGTTTGAG
CAATTTACCG AACCCTATAG CACAAGTCTC TATCTACGTT CTGTGACCGA TTTTCTGGAT
CAGGCTGATC AAGCGAACAT CAAGGTCATT GTCACCTTAT TCGATTTTTT AGGCGATTAT
CAGCTGGCAC GCTGGCAAGC AACCGACAGC TATTTAACAA CCGTGGTAAC TGCATTACGA
GAACATCCGG CGATTATGGC ATGGGATGTT AAAAACGAGC CAGATCGTGA TTATGCAACT
GCTAGTCAGG TAGTGGTTGA GGCATGGCTC CAGCATAGCA TCCGCCAGCT TCGTCGGCTT
GATCCCCATC ATCTGATTAC GATTGGCTGG TCTACACCTG AAGCTGCTGA ACGGTTATAT
CAAGATGTCG ATTTGGTCTC GTTTCATTAC TATGGATCAA CTGAACTATT GGCAACCCAC
TATCAACGCC TCAGACAGGT AGTTGGCGAG AAACCGCTCC TGTTAACTGA GATTGGGATG
CCAACCTGGA ATAGCCCATT TTTTCCTCAT GGCCACAACC AGCGTGAGCA AGCAAACTAC
TTAACGGGAT TAATGCAACA AGCCGAAGCC TATAATGGTT TTTTGATCTG GACACTCTTC
GATATTCCTG AGGTTCCAAA GGCTGTTGCT GGGTTTTGGC CATGGCAACG TGGCCCGCAA
ACCCAATTAG GACTCTATAA CGAGACCTAT GAACCAAAAT TAATCAGCAA AGTGTTGTTG
GGGCTTGAGC CTAGCCCGTT TATTCGCTGG TGGGAGGCGT GGCTCAAGCC GTTTCGAATA
ACGCTTATCG TGCTAAGCAT GCTTGGATTA ATCGGAGGAT GGCAGGGTTA TCGCCGCTGG
AAACGCAAAA AACACCCAAA GATTTAA
 
Protein sequence
MWFMSQRRFK IIIWLLFLLI LGQCGWWGVG RVAWLVASFQ QGADPASALK LPPMLPPEAA 
IATQWLPDDP DTGRVMEDFT RQQLTSDYGR AWLQWNLSLK IGRPYNLHTS FSGPALHYLT
TSISTTENIK IDQIDTAHQL QLHFYAADGS IVALTDTQAE IYQMIYDADG EIVSAQWLYA
RYEIVMLLQD GTWRIRHWVR HELPTPIEPS YQPRPQLLQV NQQQLQLADQ PFIARGVNYY
PSKTPWLQFW PAYQPQQTTK DLALVQQLNL NSVRIFIPFE QFTEPYSTSL YLRSVTDFLD
QADQANIKVI VTLFDFLGDY QLARWQATDS YLTTVVTALR EHPAIMAWDV KNEPDRDYAT
ASQVVVEAWL QHSIRQLRRL DPHHLITIGW STPEAAERLY QDVDLVSFHY YGSTELLATH
YQRLRQVVGE KPLLLTEIGM PTWNSPFFPH GHNQREQANY LTGLMQQAEA YNGFLIWTLF
DIPEVPKAVA GFWPWQRGPQ TQLGLYNETY EPKLISKVLL GLEPSPFIRW WEAWLKPFRI
TLIVLSMLGL IGGWQGYRRW KRKKHPKI