Gene Haur_2646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2646 
Symbol 
ID5734526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3394887 
End bp3397325 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content53% 
IMG OID641279788 
Productglycoside hydrolase family protein 
Protein accessionYP_001545412 
Protein GI159899165 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00270036 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGTTC ATTTCCAACG GCGGAAACGA TCACGCACGC TGATCGCAGC CTGTAGCACG 
GTTGCATTGA CGGCAAGTGT GGCTTTTGGG AACATCTCCT CAGTGTTTGG GGTTATCCAA
CACGAATCCC GCAATACGAC CCCGGCTAAT CTCCCCGCCG CTGCGGTAGC AGCTTCCTCA
CTGATTCGGG TCAACCAACA TGGCTATTTG CCCAACGCGA TCAAACGCGC TAGCTTGGTC
AATAGCAGTA ACTCGCCTGT GGCATGGCAA TTGCGCAATA GCTCTGGCAC AACCGTACTT
TCAGGCAATA CCACGGTTTT TGGCTACGAT GCCAGTTCTG GCGATACGAT TCATATCATC
GATTTTTCGG GCTACACTGG CACAGGCAGC AGCTTTACGT TAGCAGCGGC TGGCAATCTG
AGCCATCAAT TTGATATTAG CTCAACAGTG TATCGCCAAT TAAAATACGA TGCCCTCGCC
TACTTCTACC ATAATCGCAG CGGAATTGCG ATCACCATGC CCTATGCTGG ACGCAATGAT
CTGACTCGAC CTGCTGGCCA CGTTGGCATC GCTCCCAACC GTGGCGATAC CAATGTACCC
TGTGCGCCTG GCACTGGCTG TAGCTATTCG CTGAATGTAG TTGGCGGCTG GTACGATGCT
GGCGATCATG GCAAATATGT GGTCAATGGT GGGATTTCGG TCTGGACATT GCTTAACCAA
TATGAGCGCA ACCAATATTT AGGCTCGTCG GCAGCCGATT TTGGCGATGG CCGCATGAGT
ATTCCTGAAA ATAGCAATGG TGTAGCCGAT TTGCTCGACG AGGCGCGTTG GGAATTGGAT
TTTATGTTGC GCATGCAAGT GCCTGCTGGC CAACCACGGG CTGGCATGGT GCACCACAAA
ATGCACGATG CCAACTGGAC TGGCATTCCA CTGCGCCCCG ATCAAGATTC ACAGGATCGG
GTATTACGCC CACCAAGCAC CGCCGCAACC TTGAACTTGG CGGCAACTGC GGCCCAAGGC
TCACGGATCT GGCGTTTCAT TGATCCAACG TATTCAGCAC GTTTGCTGAG CGCTGCCGAA
ACTGCTTGGG CAGCGGCGCG GGCTAACCCC AACGTAATTG CTTTGGATAG CGATGGAACT
GGCGGTGGTT CGTATGGCGA TCCGCAAGTT GGCGATGAAT TTTATTGGGC CGCCAGTGAA
CTCTACATCA CAACTGGCAA AGCCGAATAC AAGAGCTATT TGCAAAGCTC AAGCTACTAC
CAAGATGTAC CCAGCGATTA CAGCAGCCGC GAACCAGCCA TGACGTGGGG CACAACCGAG
GCACTTGGCA CAATTTCGTT GGCGGTTGTA CCCAGCGGTT TCAATAGCAG CGAACTGGCG
GGCGTGCGCA ATGCAGTGAT CAGCGCAGCC ACCGTGTTTG CCAATAACAC CACTGGACAA
GGCTATGGCA CACCATTTAA TTCAACCAGC ACTGGTTATC CATGGGGTTC CAACTCGTTT
GTCTTGAATA ATGGCATTAT TTTGGGCTTG GCCCACGATT TCACTGGCAA CGTGAGCTAT
CTCAATGCGA TGAGCCAAGG TATGGATTAT TTGCTGGGCC GCAATGCGAT GGACAAATCG
TATGTGACGG GCTATGGCGA AAACCCCTTG CAAAACCCGC ATCACCGTTT CTGGGCCTTT
CAAGCGAATA GTGGCTTTCC TCGCCCACCA GCTGGGGCAG TTTCAGGCGG CCCTAACTCG
GCCTTGCAAG ACCCTTATGC TCAAAGCATT GGCTTGCCTG GCTGCAACGC CCAAAAATGT
TTCGTCGATC ATATCGAATC GTGGTCAACC AACGAAATTA CCATCAACTG GAATGCACCA
TTAGCTTGGG TCGCCGCCTA TTTAGATGAA AAGGCTGGTT CGGTAATACC ACCGACGGCT
ACGCCAATTC CGCCAACTGC GACTCCAATT CCGGCGACCG CCACGCCACG CCCAGCCACA
GCTACGCCAA TTCCGGCGAC GGCAACCCCG ATTGGCCCAA CCGCGACGGC TACGCCAATT
GTGCCAACCG CCACGCCACG TCCAGCGACG GCAACTCCTG TGCCAACCAA CACGCCAGTT
GTGCCAACCC AAACGAACTC AAGTTGCCAA GTGACCTATA CAATTTCTAA CCAATGGCCA
ACCGGCTTTA CCGCCGATGT TGTGATCAAA AACAACGGTG CGGCGATCAA TGGCTGGAAC
CTTGCTTGGA CGTATGCTGG TAATCAATTT ATTAGCAATT TGTGGAATGG CAATGTGAGC
CAAGTTGGTC AAGCGGTCAA TGTGACCAAC GTCAATTGGA ATGGCTACTT AGGCACTGGG
GCAACTGCTA GTTTTGGCAT GCAAGCGAGC TTTAGTGGCA CAAATGCCAA ACCAACTGCT
TTCAGCTTGA ATGGTATCGC CTGTAGCGTT ACGCCGTAA
 
Protein sequence
MTVHFQRRKR SRTLIAACST VALTASVAFG NISSVFGVIQ HESRNTTPAN LPAAAVAASS 
LIRVNQHGYL PNAIKRASLV NSSNSPVAWQ LRNSSGTTVL SGNTTVFGYD ASSGDTIHII
DFSGYTGTGS SFTLAAAGNL SHQFDISSTV YRQLKYDALA YFYHNRSGIA ITMPYAGRND
LTRPAGHVGI APNRGDTNVP CAPGTGCSYS LNVVGGWYDA GDHGKYVVNG GISVWTLLNQ
YERNQYLGSS AADFGDGRMS IPENSNGVAD LLDEARWELD FMLRMQVPAG QPRAGMVHHK
MHDANWTGIP LRPDQDSQDR VLRPPSTAAT LNLAATAAQG SRIWRFIDPT YSARLLSAAE
TAWAAARANP NVIALDSDGT GGGSYGDPQV GDEFYWAASE LYITTGKAEY KSYLQSSSYY
QDVPSDYSSR EPAMTWGTTE ALGTISLAVV PSGFNSSELA GVRNAVISAA TVFANNTTGQ
GYGTPFNSTS TGYPWGSNSF VLNNGIILGL AHDFTGNVSY LNAMSQGMDY LLGRNAMDKS
YVTGYGENPL QNPHHRFWAF QANSGFPRPP AGAVSGGPNS ALQDPYAQSI GLPGCNAQKC
FVDHIESWST NEITINWNAP LAWVAAYLDE KAGSVIPPTA TPIPPTATPI PATATPRPAT
ATPIPATATP IGPTATATPI VPTATPRPAT ATPVPTNTPV VPTQTNSSCQ VTYTISNQWP
TGFTADVVIK NNGAAINGWN LAWTYAGNQF ISNLWNGNVS QVGQAVNVTN VNWNGYLGTG
ATASFGMQAS FSGTNAKPTA FSLNGIACSV TP