Gene Haur_2520 details


Gene Information

Locus tag: Haur_2520
Symbol:
ID: 5734398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3218561
End bp: 3221338
Gene Length: 2778 bp
Protein Length: 925 aa
Translation table: 11
GC content: 51%
IMG OID: 641279660
Product: glycoside hydrolase family protein
Protein accession: YP_001545286
Protein GI: 159899039
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
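
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
print(gene_length_bp(3218561, 3221338))  # 2778
print(protein_length_aa(2778))           # 925
```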


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAGCA ACCCACCCCG CGAACGAATT CTGTTTGATC AAGGCTGGCG TTTTGCCCTC 
GGCCATGCCT TCGACCCCGC CCAAGATTTT CAGCTGGATA CAAGTCATTT TTCGTTTCTG
GCCAAGGCTG GCTATGGCGA CGGCGCAGCC TCAGCCAAGT TCGACGATCG GGCTTGGCGT
TTGCTCGATC TACCCCATGA TTGGTGTGTT GAATTGCCCT TCGATCAACG CGGAACTCAC
AGCCATGGCT ACAAAGCAAT TGGCCGTAAG TTTCCCGCCA ATAGCGTTGG TTGGTATCGC
AAAACCTTTG CAATTCCCGC TGACGATCTT GGGAGACGCA TCAGCCTCGA ATTCGATGGG
GTGCATCGCG ATTCTAAAGT TTGGGTCAAT GGCTTTTATC TAGGCAACGA GCCAAGCGGC
TACACCAGTT TTAGCTACGA TATCAGCGAT TATCTGAATT ATGGCGGCCA AAACGTGGTG
GTAGTACGCG CCGATGCCAC CACCGAAGAA GGCTGGTTTT ATGAAGGCGC TGGCATTTAT
CGCCACGTTT GGTTGAGCAA AACGGCTCAA GTCCATGTCG CCCGCTATGG CACATTCGTC
ACTAGCGAAA TTCATGAAGC TAGCGCCACG CTCACAATTC AAACAACGGT TATTAACGAA
AGCCAACATC CAATCGAATT TAGCTTGCTG GAAACAATCA GCGATCCGGC TGCCAATAAT
GTGCTGCAAG CCCAAAGTGC CAATTATCAG CTGGCGGCAG GCCAAAGTGC TGAATTCAGC
CACCAGCACA CGCTCAACAA TCCGCAACTC TGGGATCTTG CTAGCCCACA TTTGTATCAA
CTAACCACCA CTGTGCTGCT TGGTGAGCAG GCAGTAGATA GCTATCAAAC AACCTTTGGC
ATTCGCAGCA TTCGCTTTGA TCCTGATCAG GGCTTTTTCT TGAATGGGCG TTCGGTCAAA
TTGGTTGGCA GCAACCAGCA TCAAGATCAC GCGGGAGTTG GCACGGCTAT TCCCGATAGC
TTGCAAGAAT ATCGCATCAA ACGCTTGCAG GAATTCAATC ATAATGCGAT TCGGACTTCG
CACAACCCGC CAACCCCCGA ATTCCTCGCT GCATGCGATC GTTTGGGAAT GTTGGTGCTG
AATGAAAATC GCTTGATGGG AACAAATGCT GAGCATTTAC GCCATGTTGA GCAGCTGATC
AAGCGTGATC GGAATCATCC TTCGGTGATT TTGTGGTCGT TGGGCAACGA AGAATGGGCG
ATCGAGGGTA CGATCATCGG CGCACGCATC TCCAGCACCA TGCAAAATTT TGCCCGTCAA
TTCGATCATA CACGCCTTTT CACAATCGCT TGTAGCGGCG GCTGGGATAC TGGTATTGGA
ATGGTGACCG ATGTGGTTGG CTATAATTAT ATCTTTCACG GCGATATTGA TGCCCATCAT
GCGAAATTCC CATGGCAATC GGGCGTTGGA ACAGAGGAAA GTAATACCTA CGGTACACGC
GGGATCTACC AAACCGATGC CAGTCGTGGG CATTTAGCGC CTTCTCCCAG TGGTGATGGC
TACGCCGATA CGGAGTTCGG CTGGCAATTT TATGTCAAAC GACCATTCTT GGCTGGGTTA
TTCTATTGGA CTGGCTTTGA TTATCGGGGC GAGCCAACGC CCTTTGAATG GCCAGCGGTC
GCCAGTCAAT TTGGCTTTTT CGATTTGTGC GGCTTCCCCA AAGATATTGC CTATTACACC
AAGGCTTGGT GGGGCCAGCA ACCAGTCTTG CACATTGGCT ATCACTGGGA TTGGCCGAAT
GATCTTGATC GAGTGAAAAG CTTTCCAATT TATAGCAATT GCCAAGAAGT TGAGCTATGG
CTGAATGGTC AGAGCCTTGG CCGCCGCCCA ATTCCCGAAA ACGGCCACCA ACAATGGGAA
GTTGCCTATC AACCAGGCGA ATTGTTAGCG CGTGGCTATA ACAACGGGGT TGAAGTGCTC
AGCGCTAGTT TGCGCACCAG CCAAGCTGCC AGCCAAATTC AACTGGTGCT CGATTATGGT
CAATTGCAAC AAACAGGTGA TCTAGCCGTT GTCACCTTGC AAGTTTGCGA TGATCATGGC
CAGATTGTGC CAACCGCCAA TCAACTACTC GATCTGCAAT TAACCGGTTC GGCCAAAATT
TTAGGGGTCG GCAACGGTGA TCCAGCTAGC CACGAGCCTG AGCAATATCA CGCTAGCTAT
CAACTTTGCC CAATCCAAAT TAATGCCGAG CAAACCGTGG CCGAGTTGAG CACAGGTTTG
GAGCGAGCGC TAGGAGCCGA AACCCAAGTT TGGCAACCAG CATTCTTGCA TCACAACGAC
GATCAACAGG CTAATCCTCA GGCCTATATT CTGTTGCGTG GCAGCTTCGA GCTACCACAG
TACAGCGCTG CCAGCAGCAT TCGCTTGTTC AGCAAAAGCA TAGCCCACGA TTACAGCGTC
TATATCAACG GCCAATTGAT CATCAGCGAT ATTGCGCTTG AGCAAGGCGA TCAAGCGCTT
GAATTGAGCC ATGAACTGGT GCACCCAGGC CAAAACGAAT ATTTGGTGAC TGGGCGCTGG
ATCAAAAAAG TCAATCCTTG GACGTTCCCC AATCGTGAGC CTGGCGTGGT GCAGGTTATC
GAGCCAGCCG CCGCTTGGCA ACGCCGCGCC TTCAACGGCC TAGCCCAAGT CATCGTGCAA
CATACTGGCG GCGACCAGCC AATTAGCCTT AGTGCCAGCA ACCCTGAGCT AGGCACGGCA
ACGCTTAAGT TCGATTAA
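
The gene sequence is laid out in space-separated blocks of 10 bases, so any whitespace has to be stripped before the sequence is analysed. As a minimal sketch (the function names are hypothetical and not part of the database), the following normalises such a block and computes its GC fraction, which can then be compared against the GC content reported in the gene information.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, pasted as-is with its spacing.
first_line = "ATGATTAGCA ACCCACCCCG CGAACGAATT CTGTTTGATC AAGGCTGGCG TTTTGCCCTC"
print(len(clean_sequence(first_line)))  # 60
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full sequence as one multi-line string works the same way, since all whitespace is discarded.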
 
Protein sequence
MISNPPRERI LFDQGWRFAL GHAFDPAQDF QLDTSHFSFL AKAGYGDGAA SAKFDDRAWR 
LLDLPHDWCV ELPFDQRGTH SHGYKAIGRK FPANSVGWYR KTFAIPADDL GRRISLEFDG
VHRDSKVWVN GFYLGNEPSG YTSFSYDISD YLNYGGQNVV VVRADATTEE GWFYEGAGIY
RHVWLSKTAQ VHVARYGTFV TSEIHEASAT LTIQTTVINE SQHPIEFSLL ETISDPAANN
VLQAQSANYQ LAAGQSAEFS HQHTLNNPQL WDLASPHLYQ LTTTVLLGEQ AVDSYQTTFG
IRSIRFDPDQ GFFLNGRSVK LVGSNQHQDH AGVGTAIPDS LQEYRIKRLQ EFNHNAIRTS
HNPPTPEFLA ACDRLGMLVL NENRLMGTNA EHLRHVEQLI KRDRNHPSVI LWSLGNEEWA
IEGTIIGARI SSTMQNFARQ FDHTRLFTIA CSGGWDTGIG MVTDVVGYNY IFHGDIDAHH
AKFPWQSGVG TEESNTYGTR GIYQTDASRG HLAPSPSGDG YADTEFGWQF YVKRPFLAGL
FYWTGFDYRG EPTPFEWPAV ASQFGFFDLC GFPKDIAYYT KAWWGQQPVL HIGYHWDWPN
DLDRVKSFPI YSNCQEVELW LNGQSLGRRP IPENGHQQWE VAYQPGELLA RGYNNGVEVL
SASLRTSQAA SQIQLVLDYG QLQQTGDLAV VTLQVCDDHG QIVPTANQLL DLQLTGSAKI
LGVGNGDPAS HEPEQYHASY QLCPIQINAE QTVAELSTGL ERALGAETQV WQPAFLHHND
DQQANPQAYI LLRGSFELPQ YSAASSIRLF SKSIAHDYSV YINGQLIISD IALEQGDQAL
ELSHELVHPG QNEYLVTGRW IKKVNPWTFP NREPGVVQVI EPAAAWQRRA FNGLAQVIVQ
HTGGDQPISL SASNPELGTA TLKFD
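
The protein sequence corresponds to the gene sequence translated with translation table 11. A rough, dependency-free sketch of that translation is given below. It assumes the NCBI convention that table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which is not relevant here because this CDS starts with ATG). The function name is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI standard / table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGATTAGCA ACCCACCCCG CGAACGAATT"))  # MISNPPRERI
print(translate("ACGCTTAAGT TCGATTAA"))               # TLKFD
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above.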