Gene Information |
Locus tag | Haur_3638 |
Symbol | |
ID | 5735499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4575362 |
End bp | 4577296 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641280787 |
Product | Beta-galactosidase |
Protein accession | YP_001546402 |
Protein GI | 159900155 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
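The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated on the page, but both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Haur_3638.
length_bp = gene_length(4575362, 4577296)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1935 644
```

Both results match the "Gene Length" (1935 bp) and "Protein Length" (644 aa) rows, which supports the assumed conventions for this record.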
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATTAG GTGTTTGTTA TTACCCCGAG CATTGGCCCC AAACGTGGTG GGCCGATGAT GCCAAACAGA TGCAGGCCTT GGGCTTGGAA TATGTGCGGA TTGGCGAGTT TGCTTGGGCT TTGATGGAGC CAGCCGCTGG CCAGTATGAT TGGGTATGGC TCGACCAAGC AATTGAAACC TTGGCGAGCC AAGGCTTGAA CATTGTGCTT GGCACGCCCA CCGCTACCCC GCCTGCTTGG CTAACGCATA ACCAGCCCGA TTTAATGCGG ATCGATGCCC AAGGCCGTCG TTTAGGCCAT GGTGGTCGTC GCCAAGCTTG TTTGGTCAAT CCTCAGTACA TCGAATATAG CCGCCAGATC GTCACGGCCA TGGCCGAGCG CTATGGTCAG CATCCAGCGG TTGCCGCTTG GCAAATCGAT AACGAAATTG GCAATCATGG CTCGGCGCGT TGCTACTGCG AACATTGTGC TGCGGCTTTT CGCCAATGGC TGATCCAACG CTATGGCGAT TTAGCAGGCC TCAACGAAGC ATGGGGCACG GCCTTTTGGA GCCAAACCTA CAGCGATTGG CAACAAATTC CCTTGCCAAA TGTACCAGTT GGCGGCGGCC ATAATCCCTC GTTAGTGCTC GATTATCGCC GCTTCGCCTC GGATCAGCAG GTAGCATATT GCGCGATGCA GGCAGAAATT TTGCGCCAGC ACTCTCCAAA TCGCACGATT TTAACCAACA TCGCACCTGG CGACGATGAG ATTAATTGGT TTGATATGGC GCAGCAAGTC GATACAATTG CTTGGGATAA TTACCCGCAT GGCTTTCCCG ATTGGCAAGC GGTGGCGATG TATCACGACC ATATTCGTGG CCTCAAGCGT CAGCCATTTT GGGTGATGGA GCAACAGCCA GGCCAAATCA ATTGGACTCC CACCAATCCA CCAGTGCCAC CCAACCAAGT GCGCTTGTGG AGCTATCAAG ATGCCGCCCA TGGTGCAGCC AATGTGCTGT ATTTCCGCTG GCGGGCATGT TGGCTCGGCC AAGAGCAATA TCATAGCGGC CTGCGCGATC ATGCCAATCG GCCAGCGCGT GGCAGCACCG AAGCGCGGAT TGTTGCCAAC GAATGGCAGC AGCATGGCCA GCCCGAAGCT GCACCGCGCA AGGTTGCCTT GCTGGTTTCC TACGACGATC ATTGGGCGCA ACAACTCGAT CCGCATGCTC AAGGCTGGAA TTATTGGCAA TTGCTGCGCA CCATCCATCG CACGCTTACC AGCTATGGCG TTGGGGTCGA TATTGTGCAG CGTGGCACGC CACTCGCTGC CTATCAACTA GCGATTGCTG TCGCCCCAAT GCTCGATAAT CCTGCTGAAA CTGCGGGCTG GCGTGAGTGG GTTCAGGCAG GCGGCACGTT GATCTGCACG CCACGCAGTT TAACCAAACG CCGCGACAAT CGCACCGCTC CCGATGGCTT CCCCAGCGGC TTGACCGATT TATTTGGGGC TGATGTTGCC GAGTGGAGCG CCCTCGACCC AGCCAAGCCG TGGGCAGTCA AATTTGGCGA GACGAGCCAC ACCGCACCAC TTTGGATGGA AGTGCTGAAT GTGAGCCATG CCAATAGCTT AGCAACCTGG AGCAAAAGCT ACGCAAAGGG TCAGGCTGCA ATCACCGCCG CGACCTATGG CAAAGGCCTA GCAGTATTGA TGGGCTGCTA TCCCACCGAG GAAATTTTGG GCGATCTGCT GCCACGGCTC TGGCCCGCTG CCCAACGCTT GCCCAACGAA ATTGAACGCA TCGAGTTGAC CGATGGCGTG TTGTGGTTCA ACCATGGCGA ACAAGCCCAA AGCGTCAAAC TTCAAGGCAC TTGGCACGAT CGCTTGAGTG GCGAGCAATG CAGTGGCGAT TGTTCAATCG AAAGTTTAGG TATTCGCTGG CTCAAACCCC TATAA
|
Protein sequence | MPLGVCYYPE HWPQTWWADD AKQMQALGLE YVRIGEFAWA LMEPAAGQYD WVWLDQAIET LASQGLNIVL GTPTATPPAW LTHNQPDLMR IDAQGRRLGH GGRRQACLVN PQYIEYSRQI VTAMAERYGQ HPAVAAWQID NEIGNHGSAR CYCEHCAAAF RQWLIQRYGD LAGLNEAWGT AFWSQTYSDW QQIPLPNVPV GGGHNPSLVL DYRRFASDQQ VAYCAMQAEI LRQHSPNRTI LTNIAPGDDE INWFDMAQQV DTIAWDNYPH GFPDWQAVAM YHDHIRGLKR QPFWVMEQQP GQINWTPTNP PVPPNQVRLW SYQDAAHGAA NVLYFRWRAC WLGQEQYHSG LRDHANRPAR GSTEARIVAN EWQQHGQPEA APRKVALLVS YDDHWAQQLD PHAQGWNYWQ LLRTIHRTLT SYGVGVDIVQ RGTPLAAYQL AIAVAPMLDN PAETAGWREW VQAGGTLICT PRSLTKRRDN RTAPDGFPSG LTDLFGADVA EWSALDPAKP WAVKFGETSH TAPLWMEVLN VSHANSLATW SKSYAKGQAA ITAATYGKGL AVLMGCYPTE EILGDLLPRL WPAAQRLPNE IERIELTDGV LWFNHGEQAQ SVKLQGTWHD RLSGEQCSGD CSIESLGIRW LKPL
|
| |
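The sequences above are displayed in space-separated blocks of ten residues, so they need to be cleaned before any computation. The following is a minimal sketch of that step together with a GC-percentage calculation of the kind that could underlie the "GC content" row; the helper names are hypothetical, and the page does not say how its 56% figure was computed or rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used for display."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first two display blocks of the gene sequence, for illustration.
first_blocks = "ATGCCATTAG GTGTTTGTTA"
seq = clean_sequence(first_blocks)
print(len(seq), gc_percent(seq))  # 20 35.0
```

Applying the same two functions to the full "Gene sequence" value would give the figure to compare against the reported GC content; the short fragment used here is not expected to match it.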