Gene Information |
Locus tag | Haur_2392 |
Symbol | |
ID | 5734273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3047564 |
End bp | 3049582 |
Gene Length | 2019 bp |
Protein Length | 672 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279533 |
Product | Beta-galactosidase |
Protein accession | YP_001545160 |
Protein GI | 159898913 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
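The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Both are assumptions, although they agree with the stated values. The variable names are illustrative only.

```python
# Values copied from the Gene Information rows above.
start_bp = 3047564
end_bp = 3049582
gene_length_bp = 2019
protein_length_aa = 672

# Assumption: coordinates are 1-based and inclusive on the replicon.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length_bp // 3
residues = codons - 1

print(span == gene_length_bp)           # coordinate span matches gene length
print(gene_length_bp % 3 == 0)          # length is a whole number of codons
print(residues == protein_length_aa)    # codons minus stop matches protein length
```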
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTGA CGCTGACTGC CAACGGTCTT GCAGTCAATG GTCAGGAAGT GCCGGTTTAT TCCGGCACTA TTCACTATTG GCGTTTAGAG CGCGACCGCT GGGACTATAT TCTCGACCAA GCTAAGGCGC TTGGGTTTTC GATGATCGAA ACCTACATTC CTTGGGGCGT GCATGAAACC GCTCCCGGTC AATACGATTG GGGCCAAATC GATCCACGCA AAGATCTCGC GGCCTTTATG CGTTTGTGCC ATGAACGGGG AATCTGGCTC ATTGCGCGGC CTGGGCCATT GATCAATGCT GAATTGACCG ACTTTGGCTT TCCCCAATGG ATTTTGGACG ATCCAGCGAT GCAAGCTCGC ACCGTGCTGG ATACCATGCA TATGAGCTTA GCAGCGGGTT TGCATCCACC ACATCAATTT CCCGTGCCAT CGTATACCAG CCCTGAATAT TTGGCCGCCG TGGGCGTTTG GTTCGACCAC ATGTGCCAAG TGATTGTGCC GCAACTTGCG CCGCATGGGC CAATTGTCGC GGTGCAAAGC GATAATGAAA CCTGCTATAT GTTTCATGAG CAAGCTTATG CGACCGATTA CAGCCCTTCG TCGTTGGCCT TGTATCAAGC AATGTTGGCT GAGCAATATG GCGCAATCGA AGAATTGAAT CAAGCCTATG CGACCAACTA TGCCAATTTC AGCGAAGTGA TTGCGCCGCG CGATGCCGAT ATCGCCAGCC GCCCCGATTT AGTGGCTCAC CTCGATTGGG TGCGCTACAA AGAAGTGCAT GTTAATTGGA TTGTCGCGAC CATGGCCGAT ATGCTGCGTG AGCGTGGCGT GGTTGATGTG CCATTATTCC ACGATGTAGC CTTCCAGTAT CGCACGCCGC TGGATATTAA CGCCATGGAA GCCAACGGCC ATGTCGATTG GGTTGGGATC AACTATTATC GCCAACCACA AGGCTTTGAT GGCGCGATCA CCTTAGTGCG CTATATGGCT GGTACCACGC GCCTGCCGTT TATCCCTGAA TTTGGCAGCG GCTTGTGGAT TCACCATGCG CTCACGCCAC GGCCTGAAGA AGCAGAATTT GTAACTTTAG CGGCCTTGAT GTATGGAATT AAAGCCTTCA ATTTCTATAT GTTGGTCGAG CGTGATCGTT GGTTGGCCTG CCCAATCACT CGTCATGGCG ATTATCGACC GGAATATGCG CAATTGTTTG AGCGCTTGAT GGGCTTTTTG CAGCAGCAAC AATGGTGGAA CTTCCAGCGC AAACCTGAAG TTTTGGTGCT GATGAGCTAT GATCTTGGGC GCTATTGGGC CGCCACTTCA ACCTTGCACT ATGGCCATGT TGATTTACTC GGCCTGCCGC CAGCATTAAA TCGGGTCGAA TTGGATTTGG GCTTTAGCAC CGATCTCGAA GTTGAAAGCG ACGATTTTCA TCCGCAAAGT TGGTATGGCT CGTTGCGGAG CACGCTCGAT GCCAACCATA TCGACTACGA TTTGAGCGAT AGCCATTTGC GCAGCTCACG AATTGCTGAC TATAAATTGG TGTTTGCCCA GAGCGTCGAT TGGATGAGCC GCGCCGATCA ACAGCGGTTG TTGCAGGCTG CCGAAACTGG TGCAACCGTT TGGCTTGGCC CGACCTTGCC AACCCTTGAC GAATACTTCC AGCCCTGCAC AATCTTGGCT GATCAATTGG CGGGCAAACG CCAAGTTGCC TTGGGCAGTG GCCATTTGGG CTTACTGAGC CAAGCCGAAT TAGGCGCATT TTGTGCCGAA GTTGCAGCGG GTTTGCCTGT ACGGCCCCAA 
AACCCTCAAT TGGCGGTTAC CAGCCATCGT CACCAAGGCC GCGAAATTCT GTTTGTAGCC AACCCAACCG CTGAAACGAT TAACTCAAGC TTGGATTTTG CCCATCCTGT GCGCTTAACC AGCTTGTGGG GAGCATTCGC TGGCGCTGAT CTTCAACAAC AGCAGCCGAT TAGCCTAGCA GCCTACACGA TTGCAATTGT CGAGGTGCAG CATGATTGA
Protein sequence | MSVTLTANGL AVNGQEVPVY SGTIHYWRLE RDRWDYILDQ AKALGFSMIE TYIPWGVHET APGQYDWGQI DPRKDLAAFM RLCHERGIWL IARPGPLINA ELTDFGFPQW ILDDPAMQAR TVLDTMHMSL AAGLHPPHQF PVPSYTSPEY LAAVGVWFDH MCQVIVPQLA PHGPIVAVQS DNETCYMFHE QAYATDYSPS SLALYQAMLA EQYGAIEELN QAYATNYANF SEVIAPRDAD IASRPDLVAH LDWVRYKEVH VNWIVATMAD MLRERGVVDV PLFHDVAFQY RTPLDINAME ANGHVDWVGI NYYRQPQGFD GAITLVRYMA GTTRLPFIPE FGSGLWIHHA LTPRPEEAEF VTLAALMYGI KAFNFYMLVE RDRWLACPIT RHGDYRPEYA QLFERLMGFL QQQQWWNFQR KPEVLVLMSY DLGRYWAATS TLHYGHVDLL GLPPALNRVE LDLGFSTDLE VESDDFHPQS WYGSLRSTLD ANHIDYDLSD SHLRSSRIAD YKLVFAQSVD WMSRADQQRL LQAAETGATV WLGPTLPTLD EYFQPCTILA DQLAGKRQVA LGSGHLGLLS QAELGAFCAE VAAGLPVRPQ NPQLAVTSHR HQGREILFVA NPTAETINSS LDFAHPVRLT SLWGAFAGAD LQQQQPISLA AYTIAIVEVQ HD
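The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TGA, although the gene lies on the minus strand of the replicon). The codon table is built from the standard genetic code; translation table 11 assigns the same amino acids to codons and differs in its permitted start codons. The function name `translate` is hypothetical and not part of any database tool.

```python
from itertools import product

# Standard codon table in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid assignments; it differs in the allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate coding-strand DNA, stopping at the first stop codon."""
    # Remove the spaces that separate the sequence into 10-base groups.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAGCGTGA CGCTGACTGC CAACGGTCTT"))  # MSVTLTANGL
print(translate("CATGATTGA"))                         # HD
```

The two outputs match the first ten and the last two residues of the listed protein sequence. Passing the full gene sequence to the same function would be expected to reproduce the full protein, but that has not been verified here.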