Gene Information
Locus tag | Haur_0285 |
Symbol | |
ID | 5732180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 333436 |
End bp | 334392 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277409 |
Product | glycosidase PH1107-related |
Protein accession | YP_001543065 |
Protein GI | 159896818 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
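The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 333436
end_bp = 334392

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(f"gene length: {gene_length} bp")
print(f"codons: {codon_count} (remainder {remainder})")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (957 bp) and Protein Length (318 aa).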
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00104033 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGATGC AACCTAATCG ACTCGATCTC TTCGCACGCC ATCCAGCCAA CCCGATTTTA CAAGCAAGCG ATTGGCCGTA CCCGATTAAT AGCGTGTTTA ACCCTGGTGC AACCGTGCTC GCCGATGGGA CGACCTTGTT ATTATGCCGC GTTGAAGATC GGCGTGGCCA TTCGCATTTT TGTGCAGCTC GCTCGGCCAA TGGGCTTGAT CATTGGCAAA TTGATGCCCA CCCAACCTTT GCCCCCGATC CACAGCAGTA TCCCGAAGAA CGCTGGGGGA TTGAAGATCC ACGCATTACC TATCTTGAGG AATTAGCGGC CTATGGGGTT GTATATACCT CCTATGCGAC TGGTGGGCCT GGGGTTTCCT TGGCAACCAC CACCGATTTC AAGACCTTTA CCCGCTATGG GGTCGTGATG CAGCCCGAAA ATAAAGATGC GGCGCTCTTT CCAGTGCGGA TTAACGGCTT ATGGGCGATG ATCCATCGCC CAATTGGCGC ACAAGGGTCG CATATTTGGA TGTCGCTCTC ACCGGATTTG CGCCATTGGG GCCAACACCA ATGTATGCTT GAGGCTCGTA AAGGCGGATG GTGGGATGCT AATAAAATTG GGTTGTCGCC ACCGCCGATC GCCACCGAGG AAGGCTGGTT GATGATCTAT CATGGCGTTC GGATGACTCC TGGTGGGTGT CTCTATCGGC TTGGCGTAGC CCTGTTCGAT ACCGAACATC CTGAGCGCTG TCTGCGGCGT GGCGAACCAT GGGTCTTGAG TCCGCACACG GAGTATGAAC GCCATGGTGA TGTCCCCAAT GTGATTTTCC CCTGTGGCGT GACCGTCCTG CCGGATGGCG ATACCCTCCA TGTCTATTAT GGAGCCGCCG ATAGCTGCGT TGCGGTTGCG ATTGGAAGTA TTCGCCAGAT TCTTGATTGG CTGACGATCT ATGGAGTGGC GACGTAA
Protein sequence | MMMQPNRLDL FARHPANPIL QASDWPYPIN SVFNPGATVL ADGTTLLLCR VEDRRGHSHF CAARSANGLD HWQIDAHPTF APDPQQYPEE RWGIEDPRIT YLEELAAYGV VYTSYATGGP GVSLATTTDF KTFTRYGVVM QPENKDAALF PVRINGLWAM IHRPIGAQGS HIWMSLSPDL RHWGQHQCML EARKGGWWDA NKIGLSPPPI ATEEGWLMIY HGVRMTPGGC LYRLGVALFD TEHPERCLRR GEPWVLSPHT EYERHGDVPN VIFPCGVTVL PDGDTLHVYY GAADSCVAVA IGSIRQILDW LTIYGVAT
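The sequence fields are printed in space-separated blocks of ten characters, so they need to be joined before analysis. The sketch below is one possible way to clean such a field, compute its GC fraction, and translate it. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code; the table's alternative start codons are not handled, which does not matter here because the gene begins with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 amino-acid assignments, identical to the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(field: str) -> str:
    """Join the space-separated blocks into one uppercase string."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence field above.
fragment = "ATGATGATGC AACCTAATCG ACTCGATCTC"
dna = clean(fragment)
peptide = translate(dna)
print(dna)
print(peptide)  # first ten residues of the Protein sequence field: MMMQPNRLDL
print(f"GC fraction of fragment: {gc_fraction(dna):.3f}")
```

Passing the complete Gene sequence field to the same helpers gives a translation and GC fraction that can be compared against the Protein sequence and GC content listed in the record.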