Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0891 |
Symbol | |
ID | 5732792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1018578 |
End bp | 1019576 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278023 |
Product | glycosidase PH1107-related |
Protein accession | YP_001543667 |
Protein GI | 159897420 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.208733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGCAG CGACGCTTCA ATCCACAACC ATCCTTGGAA GTGCGCTCCC CTCAATTCCA TGGGAAGAAC GTCCGGTCGG CAATTCAGAT GTTGTTTGGC GCTACAGCGG CAATCCGGTG ATTCCGCGTG ATGCCATTCC TACCTCAAAT AGTATCTTCA ACAGTGCTGT TGTGCCCTTC AAAGATGGCT TTGCCGGCGT GTTTCGCTGC GATGATAAAC GCCGCGTGAT GAACATTCAT CGTGGCTTTA GCAAAGATGC CGTCAATTGG GAGATCGATC CCAAGCCGTT GGAATTTTCG GGTGATCCCG AAGTTACGGC CTTTGAATAT CGCTATGACC CGCGGGTCTG TTGGATCGAA GATCGCTATT ATGTCACCTG GTGTAATGGC TATCATGGCC CAACCATCGG CGTAGCCTAT ACCTACGATT TTGAAACATT CCATCAGTTA GAAAATGCCT TCTTGCCGTT TAATCGCAAT GGGGTGCTCT TCCCACGCCG CATCAATGGC AAATATGCCA TGGTCAGTCG CCCCAGCGAT AACGGTCACA CGCCATTTGG CGATATTTAC TACAGCGAAA GCCCCGATAT GGAACACTGG GGCAAACATC GCTTTGTGAT GGGCACAAAA GGCGGCTGGC AAAGCACCAA AATCGGCGCA GGCCCAACCC CCATCGAAAC AACCGAAGGT TGGTTGTTGT TCTATCACGG CGTATTGACT TCGTGCAATG GCTTTGTCTA TAGTTTTGGG GCCGCCTTGC TCGATTTGGA GCAGCCTTGG AAAGTGATTT ATCGCACCGC GCCCTATCTG CTTGCGCCCC AAACCTTGTA TGAATGTGTT GGCGATGTGC CAAACGTGGC CTTCCCATGT GCGGCCTTGA CCGACGCTGC CACAGGCCGA ATTGCCATTT ATTATGGCTG TGCTGATACC GTTACCGGCA TTGCCTTTGC CCAAGTTGAT GAAGTGCTGA GCTTCCTTAA AGCGAATTCA GAGATCTAG
|
Protein sequence | MEAATLQSTT ILGSALPSIP WEERPVGNSD VVWRYSGNPV IPRDAIPTSN SIFNSAVVPF KDGFAGVFRC DDKRRVMNIH RGFSKDAVNW EIDPKPLEFS GDPEVTAFEY RYDPRVCWIE DRYYVTWCNG YHGPTIGVAY TYDFETFHQL ENAFLPFNRN GVLFPRRING KYAMVSRPSD NGHTPFGDIY YSESPDMEHW GKHRFVMGTK GGWQSTKIGA GPTPIETTEG WLLFYHGVLT SCNGFVYSFG AALLDLEQPW KVIYRTAPYL LAPQTLYECV GDVPNVAFPC AALTDAATGR IAIYYGCADT VTGIAFAQVD EVLSFLKANS EI
|
| |