Gene Information |
Locus tag | Haur_0081 |
Symbol | |
ID | 5731974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 105986 |
End bp | 106990 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277203 |
Product | HEAT repeat-containing PBS lyase |
Protein accession | YP_001542861 |
Protein GI | 159896614 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
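The coordinate and length fields in the table above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the database record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 105986
END_BP = 106990
GENE_LENGTH_BP = 1005
PROTEIN_LENGTH_AA = 334


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of the record.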
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAG AACAAAATGA ACGCGATATT GAGGAAGCCT TGGCCGAAAT TGGCGACCAT AATCGCCCAA TTGTCTTGAG TGAACTCAAA GTTTTTAACG ACCTTGACCA AGATGAAACT GAGATTTTTG AAATTGAATG GCTGCGCATC GAGCCAAGTC ATCGGCGTAA TTTGGCTCAA GCGATGCAAG AGGTTGGTGA GGCCAGCCTT GAGCTTGATT TTCGCGCCGT ATTTAGCATT TTGCTGACCG ACCAAGATCC TGCAATTCGC ATTGCAGCAG TCAAAGGTAT GGCCGAAGAT ACCCGCCGCT CAAGTTTGCG CCGTTTAAGC GAATTGCTTA CAACCGATCC CGATGATGGG GTGCGAGCTA ATGCGGCGAT CACGCTTGGT GCTTGGGCTT TACGCGCAGG CGAAGGCAAT CTCGATCAGC GTACCAGTAA TGAATTATTG CAAACGCTCT GGGCCGTTTT TGATGATCGC CAAACCTCGA CCTTGGTACG CCAACGCTTG TTGGAAACCT TGGGCTACTT GGCCGATAGC GATCCACGAG TCAATCAGGA AATTGGGGCA GCTCACCAAC GACTTGATGA TGGTTGGCAG GCCGCCGCGC TTTGTGCCAT GGGTCGCACT GGCTTAGATC AATGGTTGCC AACAATTACA GCTAGTTTGC GTTCCCATGA GCCATTGTTA CGTTTTGAGG CAGTGCGGGC CGCTGGTGAG CTTGGCGATT TAGCCGAATC GATTGTCAAC CACGTGGCGC GGGCTACTGC TGATGGTGAT GTTGAAGTTG CTACAACTGC GATTTGGGCA TTGGGCCAAA TTGGTGGCGC TGCTGCGCGA CGCTTTCTCG AACAATTAGT CAACGATTCA GAGGGTGTTC GGCGTGAAGC CGCTGCCGAA GCACTCAAAG AATTGCAATT CTTCGACGAT CCCATACAAT CCTTGCCCCT CGACGACGAT GAGGACGAGG ACGAATATTG GTATGGCGAT GACGAGGACG AGTAA
|
Protein sequence | MTEEQNERDI EEALAEIGDH NRPIVLSELK VFNDLDQDET EIFEIEWLRI EPSHRRNLAQ AMQEVGEASL ELDFRAVFSI LLTDQDPAIR IAAVKGMAED TRRSSLRRLS ELLTTDPDDG VRANAAITLG AWALRAGEGN LDQRTSNELL QTLWAVFDDR QTSTLVRQRL LETLGYLADS DPRVNQEIGA AHQRLDDGWQ AAALCAMGRT GLDQWLPTIT ASLRSHEPLL RFEAVRAAGE LGDLAESIVN HVARATADGD VEVATTAIWA LGQIGGAAAR RFLEQLVNDS EGVRREAAAE ALKELQFFDD PIQSLPLDDD EDEDEYWYGD DEDE
|
| |
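The gene and protein sequences above are related through translation table 11. The following is a minimal sketch of how the protein sequence and a GC fraction could be derived from the gene sequence. It assumes that table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in which start codons are permitted), and it strips the spaces used to group the sequence in blocks of ten. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and the database may compute its GC content field differently.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence):
    """Remove whitespace (e.g. the 10-letter grouping) and upper-case the sequence."""
    return "".join(sequence.split()).upper()


def translate(dna):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last blocks of the gene sequence from the record above.
first_blocks = "ATGACTGAAG AACAAAATGA ACGCGATATT"
last_blocks = "GACGAGGACG AGTAA"
print(translate(first_blocks))
print(translate(last_blocks))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way gives values that can be compared against the Protein sequence and GC content fields.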