Gene Information |
Locus tag | Haur_4453 |
Symbol | |
ID | 5736304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5695880 |
End bp | 5696860 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281616 |
Product | HEAT repeat-containing PBS lyase |
Protein accession | YP_001547213 |
Protein GI | 159900966 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
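The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the stop codon is included."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5695880, 5696860)
protein = protein_length_aa(length)
print(length, protein)  # 981 326
```

Both results agree with the Gene Length (981 bp) and Protein Length (326 aa) fields listed above.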
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0155685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTAGAC CAGCTATTTT GCCACTGAGA GATGTACAAA TGCAGCCAAT TGAACAAACC GAAACCTACA GCTCCGCCGA AATGCAGGCG CTGGCACAGC ACCCTGAGGT TGAGCAACGC CAAACCATCG CTTACGATCT GCGTTTCTCG ACCAACCCCT TAACCCTGGC AATCTTGATT GACTTATTAA ATGATCCAGA TCAAGCAGTG CAAATTAACG CAATTCGTTC GATTGGTTTG TGGGGTGCAC GGCGCAATTC GCCCAGCCTG ATGCAGCCAG CAACTCAGGC TCTATTAGCG CTAGTTCAAC AATCACACGA GCAGATGCTA CTTGATCACG GGTTAATTAG CCTTGGTGAA ATTGGCGATC AGTTATCAAT TGACTGGTGT TTAAGCCAGC TGATCAATCA ACCTCGTTCA CGCTTATGCG CGGCAACAGC CTTGGGTATG CTCAAAGCCG AAAAAGCTCG CCCGTGGCTC TTGACCATTC TAGCCGATCC GTGCCAAGCA TCGATTGTCC GCACAACCTG TATTGAGGCA TTGAGTCAGC TCGCATTCGA CCTTCCCACC AACCAAACCT TGATCGCCGC GCTGCAAGAT TCCGTAGCCG AAGTGCGCGA AAAGGCTGGC TTAGCGCTCT GTAAGTTGGG CGATTTTAGT GCCTTCAAGC CAATCTGGGC CTACATTCGC CTTGAAACCG CAATCAAGCC CAGCCAAGTT GCCCATGCAT TAGCCTTATT TGGCGACCAA GCATTTGAGC CAACCTTGGC TTTTTTAAAT GATCCTGATC CCAATCTGCG CTATTGGGCC GCCTTAGCGC TCGGCATGTT CCACGATTCA CGGGCGATTC CGGCCTTGAT TGCATTATTG AATGATCAAG CACAAACGCA CACCCGTGCC GTGGTTGCAA CCGCAGCTCG CAAATCGCTT AACCGCCTCC AAAATTTGGC GGTTGGCAAC CCTGACAGCA CTTTAACGTA G
Protein sequence | MLRPAILPLR DVQMQPIEQT ETYSSAEMQA LAQHPEVEQR QTIAYDLRFS TNPLTLAILI DLLNDPDQAV QINAIRSIGL WGARRNSPSL MQPATQALLA LVQQSHEQML LDHGLISLGE IGDQLSIDWC LSQLINQPRS RLCAATALGM LKAEKARPWL LTILADPCQA SIVRTTCIEA LSQLAFDLPT NQTLIAALQD SVAEVREKAG LALCKLGDFS AFKPIWAYIR LETAIKPSQV AHALALFGDQ AFEPTLAFLN DPDPNLRYWA ALALGMFHDS RAIPALIALL NDQAQTHTRA VVATAARKSL NRLQNLAVGN PDSTLT