Gene Information |
Locus tag | Haur_1236 |
Symbol | |
ID | 5733144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1440159 |
End bp | 1441448 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278376 |
Product | hypothetical protein |
Protein accession | YP_001544012 |
Protein GI | 159897765 |
COG category | [R] General function prediction only |
COG ID | [COG4134] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
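
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1440159, 1441448
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1290
print(protein_length_aa(length))  # 429
```

Both results agree with the Gene Length (1290 bp) and Protein Length (429 aa) fields, which is consistent with an unspliced CDS on a single replicon.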
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAAGCAA CAGCACGCAA ACGGATTTGG AATTTGGGTT TGTTAATTGG CCTTCTGAGC CTGATTATCG CCAGTTGTGG TAGCAATACG GCTACGCCAA CCAGTGCTCC CAGCAGCACC AGCCTTGATC TGAGCAATTG GACCAGTATT GAGCAAGCCG CCAACGGCCA AACGGTCAAT TGGTATATGT GGGGTGGCTC GGATAGCATC AATCGTTTTG TTGATGAATT TTATGGCAAA GCGCTCAAAG AACGCTACAA TATTACGCTC AATCGCGTGC CAGTCGCTGA TACTGTCGAT GTGGTCAATC AGTTGCTCAG CGAAAAAGAG GCAGGCAAAA CCAGCGGTGC TGTCGATTTG ATTTGGATCA ATGGCGAGAA TTTTGCTTCG TTGAAACAAG CCAAACTCTT ACGCGGCGAT TGGGGCCAAA GCCTGCCCAA CAGCCAATAT GTCAACTGGA ACAATCCAGC GGTTAATCTT GATTTTGGCG AGCCAGTCGA AAGCCTCGAA AGCCCATGGT CATCGGCCCA ATTTCAGTTG ATTTACGATT CGGCTAAACT TCAAGCCAGC GATTTGCCCC GTTCGTATGC TGCGCTCAAG GAGTATGCCT GCGCCAATCC GGGCAAAGTC AGCTATATCG CACCTGGGCC AGGCGCATTT CAAGGCACCC GCTTTGTTAA ACAAGCCTTA TTTGAGATCA GCGGCGGCGC TGAACAATGG CTCGGAGCCT TCAATCAGCA ATTATGGGAT CAATGGTCGC CCAAGTTGTG GGAATATTTC AATGATTTAG AAGGCTGTTT GTGGCGCGAA GGCAGCACCT ACCCCAAAAC CGAGAACGAA TTACATAGCT TGTTTGCCAA TGGCGAAGTT GATTTTTCAA TCACCCAAGC GATCGCCGGA GCTGGCTCGT TGATCAAGGA AAATTTAGTG CCAGCGAGTG CGCGAGCATT TGTATTCGAT GATAATATGA TCGGCGATTT CAACTATGTC GCCATCCCCA GCACCGCACC AAATCCAGCG GCAGCTTTGG TATTAGCCAA TTTGATTCTC GACCCTCAAC TACAAGCAGC CCAAATTTTG CCAGAAAATG GCTTTGGCTT GGGCTATGGC ATCGACCCAA CCAAGGTCAG TGATCCAGCT TTGGCAGCTA AATTGGCGAG TGCCGCCCAA CAACTGGGCG ACCCAGCAAC GCCTGCCAGC GATTTGGCAA AATCGTTGCG CAGCGATATT GCCGCTGAAT ATCAAAGCTT GATCGAACAG GGCTGGGATG CCAATGTGTT GCGTAAGTAG
Protein sequence | MQATARKRIW NLGLLIGLLS LIIASCGSNT ATPTSAPSST SLDLSNWTSI EQAANGQTVN WYMWGGSDSI NRFVDEFYGK ALKERYNITL NRVPVADTVD VVNQLLSEKE AGKTSGAVDL IWINGENFAS LKQAKLLRGD WGQSLPNSQY VNWNNPAVNL DFGEPVESLE SPWSSAQFQL IYDSAKLQAS DLPRSYAALK EYACANPGKV SYIAPGPGAF QGTRFVKQAL FEISGGAEQW LGAFNQQLWD QWSPKLWEYF NDLEGCLWRE GSTYPKTENE LHSLFANGEV DFSITQAIAG AGSLIKENLV PASARAFVFD DNMIGDFNYV AIPSTAPNPA AALVLANLIL DPQLQAAQIL PENGFGLGYG IDPTKVSDPA LAAKLASAAQ QLGDPATPAS DLAKSLRSDI AAEYQSLIEQ GWDANVLRK
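
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the space-separated 10-base blocks shown above are purely a display convention, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ in which codons may act as start codons). An alternative start codon such as GTG or TTG would therefore not be rendered as M by this sketch; this gene begins with ATG, so the issue does not arise here. All names are hypothetical, and only the first three blocks of the gene are used as input.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGCAAGCAA CAGCACGCAA ACGGATTTGG"
print(translate(prefix))    # MQATARKRIW
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first block of the protein sequence. Applying the same functions to the full gene sequence would be a way to check the 429 aa protein and the reported GC content, although the record does not state how its GC value was rounded.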