Gene Information |
Locus tag | Haur_4084 |
Symbol | |
ID | 5735943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5217358 |
End bp | 5218746 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641281236 |
Product | TPR repeat-containing protein |
Protein accession | YP_001546844 |
Protein GI | 159900597 |
COG category | |
COG ID | |
TIGRFAM ID | |

Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.806562 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTTCGAG TGAAGATTTT ACTACTTGCA ATTATTTTTC TGCCAATTCA ATCAATCAAT CAGGTAAACG CCCAAAAAAT AGCTGAAATT TCACGAAATA CCACAAATGA GCAACTTGAT AGTGATAGCG ATAGTTTGAC CAATAGCTGG GAAACCAACC ATCAACTTAA TCCCTATAGC GCACTTGGCG ACGATGGCGC GGCGGGCGAC CCTGATCACG ATGGCTTGAG TAATGCCAAT GAGCAAATTC ACGCGACCAA TCCGCGTAAC CCTGATAGCG ATGCTGATGG GCTGAATGAT GGCTGGGAAG TGCGCTATTG GCTGAATCCA ACCAGCGCAA TCGGCGATCA TGGAGCCACT GGCGATGCTG ATCGCGATGG CTTGACCAAT CAAGAGGAAT TATTTAGTAA TACTTACCCG AACAATCCTG ATACAGATGG CGATGGGTTA CGTGATGGCT GGGAAGTTGA TCATCAATTT TCGCCGATTA ATACCACTGG TTATCACGGC AGCGCAGGCG ATGTTGATAC TGATAATCTG AATAATTTTC AAGAGCAAGC AGCAGATACT AACCCCCGTC ACCCCGATAG CGATGGCGAT GGCTTGCCCG ATGGCTTGGA AATTGAGCAT GGTTCGAATC CCTTAGATTG TGATTTGGAT GCTGACGCAG ATGGTTTGGA GAATCAGCCA GAACTGGCTT TGGCAACCAA TCCATTTAAT TCTGATAGTG ATGGTGATGG TTTGCCCGAT GGCTGGGAAG TGGCGCAGCA GCTCAATCCC TTGAGTGTTG CTGGCATTCA TGGAGCCAAT GGCGATGGCG ATGCAGATGG TTTGGGCCAA TTGCAAGAAT ATCTGCATCA AACCAACCCA CATCAGGCCG ATAGTGATGA CGATGGCTTA CTCGATGGTT GGGAAGTGCA GCATCATTTG AACCCGCTCA GTAGTTTGGC CGCTGATGGA GCCAACGCCG ACCCCGATAG CGATTCGCTG AGTAACCTTG CCGAACAAGG CTTTGCCACC AACCCACGCA ACGCCGATAG TGATAATGAT CAATTGCCCG ATGCTTGGGA ATTGCAACAG CAACTTGATC CGTTGCTGGC AAGTGGCGAA GTTGGCACAC ATGGCGATCC TGATGCTGAT CGATTAACCA ATCTAGCTGA ATATCGCAAT AGCACCCGAC CGTTGCTGGC GGATAGCGAC GGCGATGGCT TAGACGATGG TTGGGAAGTG CAGTATACGC TTGATCCACT TGATGGATTT GGGGTAAACG GAGCCAATGG CGACCCTGAT AGCGATGGCA TGAATAATTT GATTGAATTG CAACGCCAAA CCCACCCGTT GCAGGCCACG TACCTCGTTA TGCTGCCACT CGCTCAACAT TCCTACTAA
Protein sequence | MLRVKILLLA IIFLPIQSIN QVNAQKIAEI SRNTTNEQLD SDSDSLTNSW ETNHQLNPYS ALGDDGAAGD PDHDGLSNAN EQIHATNPRN PDSDADGLND GWEVRYWLNP TSAIGDHGAT GDADRDGLTN QEELFSNTYP NNPDTDGDGL RDGWEVDHQF SPINTTGYHG SAGDVDTDNL NNFQEQAADT NPRHPDSDGD GLPDGLEIEH GSNPLDCDLD ADADGLENQP ELALATNPFN SDSDGDGLPD GWEVAQQLNP LSVAGIHGAN GDGDADGLGQ LQEYLHQTNP HQADSDDDGL LDGWEVQHHL NPLSSLAADG ANADPDSDSL SNLAEQGFAT NPRNADSDND QLPDAWELQQ QLDPLLASGE VGTHGDPDAD RLTNLAEYRN STRPLLADSD GDGLDDGWEV QYTLDPLDGF GVNGANGDPD SDGMNNLIEL QRQTHPLQAT YLVMLPLAQH SY
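
The lengths listed in the Gene Information table can be cross-checked against the coordinates. The sketch below is only an illustrative consistency check, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 5217358
end_bp = 5218746
listed_gene_length_bp = 1389
listed_protein_length_aa = 462

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# which does not contribute an amino acid to the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codon_count - 1

print("gene length (bp):", gene_length_bp)
print("codons including stop:", codon_count)
print("expected protein length (aa):", expected_protein_aa)
print("matches listed gene length:", gene_length_bp == listed_gene_length_bp)
print("matches listed protein length:", expected_protein_aa == listed_protein_length_aa)
```

Under these assumptions the coordinates give 1389 bp, or 463 codons, which agrees with the listed 462 aa once the stop codon is excluded.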