Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5289 |
Symbol | |
ID | 5737247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 77853 |
End bp | 80426 |
Gene Length | 2574 bp |
Protein Length | 857 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641282453 |
Product | TPR repeat-containing protein |
Protein accession | YP_001548044 |
Protein GI | 159901799 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.752131 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATTC GAACCCTTAT CCCACCCCTT GTTGATGCTG TGTTAGCAGT GTGTCCTGTT TATGAGCGTG CCACTATAGC CACGGCACTT GAACGAGTTC TTGTTGGTGA ACACATCACG CTTGGCAATA ACACAATCTC AATGTTGTTT GGTCAAAATA ATGATTTTAG CAATGCAAAG ATTATGATTG GTTCGATCCA AGCTGGACAT ACTATTTCTA TTAATGTTCA ACCTATCATA GATAAATCTT TTTCTGATTC TACTCATACT ACCGATGATG ATTTAAAGCA AGCAAGGTTA TTGCTATCTC ATATACCATT GGCCTCGCTA CCAAAGAAAG GGTTATTGCC TAAAGGTTCA CGGATTCCCT TTAAAGATAA CCCTTTCTTT GTTGGGCGAG ACGACATGTT ACTTTCTATT GCTAGTACTT TTTTTTCTTG TCACTCTGAT GCACCTATTC CGACTATTGG ATTAGTGGGA ATGGGAGGGA TTGGAAAAAC GCAGTTGGCT GTGGAATTTG TCTATCGCTA CGGTTCATAC TTTGCTGGCG GGATCTTTTG GCTTTCCTTT GCACAGCCAG ATTCGATTAA TACAGAGGTG ATTGATTGTT ATAAATATTA CTGTCCACAA GTTATCGAAG ATTCGGCTGA AAAACAGATT GCCTATATGA AATCCCTTTG GATGAATCCT TTACCTCGAT TACTAGTTTT TGATGACTGT AACGAGGTTG ATTTGCTAGA GAAATGGCGG CCCCAATCAG GGGGATGTTA CGTGTTGGTC ACGAGCCGTA GGCAGCAATG GCCTGCAACT GTAGAACTTT CCCTCCTGTC TGTATCAACA CTTGATCTTG CAGGCAGCCT TGACTTACTT TGCCTTTATC GCCCTGATAT AAGAGAAGAC CAAGCCTTAG GACAAAAGAT TGCTCAAAAA CTTGCTAATC TGCCATTAGC TATTCATATG GCTGGAAGTT ATCTAGCACA CTATAAACTT AAGCTAGAAG TATATTTAGC CCAATTGGAT CAAGGAATTA CCCACGAATC AATGAAGGGG AGAGGAACGT TCCATCAACC TACTAATCAT GAAAGTGTTA ATGTTACGTT TAATATGGCG TTAAATAACC TTTCCAACCA TGAACCAGTG AATATAATTT CACGTTTGCT CTTAGCTAGG ACAAGTTGGT TAATGTGTAA TGAGCCAATT CCTAAAACAC TATTACAATC GTTTGCAGTA GATAAAGGGT ATGATGATTT AGATATTATA GACTCTATAC ATAAAATTGT TAACTTAGGC CTCTTAGAAA TAACTATCCA TACAGGATTT CGAATTCATG AACTGATAGC AGGATTAATA AAGGATAAAA TTAATGATGT TTCTGCATAT TCAGATGTAG AGAGGATTTT AGGATCAAAA CTAGCTTCTC GGTCTACTTG GGAGGAAAGA GAAGAATTAC AGTGGCTCAT TCCTCATGCA TATACAATTG TTCAATATGC ACTGAAGAGA CAAGATACTA ATAGTGCAGA TTTTATGTAT GGTTTTGCAA GATCCCTCAA GAGAAATTCT GATTATAAGG CATCTTTTAT GTATCATCAG AAAGCCCTAG CAATTCGTAA AAAAATCTTT GGTGATAATC ATGTAAATAC AGCAAAAAGT TTGAATATGC TTGGGATGTT ATGTCGAAAA ATGGCCAACT TTAATATGGC CAAGGAGTTT TATGAACAGG CATTAGAAAT TTATCAAAGA GATTTGGGTG ATGATCATCC TACTACACTG AGTACTTTAA ATAATCTAGG CTATCTATTA AAAGCGCAGG GCGATTTACT TCAGGCCCGG GAGTGTTATC AAAAAGTACT AAAGAGTCGT CTTATAAATA GAGGGGAGGA ACACAGAACT ACAGGTACGA TTTTTAATAA TTTAGGCATG GTTCTAAAGG ATCTGGGTGA TTTAAAATCC GCCCAAGAAC ATGTAAAGAA AGCAGTAGAA ATTCGTAAAA GGATATGTGG TGATAACCAT CCCGATACTC TTATAGCTCT ATATAATTTA GCTAGCATTC TTTTTGATTT AGGGTTGATT GAGGATTCCT ACAAAGTTGC AAAATCAGTC CTTGACGATC GTTGCCGTAT TTTAGGGGAT GAACATTCTA GTACTGCAAG TAGTTTGTAT CAGGTTGGAA AATTGCTCCA TACTAAAGGT GAGATTGATT CAGCGCTGTA TAATCTTGAG AAAGCATTGA CGATTCAAAA GAAATTGCTT GGTATTGATA ATCCATATAC AGCTTTAACA CTTCAAGAAT TGGGTAGGTT GTTTCAATCA AAAGGAGAGT TTGAATTGGC ACGACATAAT ATTGAGTATG CGCTTGGTAT TCAACAAAGA ATATTCGGAT TAAATCATCC TGCTATCGGT TTAAGTTTTC ATAATCTAGG TGAATTATAT GAGAAGATGG GGAATTTGCA GATTGCTCAC TTCTACTATA AGCAAGCATT AGAGGTTAGA ATACATATTC TTGGAGAAAA TCACCCATCA ACAATAGGCA CAATAGATTG CCTTAATCGT AGCAGTAATT GGAATAAAAT ATAG
|
Protein sequence | MDIRTLIPPL VDAVLAVCPV YERATIATAL ERVLVGEHIT LGNNTISMLF GQNNDFSNAK IMIGSIQAGH TISINVQPII DKSFSDSTHT TDDDLKQARL LLSHIPLASL PKKGLLPKGS RIPFKDNPFF VGRDDMLLSI ASTFFSCHSD APIPTIGLVG MGGIGKTQLA VEFVYRYGSY FAGGIFWLSF AQPDSINTEV IDCYKYYCPQ VIEDSAEKQI AYMKSLWMNP LPRLLVFDDC NEVDLLEKWR PQSGGCYVLV TSRRQQWPAT VELSLLSVST LDLAGSLDLL CLYRPDIRED QALGQKIAQK LANLPLAIHM AGSYLAHYKL KLEVYLAQLD QGITHESMKG RGTFHQPTNH ESVNVTFNMA LNNLSNHEPV NIISRLLLAR TSWLMCNEPI PKTLLQSFAV DKGYDDLDII DSIHKIVNLG LLEITIHTGF RIHELIAGLI KDKINDVSAY SDVERILGSK LASRSTWEER EELQWLIPHA YTIVQYALKR QDTNSADFMY GFARSLKRNS DYKASFMYHQ KALAIRKKIF GDNHVNTAKS LNMLGMLCRK MANFNMAKEF YEQALEIYQR DLGDDHPTTL STLNNLGYLL KAQGDLLQAR ECYQKVLKSR LINRGEEHRT TGTIFNNLGM VLKDLGDLKS AQEHVKKAVE IRKRICGDNH PDTLIALYNL ASILFDLGLI EDSYKVAKSV LDDRCRILGD EHSSTASSLY QVGKLLHTKG EIDSALYNLE KALTIQKKLL GIDNPYTALT LQELGRLFQS KGEFELARHN IEYALGIQQR IFGLNHPAIG LSFHNLGELY EKMGNLQIAH FYYKQALEVR IHILGENHPS TIGTIDCLNR SSNWNKI
|
| |