Gene Information |
Locus tag | Haur_1145 |
Symbol | |
ID | 5733037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1310335 |
End bp | 1312545 |
Gene Length | 2211 bp |
Protein Length | 736 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278284 |
Product | TPR repeat-containing protein |
Protein accession | YP_001543921 |
Protein GI | 159897674 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
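The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length; the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(1310335, 1312545)
print(length)                     # compare with the Gene Length row
print(protein_length_aa(length))  # compare with the Protein Length row
```

The strand does not affect this calculation: for a gene on the minus strand the start and end coordinates are still reported in ascending order on the replicon.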
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGACAG AACATTCCGA TCTCCTTCCA CGTTACACCA CTGCTCTAAT TGGGCGTGAT GCTGAAACTG CTGAAATTGG GGCTTTATTG GCCCAAGGTC AGCGGTTAAT TAGTTTAGTT GGGGCAAGCG GCGCGGGCAA AACCCGTTTG GCGGTTGATT GTGCCCGTTT ATATGCCGAC CAATTTGTTG GTGGCTGTTT TTTTGTGTCG TTGGTGCCGA TTCGGGCGGC GGGTTTAGTG CTAGCAACGA TTGCCGAAAG CCTCGGTATC GCTCCCACCT CCGATCAACC ATTGCTCACC ACGATTGCCG CCCATTTTCC ACATCAACCG AGCTTGTTAA TTCTCGATAA TATTGATCAT GTAGTCGAGG CTGCCTCGGA TGTGCAAGCC TTAGTCACAG CCGTGCCGCA ATTAACCATT TTGGTCACTA GCCAAGTGGC GCTGAACGTG GCTGCTGAAA CGATCTATCG GGTGCCATTA TTGAGCGTGC CTGCCGAAGA TGCTCGCTTG AGCGCCAGCG AAATTTTGCA ATATGGCGCT GTGCAATTGT TTAGCGAACG ATTGCGACGG TTGCAGCCAA TGCTCAAAGT CGATCAACCA CAAGCTCAGG CGATCGCCGA AATATGTCGT TTGGTGCAGG GCTTGCCGTT GGCCGTCGAG CTTGTCGCTA GCCATAGTCG CAGCTTACCG CCCTCTGATT TGGTGCGCAT GGTGCGGCAT CACTTGCGCT TGGGTGCGGC TATGATCGAC AAAAACAGCA CCATGCAACG CAAAGAAATT TTGTGGCCAG TGCTCGATTG GTGCTATCGC CTGCTTTCGC CAACGCTGCA AGTGTTGTTT ATTCGCTTGG GGATCTTTCG CGGCAGTTGG ACGCTCGAAT CGGCTGAAGC GATTTGCGCG GGCATTCCCG ACGTACCGAT CAATGTGGTT GAAGGCTTAC AAACCTTGGT CGATAAATCG TTGGTACAGC TTGAAACTCT GCCCAACGGC GATCATCGCT ATCTGATGCT TGATGCGGTG CATGCTTATG CCGAGGGTCG TTTACAACGT CGCCGCGAAG CTAACCACTT GCAACGATTG TATAGCTCAT ACTTTACCCA TTTGGCTGAG AGTGCCGAAA AACCCTTGCT CGGAGCCGAT CAACCGCTAT GGATGGCTCG CTTGCAGAGT GATATTTACA ACTTGCGCTC GGTGCTGGAT TGGGCGGTTG AGCAGCATCC CGCCACAGCC TTGCGGATTG CTGGCTCGTT GTGGTTGTTT TGGTTTACCA AAGGCTATGC CCAAGAAGCG CGGGTCTGGA TTCGCCAAGC ATGGCGGCGC GAACACGAAA TTGAGCCTGT GGTTCGCGCT AAAGCAGCGA TTGCCGCAGG CATTTGTGCC CAATATTTTG CCGATCATGC TGATGCAACG TTATGGTATG AGCGTGGTCA AGCCTTATTC AAGCAAGTTG GCGATACCTT TGGCGAAACC CGCGCCTTGC ATAATTTATC GTCATTGGCC TATCAACAAC GCTACTATCA AAAAGCAGTG GAGCTTGGCG AAGATGTTGT GGCACGCTGG CAAGCCATGG ATGATCGTTC AGGTGAAGCT GCGGCCTTGA GCAATTTGGC CAGCTCCTAC ACTGGTTTGG GTCGTTTCGA TGATGCTGAA CGTTGCTACC AACAAAGCTT GCAGATCAAT CGTAGCCTTG GCAATCAAGT TGGGATTATT CTCTGTCAGA GTGCTCGTGG CTGGCTCGCC TGTGCGATTG ATGATTTGGA ATTAGCCCAA GAGGCACTGG AGGAATCACT CAATCTAGCC 
CTGCAAAACG ATGCTCGCAA CTTAATTCCT GGTTCGCAGA GTGCATTGGC TAGAGTTTGG TATAAAAAAG GCCAAATCCA ACGTGCTTTC GAGCTTTTGC GCCCAGTTTT TGATATTCAA GCTCAATTGC AAAATTTGGA GCAATCGGGC GAAGAGATGA TCACTCTCGC GGCAATTTTG GCTGAGCATT ATCCCAAATT GGCCCAACAA GCCTTGGAGT TTGCTGAGCG CATGCAAGAT CCTTATAACC AAGAGCTTCA GCCTGATCCA GCCACACTGC TCTTTTTCAA GCAAACTGCC GAGCAGGTGA GCCATTTTGT GGGGCAAGCT GCGGCTACAT CAACCCTGCC AACAATTAGC GAATTTCTCG ATTCGGTCGA TCAAACGGTT GATCCACGAG TTTTAGGTTA G
|
Protein sequence | MQTEHSDLLP RYTTALIGRD AETAEIGALL AQGQRLISLV GASGAGKTRL AVDCARLYAD QFVGGCFFVS LVPIRAAGLV LATIAESLGI APTSDQPLLT TIAAHFPHQP SLLILDNIDH VVEAASDVQA LVTAVPQLTI LVTSQVALNV AAETIYRVPL LSVPAEDARL SASEILQYGA VQLFSERLRR LQPMLKVDQP QAQAIAEICR LVQGLPLAVE LVASHSRSLP PSDLVRMVRH HLRLGAAMID KNSTMQRKEI LWPVLDWCYR LLSPTLQVLF IRLGIFRGSW TLESAEAICA GIPDVPINVV EGLQTLVDKS LVQLETLPNG DHRYLMLDAV HAYAEGRLQR RREANHLQRL YSSYFTHLAE SAEKPLLGAD QPLWMARLQS DIYNLRSVLD WAVEQHPATA LRIAGSLWLF WFTKGYAQEA RVWIRQAWRR EHEIEPVVRA KAAIAAGICA QYFADHADAT LWYERGQALF KQVGDTFGET RALHNLSSLA YQQRYYQKAV ELGEDVVARW QAMDDRSGEA AALSNLASSY TGLGRFDDAE RCYQQSLQIN RSLGNQVGII LCQSARGWLA CAIDDLELAQ EALEESLNLA LQNDARNLIP GSQSALARVW YKKGQIQRAF ELLRPVFDIQ AQLQNLEQSG EEMITLAAIL AEHYPKLAQQ ALEFAERMQD PYNQELQPDP ATLLFFKQTA EQVSHFVGQA AATSTLPTIS EFLDSVDQTV DPRVLG
|
| |