Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1597 |
Symbol | |
ID | 5733484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1850567 |
End bp | 1852960 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641278736 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_001544368 |
Protein GI | 159898121 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00177125 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATTAT TACCATCACA ACTCCCCCAA TATGCCACGC GCCTGATTGG TCGCACCCGT GCCCGCACGT TGGTAATCGA CCTGTTGCTT GATGCTCAAG CGCGGTTGGT AACATTATAT GGCCAAAGTG GTGCTGGCAA AACGCGCCTA AGTTTGGAAG TTGCCGAGCA GGTGGGCGAA ATTTTTCGTG ATGGCCGCTA TTTTGTAGCG CTCGCTCCTG TTTCGCAAGC CCAGTTTGTG CTGCCAACAA TTGCCGCAAC GTTAGGCGTT GAAGAATCTC AGCACGAAGC AATTTTAGAT TCGTTAATAT TGGCCTTGGC CGATAAGCAA ATTTTATTAA TTCTTGATAA TTTTGAGCAA GTTGCTGGAG CCGCCAGCGA ACTATTGGAG TTAATCCGAC GTGCACCGAA CCTAACATGT TTAATCACGA GTCGTCAAGC GCTCGAAGTT GCTGGCGAAA CCGCGATTAT GGTTCCAGCC TTGCAATATC CTGAGCTTGG TGAAGACTAT CAGCTTGAAG ATTTAGAGCA ACATAGCGCA ATTGGCTTAT TTGTTGATCG CATGCGCACA CGTCAGCCAC GGTTTCGCTT GAGCGCTGAT AATGCTGGAG CTTTGGTCGA TATTTGTCGT TTGGTGCAGG GCTTGCCCTT GGCGATTGAG TTAATCGCGG CCCATAGTGC TTCGTTGACC CCCCAAGATT TGCTGTTTTT CGTGCGTAAT CATCTTTCCA TGGCGGCTTT GAATCCGAAA CAATCGGCGC GGCAAGCGAT TATCAAGCCA GTGCTGGCTT GGAGTGTGAG CATGCTGCCA GCTGATGCCA AAGATATTTT TGCTCAATTA GGTGTTTTTG CTGGTGGTGC AACCGTCGAA ACAATTAAAC AGGTTGGTTT GGTTGAAACC ATGCCCTTTG AATCCAGCCT AAATGCCTTG ATTGATCGCC ATTTATTGCA AACTGAACAA TTGCCAGGCC AAAAAGCCCG TTTTATTATG ATTGATGCGG TGCATGAATA TGCTTTAGAG CAATTGCAAA AAACGGGCCG CCTATATTAT TTGCAAGAAC GTCATGCCAT CTATTATCAA ATATTGAGCG AAACAGTGCA TCAGAATATC CGGGGGGCTG ATGGTGCTAA ATGGATCGAA CAATTGCGTG GTGAGATTCA TAATATTCGC CAAGCGATGA TTTGGTCGCT TGATAGTAGC GATGGTTTGG TCGCTCAGCG AATTGCTGGC AATCTCTATT TTTATTGGTA TCGCACGAGT GCTTATCGCG AGGCTGTGGC ATGGCTCGAA CAAACCTATC AACATTCCAA TCGCAGCGAT TTAAGTGCAA TTGCCCGAAT TGCAACAGGA TTAGGTGGTT TATTAATTAG TTTGCTCCGT TTTGCCGAAG CTGAACGCTA TTTAATTGAA GCTCGTCGTT TGTGGCAAGA GCTTGGCTTA CCGCACGATG AAATTAGCGC AATTGGTAAT TTAGCAGTAT TGTATGGCAC GCTTGGCCGT TTGCATGATT CGCAATTGGC GTTTGAAGCA GCCTTGGCTT TAGCCCGCAA GGTAGGTAAT CAACAGCGCG AAATTTTGAT GCTGCATAAT CTTGGCACAG TAGCCCAAGA ACGGAATCAA TTAGCGACCG CCCAAGCCTA TTTTGAGCAA GCTTTAGCGC TCAAACAACA GGTTAATCAA ACCTGGGATA TGTTTCTAAC CCAAATTAAT CTCGGCTTAG TAGCGGTTGA TCAAGGGCGT TATGCTGAAG CTGAGCAATG GTTTGAGCAA GCGTTTATCA ATGCCTATGC AATCGGTGAT CACGATAGTT TGGCCTATAT TCGTTATGCA CGCGGGATAT CCGCAGCTGA GCAAGCTGAT TATGTCCAAG CTGAGTTGCA TTTTCGCGAG TCAGAGCGAG GGTGGCATAC CGTTGGCAAT CTGGAAGGAG TTCAGCGTAG TTGGCTTGAG CAAGCAGCAC TTTTAATTGC CACTGCCAAT TATGCCCAAG CTGCTGAATA TTTGCATAAG GTTGAGCCAA TTGAGGCACT AAGCCAAGAA TTACAATTAC GCCATATCAT TTTGGCAACT CGTTTAGCAA TCGCAATCGA TGATCAGGCT GCGATGCAAC ACCAAGCTCA ACGAATGCTC GCAACTGCTT TGGCCAGCGA GCTACGCCGG TTTGATCTGA CGATGTTGCA GCATAGTGCG GCAGTCCTAG TCGCAACCCA ACCAACACTG GCGGCTCAAC TTTTAGCAAC CGCCGAGCAG CTTCGGGTTG AACGTAACTT GCACCAAAGT GTTGCTGAGC AACAATGGCT GGCCCAAACC AATGTAGCTC GGCTTGTACC AACCATAGCT TTGGATTTAA CCGCTGCTTT GCAAGCGGCT CAGGCTAGCT TAGCTGCGCA ATAA
|
Protein sequence | MPLLPSQLPQ YATRLIGRTR ARTLVIDLLL DAQARLVTLY GQSGAGKTRL SLEVAEQVGE IFRDGRYFVA LAPVSQAQFV LPTIAATLGV EESQHEAILD SLILALADKQ ILLILDNFEQ VAGAASELLE LIRRAPNLTC LITSRQALEV AGETAIMVPA LQYPELGEDY QLEDLEQHSA IGLFVDRMRT RQPRFRLSAD NAGALVDICR LVQGLPLAIE LIAAHSASLT PQDLLFFVRN HLSMAALNPK QSARQAIIKP VLAWSVSMLP ADAKDIFAQL GVFAGGATVE TIKQVGLVET MPFESSLNAL IDRHLLQTEQ LPGQKARFIM IDAVHEYALE QLQKTGRLYY LQERHAIYYQ ILSETVHQNI RGADGAKWIE QLRGEIHNIR QAMIWSLDSS DGLVAQRIAG NLYFYWYRTS AYREAVAWLE QTYQHSNRSD LSAIARIATG LGGLLISLLR FAEAERYLIE ARRLWQELGL PHDEISAIGN LAVLYGTLGR LHDSQLAFEA ALALARKVGN QQREILMLHN LGTVAQERNQ LATAQAYFEQ ALALKQQVNQ TWDMFLTQIN LGLVAVDQGR YAEAEQWFEQ AFINAYAIGD HDSLAYIRYA RGISAAEQAD YVQAELHFRE SERGWHTVGN LEGVQRSWLE QAALLIATAN YAQAAEYLHK VEPIEALSQE LQLRHIILAT RLAIAIDDQA AMQHQAQRML ATALASELRR FDLTMLQHSA AVLVATQPTL AAQLLATAEQ LRVERNLHQS VAEQQWLAQT NVARLVPTIA LDLTAALQAA QASLAAQ
|
| |