Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3228 |
Symbol | |
ID | 5735096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4086020 |
End bp | 4088200 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280374 |
Product | TPR repeat-containing protein |
Protein accession | YP_001545993 |
Protein GI | 159899746 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCATC TTCCCCAGTA TTCGACCCGC TTAATTGGCC GAAACCGTGT GGTTGCTGCC TTAGTTGCCC TCTTTCAAGA GCAAGCGCAT CGGCTTGTGA GCTTGATCGG CGCGAGTGGC ACTGGCAAAA CGCGGCTGGG GCTTGAAGCA ACCGAAACTA TCCGCGATAG TTTTACTGAT GGTTGCTATT TTATCAATTT AGCGCCAGTT GATGATGCGG TATTTGTGCT GCCAACCATT GCCCATACCT TGGGCGTGCA TGAAACCGCC AATCAATCAT TGCTCGATAG CGTGGTGAAT TTTCTGCGTG GCAAGCGGGT GTTGTTGATT CTCGATAATT TCGAGCAGGT CAAACGGGCT GCTGATGAGC TTAAATTGTT GATCGAACGC ACTGATCAGG CTCAATTTAT GGTGACAAGC CAAGTTGCGT TGGGCTTGGC TGCTGAATAT GAGTTTAGTG TGCCACCGCT CGAAGTGCCT GAGCAGTCCA ATCTGCCATC CAACCAATTA TTAGAATATT CAGCAATTGC CTTGTTTGTT GATCGCATGC AGGCGATTCA GCCGCGCTTT GTGTTGACTG ATACCCAAGC CAAGGCTGTG GTCGAAATTT GTCGGTTGCT GCATGGCTTG CCTTTGGCGA TTGAATTAAT TGCCGCCCAT AGTTCGGCAC TCTCGCCGAC CGATTTGCTG TTGCTCGTGC GCAATTATTT GGCGCTTGGC CGAGGCGCGG CAATTGCCAA ACCCGCCCGT CACCATGTGC TCTACCCAGT GCTCGATTGG TGCTTTAGCC GTTTGTCAGC GCCGCAACAA ACTTTATTTA GCCGCTTGGG TGCGTTTATT GGCGGTTGCT CTAACGATGC AGTGATGGCG CTCTATCAAA CAGTTGGCGA GTTCGCGACT GCTGTTGATC CAGGCATTAC CAGCTTGAAG CAAAAACATT TGCTCTTGGA AGAAACCATG CCTGGCCAGC AGTCACGCTA CATTATGCTC GATGCCGTGC GCGAATATGC TCACCAGCGT TTGCGCAAAC GCAAAGAAGC CCATCAAATT GATCTGTATC ATGCCCAGTA TTACCGTGAT TTGAGCATAA CCGCCAAAGC TGAGTTGGGC GGCCCCAAGC ATGAATACTG GACAAATCGG CTGCTCAGCG AAATTCATAA TATTCGTTCA GCGATTCAAT GGGCTTTGAG CCATAACGAA GCCAATTTGG CCTTGCAATT AAGCAGCAAT TTGGTGATGT TTTGGGTACG CCAAGGCTAT CTCACCGAAG GCCGCCGCTG GATCACCGCT GCATTAGAAC ATCAAGCTAA GGCAGACCCC ATAATTGTGC GGCCAGCGTT GGGGGCAGCC GCAGGCATGG CTTGGTCGCA AAGCTGTTAT CAAGAAGCCA AACAATACCT TGAAGCAGCC TTAACAATTG AATCAACCAA CCAAGCCGAA ACTGCACGGC TACTCAATAC GATGGGCTTA GTGCTCTATG AACAAGGCTT GCATCAACCA GCTAGCGATT ATTTTGAGCA GGCCTTGGCG ATCTATCGCA CTCTCGACGA CCAGACAATG CTGGGAATTA CACTCAATAA TTTGGGCTTG GTCGAAATTG ATCGTGGCAA TTTGGCGGCA GCACGCTACA TTTATGCTGA AGTTTTGGCC TTATTCCGCC CAGCCAATAA TCCCTTTAAT CTCACCATGC CGCTAAGCAA CTTGGCCTTG ATTGAAATTC TCGAATATCG TTATGCTGAG GCTGAACCCT TGCTGGAAGA GGGTTTAGCG CTGTGTCGTG ATAATCGCGA TTACAATGGC TTGGGCTATT TTCTGACCAA TTTAGCTGCT TGTTTGGGTG GCCAAGGTCG TTTTGAGCGA GCGCGGGATT GCTTGGTAGA AGCGTTGATT CTACGCAAAA ATGCCCAAGC TAAAATTGGC ATGTTGGTTG GAGCCAAAAC TGCCAGCGAA ATCTTGCTCA AGGCCGGCTT GGCTGAGGCA GCGGCGCTGG CATGGGGCTA CGCCAGCGCA TTGTTCAGCG AACTGCAATT AACCCCGCTT TGGTACGATG AGCGTCAACA AACCATGCTC GAAACTGAAT TAAGCACCAT GCTTGGTAAC GAACACTTGG CCAGCCTCAA ACAAACCGGC ACAAGCAAAA CGATCGACGA CATTATTAGC CTGATACAAA CCGCCATATA A
|
Protein sequence | MHHLPQYSTR LIGRNRVVAA LVALFQEQAH RLVSLIGASG TGKTRLGLEA TETIRDSFTD GCYFINLAPV DDAVFVLPTI AHTLGVHETA NQSLLDSVVN FLRGKRVLLI LDNFEQVKRA ADELKLLIER TDQAQFMVTS QVALGLAAEY EFSVPPLEVP EQSNLPSNQL LEYSAIALFV DRMQAIQPRF VLTDTQAKAV VEICRLLHGL PLAIELIAAH SSALSPTDLL LLVRNYLALG RGAAIAKPAR HHVLYPVLDW CFSRLSAPQQ TLFSRLGAFI GGCSNDAVMA LYQTVGEFAT AVDPGITSLK QKHLLLEETM PGQQSRYIML DAVREYAHQR LRKRKEAHQI DLYHAQYYRD LSITAKAELG GPKHEYWTNR LLSEIHNIRS AIQWALSHNE ANLALQLSSN LVMFWVRQGY LTEGRRWITA ALEHQAKADP IIVRPALGAA AGMAWSQSCY QEAKQYLEAA LTIESTNQAE TARLLNTMGL VLYEQGLHQP ASDYFEQALA IYRTLDDQTM LGITLNNLGL VEIDRGNLAA ARYIYAEVLA LFRPANNPFN LTMPLSNLAL IEILEYRYAE AEPLLEEGLA LCRDNRDYNG LGYFLTNLAA CLGGQGRFER ARDCLVEALI LRKNAQAKIG MLVGAKTASE ILLKAGLAEA AALAWGYASA LFSELQLTPL WYDERQQTML ETELSTMLGN EHLASLKQTG TSKTIDDIIS LIQTAI
|
| |