Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2533 |
Symbol | |
ID | 5734411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3237571 |
End bp | 3239967 |
Gene Length | 2397 bp |
Protein Length | 798 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279673 |
Product | TPR repeat-containing protein |
Protein accession | YP_001545299 |
Protein GI | 159899052 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.617329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAGC ATCTCTGGCA GGTTGAATGG CTCCAACAAT TACTGCAACA ACCGCATATC GCCCTCAATC AGCCAGCCAT CAAGCAGCTT GTCCATGCCA AAGGTGGTCT GCATCCATTA TTGCAATGGC TTTGCCAACA ACCATTACAG CCAGATCAAC AACGCTTGCT ACAATTGATT GTGGCCGAGC CTCAACGCAG CATCGCCTAT TATGCCGATA GTTTAGGCAT TCATGCGACT ACCTATCATC GCCATTTCAA AAAGTTATGT CAATATCTAC TGACGCGCCT AAATGATGGT ATTGCCGAAC CGCAAGCCAG TGCCAGTTTT AACCTGCCAA TTCCACCAAC CCGCTGCTTG GGCCGTGGCA ATGAACAACA AACCATCAAT CGGCTGTTTG CCAAAGGCCA ACGCTTAATC AGCATTTTGG GGTTTGGTGG GGTGGGCAAA ACCCGCTTGG CATTGGCAAT TGCCGAAATC CAGCAGGCGC ATTATCGCGA TGGCGTTTGT TTTTGTGGGC TAGCCAGCAT CAGCCAGCCG CAATTGGTTT TAGCAACGAT CGCCGAAGCA CTTGGCGTAG CGATTGGCCC ACAACAAACC CCTGAAAAAG CGCTACAACA ATTCCTTGCT AAGCGCCAAA TCCTGCTAAT TTTGGATAAT GTTGAGCATG TCGTTGAAGG GGTTGCCGCG ATTGGTCAGC TGTTGCGCGA AGCACCACAG TTGCAAATTT TGGCAACAAG CCGTGTACCG CTCAATTTAT ATGGCGAATA TATGCTGCAA CTTCAGCCGT TGGTCGTGCC AACCAAGCCA ATTGCCAGCC AAGATTTAGC TGAAACGCCA GCAATTGCCC TGTTTATTGA GCGTGCTCAA AGCCATGCTG CTAGATTTAG CCTCGATGAC GCTAGCCTTG AAGCCATCCG CCAAATTTGT AGTCAACTTG AGGGCTTACC TTTAGCACTC GAATTAGCGG CTGCGCATAC CCGGGTACTT TCGCCGCAAC GCTTAGTTCA ACAATTAAGC AACCATGTGC TTGGCCTCAA AACCAGCATC CGCGATTTGC CCGAACGTCA ACGCAGCCTG CGCAACCTGA TTAGCTGGAG CGTCGATCTG CTTGCGCCAA GCCAGCAACA AGCCTTGCAA GCCTTGGCAA TTTGGCCCGC TGGCTGGACG CTCAGCAGTG CGAGCTTTGC GCTTGATCTA GCCGAGGATG ATCCGGCACT CTACGACATT TTGGCAAACT TGGTCGATCA TCATTTGATC ATGCAAGTGC CTCATGCTGA GCGCTGGGTT TTGCACCCGT TTGTCCGTGA ATATGTGCTC GAACAACTAG CGCCAGCCCA ACAGCAACAA TTTGCCCAGC AACATTTGGC GTGGGCTACC CAACTGAGCG AGGCCTTTAA CCAGCATTTT GCCGCTGCCC AACAACAATG GCTCGATTTA TTCGAGGCCG AGCACGACAA CCTCCGAGCA GCCTTGGCTT GGGCCAGCAG CAACCACTAC CCGATTGCTG CATTGAATAT TGCCGTTAAT AGTTGGCGTT TTTGGTGGGC ACGCAGCTAT TTCAATGAAG GCCAACAATG GCTGGATCAA ACATTAGCCC AAGCCACGAG CCAAACCCAA AAGCTTGAGC CAACCCTGCA TTCACGCGCC TTGAATGCCT GTGGAGCTAT TGCTTGGAGC CGTGGCGATA TTGCAGCAGC CCAAGCGCAT TTTAGCCAAA GCCTCGCCCT CTACGATGCC AACAACAACC CAATTGGCGC GGCGATGGTG CGCAATAATC TAGCACTGGT GGCAATTAAA CAGCAGGCCT ATGCCCAAGC TGAAAGCTTA TTTGAGCTGA ATGCAGCAGT TTTTGGTAGC ACGGTCGAAC AAGCTGCAAA TTATGCCGCA ACCTTGAGCA ATTTGGCCAT GTTGGCGCGT TATCGTGGCG ATTTGGAACG CGCCTATGAT TTGGCCCAGC AAACCCTCAC CCTACGTCAA ACCCTCAATA ACCAGTGGGC AACCGCAACG TCATTGACCA ATTTGGGTGC GATTAGCCTA CAACGCGGCG AATTTCAGCC AGCCCAACGT TATTATCAAC AAAGCTTACA GGTGTTGCAC CAACTTGGCG AGCGCGAAAG TATCGCCGAA TGTTTGGAGG GCTTGGCAAT TTTGGCAATT CAAGCAGGCC AATACCAGCT TGGGGCACAG CGCTTGATTG TGGTCGAACA CTTACGCGAA TCAATCGGCG CACCCCGCTC TGAGCCAGAG CAAGCCCTCC TAGCACCATG GATCAGCCAA CTAGAGCAGC AGCTTGACCT AGGCATTCGT CAACAACTCC GCCAACAAAC CAGCCTTGAT CGGCTCGAAT CCATGATCGA GCAAGCGTTG AACGAGGGGG TAGAGGCTAG GGGTTAG
|
Protein sequence | MNQHLWQVEW LQQLLQQPHI ALNQPAIKQL VHAKGGLHPL LQWLCQQPLQ PDQQRLLQLI VAEPQRSIAY YADSLGIHAT TYHRHFKKLC QYLLTRLNDG IAEPQASASF NLPIPPTRCL GRGNEQQTIN RLFAKGQRLI SILGFGGVGK TRLALAIAEI QQAHYRDGVC FCGLASISQP QLVLATIAEA LGVAIGPQQT PEKALQQFLA KRQILLILDN VEHVVEGVAA IGQLLREAPQ LQILATSRVP LNLYGEYMLQ LQPLVVPTKP IASQDLAETP AIALFIERAQ SHAARFSLDD ASLEAIRQIC SQLEGLPLAL ELAAAHTRVL SPQRLVQQLS NHVLGLKTSI RDLPERQRSL RNLISWSVDL LAPSQQQALQ ALAIWPAGWT LSSASFALDL AEDDPALYDI LANLVDHHLI MQVPHAERWV LHPFVREYVL EQLAPAQQQQ FAQQHLAWAT QLSEAFNQHF AAAQQQWLDL FEAEHDNLRA ALAWASSNHY PIAALNIAVN SWRFWWARSY FNEGQQWLDQ TLAQATSQTQ KLEPTLHSRA LNACGAIAWS RGDIAAAQAH FSQSLALYDA NNNPIGAAMV RNNLALVAIK QQAYAQAESL FELNAAVFGS TVEQAANYAA TLSNLAMLAR YRGDLERAYD LAQQTLTLRQ TLNNQWATAT SLTNLGAISL QRGEFQPAQR YYQQSLQVLH QLGERESIAE CLEGLAILAI QAGQYQLGAQ RLIVVEHLRE SIGAPRSEPE QALLAPWISQ LEQQLDLGIR QQLRQQTSLD RLESMIEQAL NEGVEARG
|
| |