Gene Information |
Locus tag | Haur_5271 |
Symbol | |
ID | 5737229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 53865 |
End bp | 56726 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641282435 |
Product | TPR repeat-containing protein |
Protein accession | YP_001548026 |
Protein GI | 159901781 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
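The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming (the record does not say so explicitly) that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that contributes no residue. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Sketch: cross-check the reported lengths against the coordinates.
# Assumption (not stated in the record): Start bp and End bp are 1-based
# and inclusive, and the final codon is a stop codon with no residue.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(53865, 56726)
print(length_bp, protein_length(length_bp))  # 2862 953
```

Under these assumptions the computed values agree with the Gene Length (2862 bp) and Protein Length (953 aa) fields.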
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.650809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTGC CAGCCCTCAT TGCCGTTCTG GTCGATGCGA TTCTCATCAA ATGCCCACAG TATGATCGGG CGAGCGTGAC TATTGCGCTC GAATCAGTCT TTGCGGGTCA TCCCACGCTG TTGGGTGGCA ACTCCATTTC GATGCTGCTT GGCCAAAACA ATGATTATAC CAATGCCACG ATCACGATTG GCGATGTCAA TGCCGGAAAT CAGGTGCATG TCACGGTGAC GCTCCCCCAG CCGATTGATC CCTTACCCGC TGCCCTTGCA GCCCTTGCCT CGATTCCCTT GACAGATGTG CCCACGCCGC GTTCCGATCT CCCCCAAGCC TCACGCCTGC CCTTTGAATC GAGTCCCCAT TTTGTTGGGC GCGAGGCCGA ATTGAACGCG CTCGCGCAGG CGATTGGCAC GGCGCAGCCA GCGGTTGTTA TGCCAGCGGT GGCCACGGGT CTTGGCGGGA TTGGCAAAAC GAGCCTGGTG ACGGAGTTTT CCTATCGCTA TGGGGTGTAT TTTCAGGGCG GGGTGTTTTG GCTGAACTGT GCCGATCCTG ATCAGGTGGC CAATCAAATT GCGGCCTGTG CGGTTGGCTT GAAGCTTGAT ACCACCGGAA TGGCACTCGA TGAGCAGGTG CAACGGGTTT TGCATGCCTG GCAATCGCCC ATGCCGCGCT TGCTCATTTT TGATAACTGT GAAGATCCAG CGATTCTTGA CCAATGGAAG CCTACGGTGG GCGGCTGTCG CGTACTGGTG ACGGCGCGAT CCGATGCGTG GCCAACACTC ACGCAGATTC GGCTTGGGCT GTTGTCGCCT GTCGAAAGTC GCGCGTTATT GCAGCGACTC TGTGCGCGGT TGACCGATGC TGAAGCCGAT GCGATTGCCG AGGATCTGGG GCATTTGCCG CTGGCGTTGC ATCTGGCGGG CAGTTATCTC GCAACCTATC CCCATCATAC GATTGGCCAA TACCGCAAGG ATTTAACGAT TGCCCACCGC TCGCTCAAGG GACGGGGAGC ACTGCCCTCA CCCACGCGCC ATGAACTGGA TGTCGAAGCG ACCTTTATGC TCAGTTTTAA CCAGCTTGAT CCCAATAATG CGCTTGATGC GTTGGCCTTA GGCATGCTCG ATGGCGCGGC GTGGTGTGCG CCAGGTGTGC CAATCCCGTG TGATCTGGTG CTGGCGTTGG TGCTTGCCGA GGCTGGTGAC GATGGACATC CGAGTCGAAC GACGGATCGT GATGATGCGG TTGATGCCCT GCGTCGTTTG CAGCAGCTTG GATTGCTCAA TGGCACTGAA CTCGTCGTAC TGCATCGCTT GCTGGCGCAA GTTGTCCAGT CGCGGTTAGG ATCGTTAGAC ATGTTGGCCG AGGTTGAATC GTGTCTTGCA TGGGTAACGG CGACGGTAAC GGCGAGCGGG AACCCGCAAC GGTTGACACC CCTGATTGCC CATATGCGGT ATGTGACCCT ACGGGCATTA GACCGTGGCG ATCTGGTGGC GATTCAGTTA GCCGATCAGC TTGGCGCATT TGAGCAATTG CACGGTGCCT ATGCCGCCGC ACAGATCGTC TATGAACGCG CACTCGTCAT TTGTGGGTCA CACCTTGGTA TGGGGCATGT CATAACCGCA GGAATCCTCC ATAATTTAGG AGTTACTCTC GCCCATCAAG GGCGGTATAA GGAAGCCCAA GCATGGTATG AACACGCATT GGTTATTACG GAGCAGGTAG TGGGTGCGGA TCATCCATAT ACAGGGGGGA TTCTGTCGAA TTTGGGGGTC GTTTTGGATC ACCAGGGCGC GTATGCTGAA 
GCCCTGCCAT TAATCGAACG AAGCATCGCG ATTCGGGATA GGGTGTTAGG AGCCGATCAT CCCGACACCG CGATGTCCTT GAATAATCGT GGTGTTGTGC TCGAACACCA AGGACGATAT CGCGAGGCGC AGCACTGCTA TGAACAGGCG GTGGCGATCA CGACCGCTCT CGTGGGAGAT AGCCATCCGA CAACGGCAAA GTATCGGAGT AATATCGCGC TGATGTTGGA GCGGCAAGGA CAGTATGCTG CTGCTGCACG CATTCATGAA ACGGTTGTTG CGATCATCGA AACCGTCTTG GGTGCGGAGC ATCCGGATAC CGCGATGAGC CTCCATAATT GGGCCTTCGC GTTAATCAAC CAAGGGCAAG CAGCCCAGGC GCAGACCCTT ATGGAGCGGG CGATTGGGAT TAATGAACGT GTTCATGGAA GGGAGCATCG AGCGACCGCA CTCTGCATAC ACCATCTCGG CCTTGCGTTG ATTCACCAAG AACGCTATGC GGAGGCGCAG CCCATCTTGG AGCAGGCGAT TGGGATCTAT GAGCGGGTCG TAGGACCACG GCATCCCGAG ATCGCAGCAG TTATCAGTAA TTTGGGAGGA GTCCTTGCGC ATCAAGGGCG GTATGGCGAT GCGGAGCAGT GCTATGAACG GGCCTTGGCG ATACGGGAAG CGGTGTTGGG GTCGGAGCAT CCCGATACGG CGACAACAAG AAATAATCTG AACAGTCTGA TTACGGCTAA AGGGTATGGT CTCCGAGCCG TACTCCTTAA TAATTGTGCG GTTTTGTTCG CGTCCCAAGG CTGTTTTAAG GATGCCCAAT ATCTGTTTGA ACAGGCGCTG GCTCTTTATG AGCCGTTACT CCGCCTCCAC CATTCCGATA CGACAACCGT TATTGAGAAT ATGGGATGCC TGCTTATGCT TCAAGATCGA TCAATTGAAG CGGTTGGATT AATCGAACGC GCATGCATGC TGTATGAACA GAGTATTGGG CGAGATCATG AAATGACCAA TCGACTACGG GTATATGGTG CCGAATTGAA GGAATCTATC CAGTTCCAAT GA
|
Protein sequence | MELPALIAVL VDAILIKCPQ YDRASVTIAL ESVFAGHPTL LGGNSISMLL GQNNDYTNAT ITIGDVNAGN QVHVTVTLPQ PIDPLPAALA ALASIPLTDV PTPRSDLPQA SRLPFESSPH FVGREAELNA LAQAIGTAQP AVVMPAVATG LGGIGKTSLV TEFSYRYGVY FQGGVFWLNC ADPDQVANQI AACAVGLKLD TTGMALDEQV QRVLHAWQSP MPRLLIFDNC EDPAILDQWK PTVGGCRVLV TARSDAWPTL TQIRLGLLSP VESRALLQRL CARLTDAEAD AIAEDLGHLP LALHLAGSYL ATYPHHTIGQ YRKDLTIAHR SLKGRGALPS PTRHELDVEA TFMLSFNQLD PNNALDALAL GMLDGAAWCA PGVPIPCDLV LALVLAEAGD DGHPSRTTDR DDAVDALRRL QQLGLLNGTE LVVLHRLLAQ VVQSRLGSLD MLAEVESCLA WVTATVTASG NPQRLTPLIA HMRYVTLRAL DRGDLVAIQL ADQLGAFEQL HGAYAAAQIV YERALVICGS HLGMGHVITA GILHNLGVTL AHQGRYKEAQ AWYEHALVIT EQVVGADHPY TGGILSNLGV VLDHQGAYAE ALPLIERSIA IRDRVLGADH PDTAMSLNNR GVVLEHQGRY REAQHCYEQA VAITTALVGD SHPTTAKYRS NIALMLERQG QYAAAARIHE TVVAIIETVL GAEHPDTAMS LHNWAFALIN QGQAAQAQTL MERAIGINER VHGREHRATA LCIHHLGLAL IHQERYAEAQ PILEQAIGIY ERVVGPRHPE IAAVISNLGG VLAHQGRYGD AEQCYERALA IREAVLGSEH PDTATTRNNL NSLITAKGYG LRAVLLNNCA VLFASQGCFK DAQYLFEQAL ALYEPLLRLH HSDTTTVIEN MGCLLMLQDR SIEAVGLIER ACMLYEQSIG RDHEMTNRLR VYGAELKESI QFQ
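The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is an illustrative example only, not the database's own pipeline: it assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TGA, so no reverse complement is applied even though the Strand field is "-"), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments with bases ordered T, C, A, G.
# Assumption: table 11 differs from the standard table only in which
# codons may act as start codons, so this mapping is used as-is.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-character grouping."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence above.
print(translate("ATGGAATTGC CAGCCCTCAT TGCCGTTCTG"))  # MELPALIAVL
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` would be the natural way to check the whole record.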