Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5154 |
Symbol | |
ID | 5737112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 224079 |
End bp | 228869 |
Gene Length | 4791 bp |
Protein Length | 1596 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641282319 |
Product | TPR repeat-containing protein |
Protein accession | YP_001547910 |
Protein GI | 159901664 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTGA TTCTTGATCT GACCATGATG GATGATGGGC AGGCGACGGT CACTTTGGAT GGACAGGTGT TGGGCACGTT TGATCCCCAT CCGTTGCTCG CCCAGCGCCC GCTGGCCACG CCCGATGCCG ATGGGCTTGC CTATGGGGAT GCGTTGTATA CGGCGTTGCA TGGTGCTGCG TGGCCCGCTG ATTGGCTGAT GGCTCCCGCC CCTGATCCAG CGGCCATCCT CGCGATTCGG AGCAGCGATC CGGTTCTTCA GGCGATTCCA TGGGAGTGGT TGCGCATCGA GGGACGGTGG GCGATTACCG AAACCCTGTT CATCCGGCTC GTTCCCATGA CCGAGCGCAT GCAGGCCCAG CTGGCCGCGA ATCGTCCTGA TGGAACACAG CCCTATCGGT TGGTGGTTCA ATGCTGCGAG CCGCTGCTGT GGTATCGCGA CGAGCAGTGG CAGCCCCTCG ACGCACTTCC TGCATCGCTC CTCGTCACAC CACTCGCCCA AGAGGTCCAG CGGGCGACTC CCGCCCTACC CGTCATTTGG CACGACCTGG CCCCGACGAT GAATGCGTTG TTGCGCACGA TTCCCGCGCA CCGCGAGCCG CTGCTCTATC ACTTCACGGG GCATGGCGAC TGGCATGATG GCCGACCCGT AGTGCTGTTT GACGATGGGA GCGGGCGGGC CGACCCGAAG GATATGGCGC TGCTCAGCCA GCGGTTGCGC CCGCGGACGC AGTTGGCGTT TGTGAATGCG TGCCATAGCG CGGAAGCTCA GGGTGACACC GCCAGTCTCG CGTTCCAACT CTGCCAGCAA GGGACTCCGG TGGTCATTGG GATGGACGGC CCCGTAGAAG ATCGCCATGG ACAGCAGTGG GCCACGGATT TTTATCCGTC GCTGTTGCGG GGTTCCGATC CACCGACCGC GTTATGGGAC GCGCGGTTGG CGATGCAGGA TCGGGAACGA CACCATCCTG CGGCCTGGGT GCAGCCCGTG CTGTATGTGG CCGATGGCTA CCAGTGGCAG CCGAGCACGG TGACCGTACC CGCGCCATTG CCCGCCGTGC GGATTGTGCC ACCCGTGATC AGCGATCTTC AGGTGGCGCA GCGCGGATTC ATTGGGCGAC GGACGGAATT GATTGAGGTG GCGGATCTGG TCGCGCAGCA TCCAGTGGTC ACGGTGCGGG GGGCCGGGGG GATGGGCAAG ACGGCCTTAG TGGCAGCACT CGCCCAACGG CTGGCCTGGC GGTTTCGTGA TGGCGTGTAT GCCTATTCGT TTGCGAACCA ACCATCGCCC GATCTCATGA GCGTGTTGCG CTGGTGTGCG GGCTGGTTGG GACTGGCGCT TGATCCGGCA TGGCAGGACG CAGACGTGCA GCAGCAGGTC ATCCAGCAGT TGCGAGGCAA GGCGTGTTTG CTGGTGGTCG ATAATTATGA AACGATTTTG TGGGCGTTAG GTCGCCAAGA TGAAGGACTG GATCTGTGGG CTGATGCTGA TCCGGCGGAT AGCGCCGCAG AACCACGGAA TGAGCGGCAG CAGGCGGCGC ATGCGATCCA GACGTTCTGG GAACAGGTGG CGAGCCTCGG TATTAAGGTG GTGTTTACGA CGCGCCATTC GCCCGTTCGC TTGGAGGGGA TTGCGGAAGC ATGCTATCCG TCCGATGACC GGATTGGCCA ACTCCAAGGC TTGGCCGAAG CCGATGCGGT AGCATTGTTT GAACGCTGGT GTGGGGGGAC GGCGTTGCCG CACGACCCAC CGCCACGAGG GATCGTGCTC CAGATCGTGC GCCTGATTGG GGCGATCCCG TTGGTGATTC AGTTGACGGC GCGGCGCTGG GCGACCCTCC CGAATCCACA GCCCGACCAA TTTCTGACGG ATTTGCACAC GCATCTGGTC GCAGCGCAGA CGATGGATGG GGCGCGCCAC CAACAATCGC TGGTGGTGAA TGTACGGTTA TCAGTTGATG CCCTCGCGCC GACGATGCAA GCAGCATTAT TCCAACTGAG CCTGCTCGAG AATCCGTTGA TTTGGGGACT GAATGCGGCG GCGATCTGGG GCTTGACGAC AGAGACGGAG GAGGGTATCT GCTATGAAAC CGATCCGGCG ATTGCCCGCC TGCATCAATT GGAAGCGACC TCATTGATTC AGGTCGTTGC TGCGGAACAG GATGTATTTG GGTTTCAGCC CGCGCTGCTG CAAACCCTGC GCTATCTGCG AGATCATCCG CCCTTGCGCG AACGGATGAC TACACCGATG CAGGATGCTC AAACGCGGTA TGGCCACTAT GCTTGGCTGA CAACCAAGGA TTATGCTAGG CAACAGGAAG CCGGTATGTT AACAAAGGAA GCACAAGCGA AATTCCCAGA TTTGCTGGCT GGGCGACAAT ATCTTGAGCC AGCCCAAACC GGATGGGTGG CCTATTGGGT CGCCGATATA CAGCGTCAAT TTGGATTACT CGGCGATGCC CAGCGGCTAC ATGAAGAAGC ATTACAGATA GCTAAGTCAT ATAAACTGCT AACGTTACAA AGTAATGTTA CTTATGCCTT AGCCTCAATT CACCAAATCC ACGGAGCATA TGAGGAGGCG GAGGAATTAT ACCGAGAGTC CTTAGCGCTC GATGATGACC TAGACGATCT CCAAGGTCGC GCTGCTAACC TTCACGAACT CGCCACTCTC GCACAATTGC GCGGGGAGTA TAGAGATGCA GAGCAGTTGT ATCGCGAATC CTTGGCTACA CACGACATGA TGAGTGAACG TGAGAGTCCA TTTTTTACCA TGTATGCTCT CGCACAGATA GATGCGGTTC GCAGGGCATA TGAGGATGCG GAGAGACTGT ATCATGAATC CTTGGCGATA CACGACGAAC CCGTCACTCT CGCACAATTG CGTGGGGAGT ATGAGAATGC GGAACGGCTG TATCGTGAAT CCTTGGCGAC ATACAACGCT GTGAGTAAGC GTCAGCATGC TGCTACCCTC AATGCCCTCG CCCAGATTGC GGTAGTACGC GGATCGTATG AGGATGCGGA GAGACTGTAT TGCGAATCCT TGGCGATACA CGACGCACTA GGAAATCGCA AGAGTCGCGC TGCTACTCTG TATGGTCTCG CTCATATTTT TATAGTGCGT GGGGCGTATG AGGATGCGGA GGGATTGTAT CGCGAATCCT TAGCAATCGC TGATGACTTG GGCGATCTTC AGGGTCGCGC TGCCATTCTC CACGAACTCG CCACTCTCGC GCGGGTACGC GGGGAGTATA GAGATGCGGA GAGGTTGTGT TATGAATCAT TGGCAATAGA CGATGCTCTA GGAGATCGTA AAAGTCGTGC CTCAACCCTC CATGAACTTG CCAATCTCGC ACAACTGCAA GATATGTATG GGAAGGCGGA GGAGTTTTAC TATGAATCAT TGGCCATTAA AGATGCTCTA GGCGACCGCA AAGGCCGCGC CGATACCCTC CATGAACTTG CTACGCTCGC ACGGGTGCGT GGGATGTATG AGAAGGCGAA GGAATTGTAT TGCGAATCAT TGACGATCTA CGATGATCTT GATAACCGTC AGGGCCGCGC GGATACCCTC AATGCCCTTG GCCGAGTTGC GGTGGTGCTC GGGGCGTATG AGCACGCAGA GGAGTTGTAT CGCGAATCAT TGATGATCTA CAACGACTTA GGTAACCGTA AAGGTCAGGC CGATACTATC CACGGATTTG CCAATCTTGC GCAATTGCGA GGCGTTTATG ACGATGCGGA AGGTTTATAC CGCGAATCAT TGGTAATCTA CAACGACTTG GGCGACCGCA AAGGCCGCGC GGATACCCTC AATGCCCTTG CCCAAGTTGC GGTGGTGCGC GGGGCGTATG AGCACGCAGA GGAGTTGTAT CGCGAATCCT TGGCTGTAAC CGAAGCGCTG GACGACCACA AGGGTCGCGT CTCCACGCTG AATGCTCTTG CCCAGATTTC GGTGGTGCGT GGGTCATATA AGGATGCGGA GAGGTTGTAT CGTGAATCAT TGGCGATAAC CGACATGTTG GACGATAGCT ATGCAAAAGC CAGAATAACA GTGATGCTGG GGCAACTTCT GCTCAAACAA GGCAGCAATA TAGGTACGAC TATGATTGAG CAGGCCTATG AGATCTTTCA TCAGCTAGGA GCAGCCAATG ATGCTGAACA AACGAAGACG ATTCTTGAGC TCGTGCAGCA TCCCACACTC ATTGAGTGCA TCAACCAATG GATGACGAGT GCTCGTGAGG CGACAGGTCT TACAACTTTG CTGAATCGGG TCTGTCAGAC GGTGGTGGCT GTGATGAAAA CTACTGATCC AGAGGCTCGA CAGCAGGTTG TAGAACATCT TGAACCTTTA GTCGCTACCG ACTCATTGCC GATAGATGGT GCAATGAGCT TTTTGCAGAC ACTTCAGGCG TGGCTACGTG GGGATGAAAC CCAATGGCAA ACACTACTAC CGCAGTTGAA TGATCGTTTC CAATCCGTCA TCACGCAGAT GCAGCTTGCT GTTCATCCCA TCTATCGCCA GGTTATGCCG TTATTGTGGG CTACTGCCGA TGCGCTCCAC CGCAATGATC CCGCCGTTAC CGATCAACTT GTCGCACGCC TGAGCACCAT GAGTGACCAA GCCGCCGAGG GAGAACCAGA GGATTCGCCT TGGATGGACG CAGCTCGCGC GTTACGAGCA GCACGAGCCA TCCTTCAAGG GGATGCGATT GAGACGACGG GATTGGGAGA GATCTATCAG GCAATGCTTG GTCAGCTTCA TGCGATAGCG GCGAATCGCC CATTGGTGTA A
|
Protein sequence | MTLILDLTMM DDGQATVTLD GQVLGTFDPH PLLAQRPLAT PDADGLAYGD ALYTALHGAA WPADWLMAPA PDPAAILAIR SSDPVLQAIP WEWLRIEGRW AITETLFIRL VPMTERMQAQ LAANRPDGTQ PYRLVVQCCE PLLWYRDEQW QPLDALPASL LVTPLAQEVQ RATPALPVIW HDLAPTMNAL LRTIPAHREP LLYHFTGHGD WHDGRPVVLF DDGSGRADPK DMALLSQRLR PRTQLAFVNA CHSAEAQGDT ASLAFQLCQQ GTPVVIGMDG PVEDRHGQQW ATDFYPSLLR GSDPPTALWD ARLAMQDRER HHPAAWVQPV LYVADGYQWQ PSTVTVPAPL PAVRIVPPVI SDLQVAQRGF IGRRTELIEV ADLVAQHPVV TVRGAGGMGK TALVAALAQR LAWRFRDGVY AYSFANQPSP DLMSVLRWCA GWLGLALDPA WQDADVQQQV IQQLRGKACL LVVDNYETIL WALGRQDEGL DLWADADPAD SAAEPRNERQ QAAHAIQTFW EQVASLGIKV VFTTRHSPVR LEGIAEACYP SDDRIGQLQG LAEADAVALF ERWCGGTALP HDPPPRGIVL QIVRLIGAIP LVIQLTARRW ATLPNPQPDQ FLTDLHTHLV AAQTMDGARH QQSLVVNVRL SVDALAPTMQ AALFQLSLLE NPLIWGLNAA AIWGLTTETE EGICYETDPA IARLHQLEAT SLIQVVAAEQ DVFGFQPALL QTLRYLRDHP PLRERMTTPM QDAQTRYGHY AWLTTKDYAR QQEAGMLTKE AQAKFPDLLA GRQYLEPAQT GWVAYWVADI QRQFGLLGDA QRLHEEALQI AKSYKLLTLQ SNVTYALASI HQIHGAYEEA EELYRESLAL DDDLDDLQGR AANLHELATL AQLRGEYRDA EQLYRESLAT HDMMSERESP FFTMYALAQI DAVRRAYEDA ERLYHESLAI HDEPVTLAQL RGEYENAERL YRESLATYNA VSKRQHAATL NALAQIAVVR GSYEDAERLY CESLAIHDAL GNRKSRAATL YGLAHIFIVR GAYEDAEGLY RESLAIADDL GDLQGRAAIL HELATLARVR GEYRDAERLC YESLAIDDAL GDRKSRASTL HELANLAQLQ DMYGKAEEFY YESLAIKDAL GDRKGRADTL HELATLARVR GMYEKAKELY CESLTIYDDL DNRQGRADTL NALGRVAVVL GAYEHAEELY RESLMIYNDL GNRKGQADTI HGFANLAQLR GVYDDAEGLY RESLVIYNDL GDRKGRADTL NALAQVAVVR GAYEHAEELY RESLAVTEAL DDHKGRVSTL NALAQISVVR GSYKDAERLY RESLAITDML DDSYAKARIT VMLGQLLLKQ GSNIGTTMIE QAYEIFHQLG AANDAEQTKT ILELVQHPTL IECINQWMTS AREATGLTTL LNRVCQTVVA VMKTTDPEAR QQVVEHLEPL VATDSLPIDG AMSFLQTLQA WLRGDETQWQ TLLPQLNDRF QSVITQMQLA VHPIYRQVMP LLWATADALH RNDPAVTDQL VARLSTMSDQ AAEGEPEDSP WMDAARALRA ARAILQGDAI ETTGLGEIYQ AMLGQLHAIA ANRPLV
|
| |