Gene Haur_5154 details


Gene Information

Locus tag: Haur_5154
Symbol:
ID: 5737112
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 224079
End bp: 228869
Gene Length: 4791 bp
Protein Length: 1596 aa
Translation table: 11
GC content: 57%
IMG OID: 641282319
Product: TPR repeat-containing protein
Protein accession: YP_001547910
Protein GI: 159901664
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
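
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive positions and that the CDS ends in a single untranslated stop codon; both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 224079
end_bp = 228869
gene_length_bp = 4791
protein_length_aa = 1596

# Assumption: Start bp and End bp are both inclusive positions.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # coordinates match the length
print(gene_length_bp % 3 == 0)                    # whole number of codons
print(expected_protein_aa == protein_length_aa)   # codons minus stop match aa count
```

Under those assumptions all three checks agree with the listed values.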


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCTGA TTCTTGATCT GACCATGATG GATGATGGGC AGGCGACGGT CACTTTGGAT 
GGACAGGTGT TGGGCACGTT TGATCCCCAT CCGTTGCTCG CCCAGCGCCC GCTGGCCACG
CCCGATGCCG ATGGGCTTGC CTATGGGGAT GCGTTGTATA CGGCGTTGCA TGGTGCTGCG
TGGCCCGCTG ATTGGCTGAT GGCTCCCGCC CCTGATCCAG CGGCCATCCT CGCGATTCGG
AGCAGCGATC CGGTTCTTCA GGCGATTCCA TGGGAGTGGT TGCGCATCGA GGGACGGTGG
GCGATTACCG AAACCCTGTT CATCCGGCTC GTTCCCATGA CCGAGCGCAT GCAGGCCCAG
CTGGCCGCGA ATCGTCCTGA TGGAACACAG CCCTATCGGT TGGTGGTTCA ATGCTGCGAG
CCGCTGCTGT GGTATCGCGA CGAGCAGTGG CAGCCCCTCG ACGCACTTCC TGCATCGCTC
CTCGTCACAC CACTCGCCCA AGAGGTCCAG CGGGCGACTC CCGCCCTACC CGTCATTTGG
CACGACCTGG CCCCGACGAT GAATGCGTTG TTGCGCACGA TTCCCGCGCA CCGCGAGCCG
CTGCTCTATC ACTTCACGGG GCATGGCGAC TGGCATGATG GCCGACCCGT AGTGCTGTTT
GACGATGGGA GCGGGCGGGC CGACCCGAAG GATATGGCGC TGCTCAGCCA GCGGTTGCGC
CCGCGGACGC AGTTGGCGTT TGTGAATGCG TGCCATAGCG CGGAAGCTCA GGGTGACACC
GCCAGTCTCG CGTTCCAACT CTGCCAGCAA GGGACTCCGG TGGTCATTGG GATGGACGGC
CCCGTAGAAG ATCGCCATGG ACAGCAGTGG GCCACGGATT TTTATCCGTC GCTGTTGCGG
GGTTCCGATC CACCGACCGC GTTATGGGAC GCGCGGTTGG CGATGCAGGA TCGGGAACGA
CACCATCCTG CGGCCTGGGT GCAGCCCGTG CTGTATGTGG CCGATGGCTA CCAGTGGCAG
CCGAGCACGG TGACCGTACC CGCGCCATTG CCCGCCGTGC GGATTGTGCC ACCCGTGATC
AGCGATCTTC AGGTGGCGCA GCGCGGATTC ATTGGGCGAC GGACGGAATT GATTGAGGTG
GCGGATCTGG TCGCGCAGCA TCCAGTGGTC ACGGTGCGGG GGGCCGGGGG GATGGGCAAG
ACGGCCTTAG TGGCAGCACT CGCCCAACGG CTGGCCTGGC GGTTTCGTGA TGGCGTGTAT
GCCTATTCGT TTGCGAACCA ACCATCGCCC GATCTCATGA GCGTGTTGCG CTGGTGTGCG
GGCTGGTTGG GACTGGCGCT TGATCCGGCA TGGCAGGACG CAGACGTGCA GCAGCAGGTC
ATCCAGCAGT TGCGAGGCAA GGCGTGTTTG CTGGTGGTCG ATAATTATGA AACGATTTTG
TGGGCGTTAG GTCGCCAAGA TGAAGGACTG GATCTGTGGG CTGATGCTGA TCCGGCGGAT
AGCGCCGCAG AACCACGGAA TGAGCGGCAG CAGGCGGCGC ATGCGATCCA GACGTTCTGG
GAACAGGTGG CGAGCCTCGG TATTAAGGTG GTGTTTACGA CGCGCCATTC GCCCGTTCGC
TTGGAGGGGA TTGCGGAAGC ATGCTATCCG TCCGATGACC GGATTGGCCA ACTCCAAGGC
TTGGCCGAAG CCGATGCGGT AGCATTGTTT GAACGCTGGT GTGGGGGGAC GGCGTTGCCG
CACGACCCAC CGCCACGAGG GATCGTGCTC CAGATCGTGC GCCTGATTGG GGCGATCCCG
TTGGTGATTC AGTTGACGGC GCGGCGCTGG GCGACCCTCC CGAATCCACA GCCCGACCAA
TTTCTGACGG ATTTGCACAC GCATCTGGTC GCAGCGCAGA CGATGGATGG GGCGCGCCAC
CAACAATCGC TGGTGGTGAA TGTACGGTTA TCAGTTGATG CCCTCGCGCC GACGATGCAA
GCAGCATTAT TCCAACTGAG CCTGCTCGAG AATCCGTTGA TTTGGGGACT GAATGCGGCG
GCGATCTGGG GCTTGACGAC AGAGACGGAG GAGGGTATCT GCTATGAAAC CGATCCGGCG
ATTGCCCGCC TGCATCAATT GGAAGCGACC TCATTGATTC AGGTCGTTGC TGCGGAACAG
GATGTATTTG GGTTTCAGCC CGCGCTGCTG CAAACCCTGC GCTATCTGCG AGATCATCCG
CCCTTGCGCG AACGGATGAC TACACCGATG CAGGATGCTC AAACGCGGTA TGGCCACTAT
GCTTGGCTGA CAACCAAGGA TTATGCTAGG CAACAGGAAG CCGGTATGTT AACAAAGGAA
GCACAAGCGA AATTCCCAGA TTTGCTGGCT GGGCGACAAT ATCTTGAGCC AGCCCAAACC
GGATGGGTGG CCTATTGGGT CGCCGATATA CAGCGTCAAT TTGGATTACT CGGCGATGCC
CAGCGGCTAC ATGAAGAAGC ATTACAGATA GCTAAGTCAT ATAAACTGCT AACGTTACAA
AGTAATGTTA CTTATGCCTT AGCCTCAATT CACCAAATCC ACGGAGCATA TGAGGAGGCG
GAGGAATTAT ACCGAGAGTC CTTAGCGCTC GATGATGACC TAGACGATCT CCAAGGTCGC
GCTGCTAACC TTCACGAACT CGCCACTCTC GCACAATTGC GCGGGGAGTA TAGAGATGCA
GAGCAGTTGT ATCGCGAATC CTTGGCTACA CACGACATGA TGAGTGAACG TGAGAGTCCA
TTTTTTACCA TGTATGCTCT CGCACAGATA GATGCGGTTC GCAGGGCATA TGAGGATGCG
GAGAGACTGT ATCATGAATC CTTGGCGATA CACGACGAAC CCGTCACTCT CGCACAATTG
CGTGGGGAGT ATGAGAATGC GGAACGGCTG TATCGTGAAT CCTTGGCGAC ATACAACGCT
GTGAGTAAGC GTCAGCATGC TGCTACCCTC AATGCCCTCG CCCAGATTGC GGTAGTACGC
GGATCGTATG AGGATGCGGA GAGACTGTAT TGCGAATCCT TGGCGATACA CGACGCACTA
GGAAATCGCA AGAGTCGCGC TGCTACTCTG TATGGTCTCG CTCATATTTT TATAGTGCGT
GGGGCGTATG AGGATGCGGA GGGATTGTAT CGCGAATCCT TAGCAATCGC TGATGACTTG
GGCGATCTTC AGGGTCGCGC TGCCATTCTC CACGAACTCG CCACTCTCGC GCGGGTACGC
GGGGAGTATA GAGATGCGGA GAGGTTGTGT TATGAATCAT TGGCAATAGA CGATGCTCTA
GGAGATCGTA AAAGTCGTGC CTCAACCCTC CATGAACTTG CCAATCTCGC ACAACTGCAA
GATATGTATG GGAAGGCGGA GGAGTTTTAC TATGAATCAT TGGCCATTAA AGATGCTCTA
GGCGACCGCA AAGGCCGCGC CGATACCCTC CATGAACTTG CTACGCTCGC ACGGGTGCGT
GGGATGTATG AGAAGGCGAA GGAATTGTAT TGCGAATCAT TGACGATCTA CGATGATCTT
GATAACCGTC AGGGCCGCGC GGATACCCTC AATGCCCTTG GCCGAGTTGC GGTGGTGCTC
GGGGCGTATG AGCACGCAGA GGAGTTGTAT CGCGAATCAT TGATGATCTA CAACGACTTA
GGTAACCGTA AAGGTCAGGC CGATACTATC CACGGATTTG CCAATCTTGC GCAATTGCGA
GGCGTTTATG ACGATGCGGA AGGTTTATAC CGCGAATCAT TGGTAATCTA CAACGACTTG
GGCGACCGCA AAGGCCGCGC GGATACCCTC AATGCCCTTG CCCAAGTTGC GGTGGTGCGC
GGGGCGTATG AGCACGCAGA GGAGTTGTAT CGCGAATCCT TGGCTGTAAC CGAAGCGCTG
GACGACCACA AGGGTCGCGT CTCCACGCTG AATGCTCTTG CCCAGATTTC GGTGGTGCGT
GGGTCATATA AGGATGCGGA GAGGTTGTAT CGTGAATCAT TGGCGATAAC CGACATGTTG
GACGATAGCT ATGCAAAAGC CAGAATAACA GTGATGCTGG GGCAACTTCT GCTCAAACAA
GGCAGCAATA TAGGTACGAC TATGATTGAG CAGGCCTATG AGATCTTTCA TCAGCTAGGA
GCAGCCAATG ATGCTGAACA AACGAAGACG ATTCTTGAGC TCGTGCAGCA TCCCACACTC
ATTGAGTGCA TCAACCAATG GATGACGAGT GCTCGTGAGG CGACAGGTCT TACAACTTTG
CTGAATCGGG TCTGTCAGAC GGTGGTGGCT GTGATGAAAA CTACTGATCC AGAGGCTCGA
CAGCAGGTTG TAGAACATCT TGAACCTTTA GTCGCTACCG ACTCATTGCC GATAGATGGT
GCAATGAGCT TTTTGCAGAC ACTTCAGGCG TGGCTACGTG GGGATGAAAC CCAATGGCAA
ACACTACTAC CGCAGTTGAA TGATCGTTTC CAATCCGTCA TCACGCAGAT GCAGCTTGCT
GTTCATCCCA TCTATCGCCA GGTTATGCCG TTATTGTGGG CTACTGCCGA TGCGCTCCAC
CGCAATGATC CCGCCGTTAC CGATCAACTT GTCGCACGCC TGAGCACCAT GAGTGACCAA
GCCGCCGAGG GAGAACCAGA GGATTCGCCT TGGATGGACG CAGCTCGCGC GTTACGAGCA
GCACGAGCCA TCCTTCAAGG GGATGCGATT GAGACGACGG GATTGGGAGA GATCTATCAG
GCAATGCTTG GTCAGCTTCA TGCGATAGCG GCGAATCGCC CATTGGTGTA A
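
The GC content reported for this gene could be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown above; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as demo input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if empty)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, used only as a short demo input.
first_line = "ATGACGCTGA TTCTTGATCT GACCATGATG GATGATGGGC AGGCGACGGT CACTTTGGAT"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full 4791 bp sequence to `gc_fraction` instead of the first line would give the gene-wide value to compare against the GC content field.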
 
Protein sequence
MTLILDLTMM DDGQATVTLD GQVLGTFDPH PLLAQRPLAT PDADGLAYGD ALYTALHGAA 
WPADWLMAPA PDPAAILAIR SSDPVLQAIP WEWLRIEGRW AITETLFIRL VPMTERMQAQ
LAANRPDGTQ PYRLVVQCCE PLLWYRDEQW QPLDALPASL LVTPLAQEVQ RATPALPVIW
HDLAPTMNAL LRTIPAHREP LLYHFTGHGD WHDGRPVVLF DDGSGRADPK DMALLSQRLR
PRTQLAFVNA CHSAEAQGDT ASLAFQLCQQ GTPVVIGMDG PVEDRHGQQW ATDFYPSLLR
GSDPPTALWD ARLAMQDRER HHPAAWVQPV LYVADGYQWQ PSTVTVPAPL PAVRIVPPVI
SDLQVAQRGF IGRRTELIEV ADLVAQHPVV TVRGAGGMGK TALVAALAQR LAWRFRDGVY
AYSFANQPSP DLMSVLRWCA GWLGLALDPA WQDADVQQQV IQQLRGKACL LVVDNYETIL
WALGRQDEGL DLWADADPAD SAAEPRNERQ QAAHAIQTFW EQVASLGIKV VFTTRHSPVR
LEGIAEACYP SDDRIGQLQG LAEADAVALF ERWCGGTALP HDPPPRGIVL QIVRLIGAIP
LVIQLTARRW ATLPNPQPDQ FLTDLHTHLV AAQTMDGARH QQSLVVNVRL SVDALAPTMQ
AALFQLSLLE NPLIWGLNAA AIWGLTTETE EGICYETDPA IARLHQLEAT SLIQVVAAEQ
DVFGFQPALL QTLRYLRDHP PLRERMTTPM QDAQTRYGHY AWLTTKDYAR QQEAGMLTKE
AQAKFPDLLA GRQYLEPAQT GWVAYWVADI QRQFGLLGDA QRLHEEALQI AKSYKLLTLQ
SNVTYALASI HQIHGAYEEA EELYRESLAL DDDLDDLQGR AANLHELATL AQLRGEYRDA
EQLYRESLAT HDMMSERESP FFTMYALAQI DAVRRAYEDA ERLYHESLAI HDEPVTLAQL
RGEYENAERL YRESLATYNA VSKRQHAATL NALAQIAVVR GSYEDAERLY CESLAIHDAL
GNRKSRAATL YGLAHIFIVR GAYEDAEGLY RESLAIADDL GDLQGRAAIL HELATLARVR
GEYRDAERLC YESLAIDDAL GDRKSRASTL HELANLAQLQ DMYGKAEEFY YESLAIKDAL
GDRKGRADTL HELATLARVR GMYEKAKELY CESLTIYDDL DNRQGRADTL NALGRVAVVL
GAYEHAEELY RESLMIYNDL GNRKGQADTI HGFANLAQLR GVYDDAEGLY RESLVIYNDL
GDRKGRADTL NALAQVAVVR GAYEHAEELY RESLAVTEAL DDHKGRVSTL NALAQISVVR
GSYKDAERLY RESLAITDML DDSYAKARIT VMLGQLLLKQ GSNIGTTMIE QAYEIFHQLG
AANDAEQTKT ILELVQHPTL IECINQWMTS AREATGLTTL LNRVCQTVVA VMKTTDPEAR
QQVVEHLEPL VATDSLPIDG AMSFLQTLQA WLRGDETQWQ TLLPQLNDRF QSVITQMQLA
VHPIYRQVMP LLWATADALH RNDPAVTDQL VARLSTMSDQ AAEGEPEDSP WMDAARALRA
ARAILQGDAI ETTGLGEIYQ AMLGQLHAIA ANRPLV
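
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a minimal sketch, not the pipeline that produced the record: translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and its last four codons.
head = translate("ATGACGCTGA TTCTTGATCT GACCATGATG GATGATGGGC AGGCGACGGT CACTTTGGAT")
tail = translate("CCATTGGTGTAA")
print(head)  # MTLILDLTMMDDGQATVTLD
print(tail)  # PLV
```

The two outputs match the first 20 residues and the last three residues of the protein sequence listed above.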