Gene Haur_5149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5149 
Symbol 
ID5737107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp213995 
End bp217312 
Gene Length3318 bp 
Protein Length1105 aa 
Translation table11 
GC content53% 
IMG OID641282314 
ProductTPR repeat-containing protein 
Protein accessionYP_001547905 
Protein GI159901659 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000182587 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACTTAC TCGATATTCA ACACAAAATC TCAAGGCTCA TGGCCCGGTT TGTCGAAGAA 
GTAAAAAGTT CCACAGCGAT GGGTCATAGT GATATTAATC GTGTTGCTGA AACGGTGCTG
ATTCCGCTCT TAGGCCGTGT CTATGAATGC CCCAGTTTAC AGAATCTCAA TAGTCTCCAT
CCCAATTATC CAGCGGTTGA TTTAGGGGAT GTAGCCCGTC GGATCGCCTT TCAGGTCACG
ACAACGCCGG ATAGTAAAAA AATCAAAGAT ACCCTCACCA CGTTCATCGC TCATAACCTG
CATACCCAGT TTGATACGGT GTATGTATAT ATTTTGACGG AAAAACAACA CTCATATAGT
CCTGCTATTT TTAGTACTAT CACCGGCAAT CACCTCCTCT TTGATCCAAA AAGACATATC
CTTGATGCGA ATGATCTGCT GAAACAGATT GCTACCTACC ACGTTGACAA GGCCCAGCAA
ATTCTCACGA TACTTGAGGC GAATTTTGAT ACGCCCATTG ATCCCCTTGC TCATGCGCTT
GCCGTATATG GAACATTGCC GCTGGATTAT GTTCCTATGG CACGGTTGGA TCTGCCCCAA
GCCTCACGCA TTCCCTTTGA ATCGAGTGCC TATTTTGTTG GGCGCGAAGC CGAATTGAAA
GCTTTAGCCC GCGCGATTAT CCAAACCCAA CCGACCGTCG TTGTGCCTGC GGTTACGACC
GGACTGGGGG GGATTGGCAA AACGAGTCTG GTGACGGAAT TTGCCTATCG CTATGGGGTC
TATTTTCATG GCGGGATATT TTGGTTGAAC TGTGCTGATG CTAATCAGGT GGCGAGCCAG
ATTGCAGCCT GTGCGGTTGG TTTGAAGATT GATACTACTG GGATGGCGCT CGATGAACAG
GTGCAGCAGG TTTTGTATGC CTGGCAATCT CCGATGCCCC GCTTGCTGAT TTTCGATAAC
TGTGAAGATC CAGCGATTCT TACGCAGTGG AAGCCCACTA TTGGTGGTTG TCGGGTGCTG
GTGACGGCGC GGTCAGATCA GTGGCCAACG CTGACGCAGA TTCGTTTAGG GTTGCTCTCA
CCTGTCGAAA GTCGCGCGTT ATTGCAGCGA CTCTGCACGC GGCTGACTGA CACCGCAGCT
GATGCGATTG CCGAGGATCT AGGGCATTTG CCATTGGCCT TGCACCTAGC AGGCAGTTAT
CTTAATACCT ATTCCCATCA CACGGTCGAG CAGTACCGCA CGGAGTTAAC CATTGCCCAC
CGCTCGCTCA AGGGGCGAGG GGCGTTTCCA TCCCCAACCC AGCATGAACT GGATGTGGAA
GCCACTTTCA TGGTGAGCGT GAATCAGCTT GATCCAAATG ATCCAATCGA TGCGCTCGCC
TTGGGCATGC TGGATGGTGC TGCGTGGTGT GCGCCAAGCG TTCCCCTTCC GCGCTATGTG
ATACTATCGT TCGTTCCCGA TGGAACGGAT GGTGATGATG CCGTTGATGC GCTGCGGCGT
TTGCAAGCAT TGGGCTTATT GGATGGTATC GAGACAGTGA TCTTACATCG CTTGCTCGCC
CAAGTCATTC ATGTCCATAT GGGATGGTCT GCGACATTGG CGCTGGTAGA GCAGCGGATG
GTCGCTGCGG CGGAACAAGC GCATAAGACG GGGATTCCGA AGCAGATGAA TCCGCTCGAA
CCCCATCTGC GGGGTATGAC GCTCCGGGTG TTAGATCGTG ATACAGAACA AACGGCACGA
CTTGCAACGA ACCTTGGACT GTTTGCACAA CACCAAGGAT GGTATGCAGA GGCACAGGCG
CTACATGAAC GGGCGTTTGG TATACGAAGA GTGCTTGTTG GTGAAAACCA TTCTTCTACG
GCAATGAGCA TCAATAATCT TGCAGAAGCG TTACATCAGC AAGGGCGGTA TTTGGAGGCG
CAGGACTTAT TTGAACGGGC GTTGGCGGTG CGGGAAGTGG TGTTGGGGTT GGATCATCCC
GATACGGCAC GGAGTGTGAA CAATCTGGCG TTGGTCTTGG AGAGTCAAGG GCGGTATTCG
GAGGCGCAGG ACTTATTTGA ACGGGCGTTG GCGGTGCGGG AAGCGGTGTT GGGGTTGGAT
CATCCCGATA CGGCGGTGAG TGTGAACAAT CTGGCATCGG TTTTGGAGAG TCAAGGGCGG
TATTCGGAGG CCCGAGGCTT GTATGAACGG GCGTTGGAGG TCACGGAAGC AGTTTTAGGT
AGGGAACATC CTGATACTGC GCGAAGTGTG AACAATCTGG CATCGGTTTT GGCGCGGCAA
GGGCGGTATT CGGAGGCACA ACCCTTGTAC GAACAGGCGT TGGCGGTGAA TGAAGCAGTT
TTAGGTAGGG AACATCCTGA TACTGCGCGA AGTGTGAACA ATCTGGCATC GGTTTTGGAG
AGTCAAGGGC GGTATTCGGA GGCACAACCC TTGTACGAAC AGGCGTTGGC AGTGCGCGAA
GCGGTGTTAG GCGAGAATCA TCCGGATACG GCCATGAGTA TGAACAATCT GGCAATGGTA
CTGTTGAATC AAGGACGGTA TTCGGAGGCG CAGGGCTTGT TAGAACGAAC CTTGACGGTG
CATGAAGCGG TGTTGGGGGC GGAGCATCCG GACACGGCCA TGAGTGTAAA CAATCTTGCT
GTGGTCTTGG AGAGTCAAGG GCGGTATTCG GAGGCGCAGG GCTTGTTAGA ACGAGCATTG
GCGGTGCGGG AAGCGGTGTT GGGGGCGGAA CATCCGGATA CGGCCATGAG TGTGAACAAT
CTTGCGGGGG TCTTGGAGAG TCAAGGGCGG TATGGGGATG CGCAGCGGTT GTATGAACGG
GCATTGGTGG TTACGGAAGC GGTGTTGGGG GCGGAGCATC CAAATACGGC GCGAAGTATG
AACAATCTGG CAATGGTACT GTTGAATCAA AGGCGGTATT CGGAGGCGCA GGGCTTGTTA
GAACGGGCAT TGACGGTGCA TGAAGCGGTG TTGGGGGCGG AGCATCCGGA TACAGCCATG
AGTGTACACA ATCTGGCGGT GGTTTTGGAG CGGCAAGAGC GGTATAGCGA TGCACAAATG
TTATATGAAC GGGCGTTAGC CATCAATAAA GCGGTGTTAG GCCGCGAGCA TCCGGATACC
ATGACAACAA TGGGCAGCTT GGCAGGTGTG CTTGAAAGGC AACGGCAGTA TGGGAAAGCC
CAATCCCTCT ATGAACACGC ATTCGCCATC AGGAAACGCG TCTTGGGATT AACGCACCCA
GATACCCAAT CCCTCCAACG GGATGTAGGA CGAGTCCAAC GCTTGCATCT GACTACCAAA
AAGAAAAAGC GGAAATGA
 
Protein sequence
MHLLDIQHKI SRLMARFVEE VKSSTAMGHS DINRVAETVL IPLLGRVYEC PSLQNLNSLH 
PNYPAVDLGD VARRIAFQVT TTPDSKKIKD TLTTFIAHNL HTQFDTVYVY ILTEKQHSYS
PAIFSTITGN HLLFDPKRHI LDANDLLKQI ATYHVDKAQQ ILTILEANFD TPIDPLAHAL
AVYGTLPLDY VPMARLDLPQ ASRIPFESSA YFVGREAELK ALARAIIQTQ PTVVVPAVTT
GLGGIGKTSL VTEFAYRYGV YFHGGIFWLN CADANQVASQ IAACAVGLKI DTTGMALDEQ
VQQVLYAWQS PMPRLLIFDN CEDPAILTQW KPTIGGCRVL VTARSDQWPT LTQIRLGLLS
PVESRALLQR LCTRLTDTAA DAIAEDLGHL PLALHLAGSY LNTYSHHTVE QYRTELTIAH
RSLKGRGAFP SPTQHELDVE ATFMVSVNQL DPNDPIDALA LGMLDGAAWC APSVPLPRYV
ILSFVPDGTD GDDAVDALRR LQALGLLDGI ETVILHRLLA QVIHVHMGWS ATLALVEQRM
VAAAEQAHKT GIPKQMNPLE PHLRGMTLRV LDRDTEQTAR LATNLGLFAQ HQGWYAEAQA
LHERAFGIRR VLVGENHSST AMSINNLAEA LHQQGRYLEA QDLFERALAV REVVLGLDHP
DTARSVNNLA LVLESQGRYS EAQDLFERAL AVREAVLGLD HPDTAVSVNN LASVLESQGR
YSEARGLYER ALEVTEAVLG REHPDTARSV NNLASVLARQ GRYSEAQPLY EQALAVNEAV
LGREHPDTAR SVNNLASVLE SQGRYSEAQP LYEQALAVRE AVLGENHPDT AMSMNNLAMV
LLNQGRYSEA QGLLERTLTV HEAVLGAEHP DTAMSVNNLA VVLESQGRYS EAQGLLERAL
AVREAVLGAE HPDTAMSVNN LAGVLESQGR YGDAQRLYER ALVVTEAVLG AEHPNTARSM
NNLAMVLLNQ RRYSEAQGLL ERALTVHEAV LGAEHPDTAM SVHNLAVVLE RQERYSDAQM
LYERALAINK AVLGREHPDT MTTMGSLAGV LERQRQYGKA QSLYEHAFAI RKRVLGLTHP
DTQSLQRDVG RVQRLHLTTK KKKRK