Gene Haur_3158 details

Gene Information

Locus tag: Haur_3158
Symbol:
ID: 5735030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3986763
End bp: 3989207
Gene Length: 2445 bp
Protein Length: 814 aa
Translation table: 11
GC content: 54%
IMG OID: 641280301
Product: TPR repeat-containing protein
Protein accession: YP_001545923
Protein GI: 159899676
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
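
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 3986763
END_BP = 3989207


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2445
print(protein_length(length_bp))  # 814
```

Under these assumptions the computed values agree with the Gene Length (2445 bp) and Protein Length (814 aa) fields.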


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.385095
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGATGC CAGCCCTAAT TGGATTGCTC GTTGATGCTG TGCTTGCTGT GTCGCCTGCC 
GCTGATCGCC TGACCATTGA AGCGGCGTTT GGTGCGATGT TGGCAGGTAA ACCAACGCTA
CTTGATGATG CTCCTTTATC GCTGCTAATT GATCAATCGA GCGCTCAATC CCATGCGACA
ATCGTGTATG CAGGCCAGAC GATTGCTGTT GAGCTGGCTG AGCCGCTTGA CCCGCTGGTT
GTAGCGCTAG CGGCGGTCGC CTCGTTGCCA CTGACAACTG TGCCTGCCCC GCGTGCCGAT
CAACCAGCGG CCTCGCGACT ACCATTTGAA TCAAGTAGCA CGTTTGTTGG GCGTGAGCAA
GAATTGTTGG CGCTGGCCGC CGCGCTTAGC CATGCTCAGC CTACGATTGT GCTGCCCGCC
GTGGCAACTG GCCTTGGCGG AATTGGCAAA ACCAGCTTGG TCACTGAGTT TGCCTATCGC
TATGGCAGCT ATTTCCACGG CGGCGTATTT TGGATCAACT GCGCTGATCC TGAGCAGGTT
GAAAATCAAA TTGCTGCATG TGCCGAGGCG CTGGCGATCG ATCCAACAGG TTTGACGCTT
GATGCCCAGG TGCAACACGT TTTAGCAGCA TGGCAAGCGC CGCTGCCGCG CCTACTCATT
TTCGATAATT GTGAAGATCC AGTGATCCTT GAGCGTTGGA TGCCGACGTT GGGCGGTTGT
CGGGTGTTGG TGACCGCCCG CAATCAATTA GCAACGATGA GCGCGATTCG GCTTGGAGTT
TTGGCTCCTG CCGAAAGCCG TGCCTTACTA CAACAGCTTT GCCCACGCCT GACAACTGCT
GAAGCCGAGG CAATTGCCGC CGATCTTGGG CATTTGCCGT TGGCCTTGCA GCTTGCAGGC
AGCTACCTCA ATACTTATGA TCAACAGAGT GTTGCTCAAT ATCGCCAGGA TTTGGCGGTT
ACTCATCATT CGCTCAAAGG TGGTGCGGGA TTGCCCTCGC CAACCCGCCA TGAACAAGAT
GTTGAAGCGA CGTTTATGCT CAGTTTGCAC CAGTTTGATT CGGCTAATGC ACTGGAGATG
TTGGCCTTAG ATATGTTGGA TGGTGCGGCT TGGTGTGCGC CAGGTGTGCC AATCCCCCGT
CAGCTTGTGC TCGATTTTGT TCCCGATGAA ACGAATGCTG AAACTGCACT CGCAGCGCTG
CAATTGTTGG AGCAACGTGG ATTAATTGAT GGGAGCGAGG CGCTGGTTGT GCATCGTTTG
TTGGCGCAAG TTGTTCAGGT TCATCGCGGC TCAGCACAAA TCCGCGAACT GGCCGAATAT
CGAATTAACG AGCATGCTGT GCGAATTAGT GCCACGCGTG TGCCAAAGCA GATGCTGCCA
CTTGAGCCAC ATTTGCGCCA TGTGACCGTG CGAGCATTGG CGCGTGAGGA TGAACGGGTC
GCGCGTTTGT GTAATAGTCT GGGCTATTGG GAGCACTTGC GTGGCGTTTA TGGTGAGGCC
GAGCGCTGGT ATGAACGGGG CTTGGCGATT ATGCAAAAGG TCTTAGGGCC AGAGCATCAA
AATACTGCCC GCATGATGAA TAATTTGGCA GGTATTCGTT TGGAGCAAAT GCGCTATGCC
GAGGCGCAGG CCTTGTATGA GCAGGTTTTG GGGATTTGGA ATGTCACTTT TGGCCCAGAG
CATCCTGATA CCGCGCGATG TATGAACAAT CTGGCCTCGG CTTTAGGGCG ACAAGGACAG
AATGCCGAGG CCTTGGCGAT GCTTGAACAA GCATTGGTAG TTTGGGAAGC AGCCTTAGGC
CCAGAGCACC CCGATACCGC GATTAGTATC AACAATCTGG CGGTAGCCTT GGAGCGCGAA
GGGCGCTATG CCGAATCGCA GGTGTTACAG GAACGAGCAC TGAAGGTATG GAAAAAAACT
TTAGGGCCAG AGCATCCCGA TACTGCGGCA AGTTTGAACA GTTTGGCCCG CTTGTTGGAA
CATCAAGGCA AATATTCACA AGCTCTGCCA TTCTATCAAC AGGCGTTGGC AATTCGCGAA
ACCGCCTTGG GGCCAGAACA TCCCGACGTT GCTTCTAGCC TGAACGATCT GGCGGGATTG
CTGATCGAAC AAAAACGCTA TACTGACGCG CAGGCCTTGT ATGAACGGGC GCTGACGATT
CGTGAATTGG TGTTTGGCCC AGAGCATGCC GATACGATCA CTGCTATGGC AAATTTGGCG
GTGGCATTGG AGCGGCGCGG CCAATACCGC GAAGCGTTAG AGTTGCATGC CCAGGCATTG
ACCATTAGCC GAAAAGTTTT TGGCGATAAT CATCAGACGA GCCAACGGAT TCGTGCTAGC
CATGCCCGAA CCGTCCAAGC GATTCAAGAA GCCTTCGACC AATCGGCAGC CAAACGATCC
AGTGGCAAAC ATACCAACGA TGATCACGTT AAACAATGGA AGTAA
 
Protein sequence
MEMPALIGLL VDAVLAVSPA ADRLTIEAAF GAMLAGKPTL LDDAPLSLLI DQSSAQSHAT 
IVYAGQTIAV ELAEPLDPLV VALAAVASLP LTTVPAPRAD QPAASRLPFE SSSTFVGREQ
ELLALAAALS HAQPTIVLPA VATGLGGIGK TSLVTEFAYR YGSYFHGGVF WINCADPEQV
ENQIAACAEA LAIDPTGLTL DAQVQHVLAA WQAPLPRLLI FDNCEDPVIL ERWMPTLGGC
RVLVTARNQL ATMSAIRLGV LAPAESRALL QQLCPRLTTA EAEAIAADLG HLPLALQLAG
SYLNTYDQQS VAQYRQDLAV THHSLKGGAG LPSPTRHEQD VEATFMLSLH QFDSANALEM
LALDMLDGAA WCAPGVPIPR QLVLDFVPDE TNAETALAAL QLLEQRGLID GSEALVVHRL
LAQVVQVHRG SAQIRELAEY RINEHAVRIS ATRVPKQMLP LEPHLRHVTV RALAREDERV
ARLCNSLGYW EHLRGVYGEA ERWYERGLAI MQKVLGPEHQ NTARMMNNLA GIRLEQMRYA
EAQALYEQVL GIWNVTFGPE HPDTARCMNN LASALGRQGQ NAEALAMLEQ ALVVWEAALG
PEHPDTAISI NNLAVALERE GRYAESQVLQ ERALKVWKKT LGPEHPDTAA SLNSLARLLE
HQGKYSQALP FYQQALAIRE TALGPEHPDV ASSLNDLAGL LIEQKRYTDA QALYERALTI
RELVFGPEHA DTITAMANLA VALERRGQYR EALELHAQAL TISRKVFGDN HQTSQRIRAS
HARTVQAIQE AFDQSAAKRS SGKHTNDDHV KQWK
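
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It uses only the Python standard library and builds the codon table from the NCBI-style amino acid string. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which start codons are permitted), so one table serves here. The names CODON_TABLE and translate are hypothetical, and this is an illustration rather than the method used to produce the record.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGAGATGC CAGCCCTAAT TGGATTGCTC"))  # MEMPALIGLL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would be expected to reproduce the full protein sequence.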