Gene Haur_0090 details

Gene Information

Locus tag: Haur_0090
Symbol:
ID: 5731983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 114888
End bp: 118160
Gene Length: 3273 bp
Protein Length: 1090 aa
Translation table: 11
GC content: 52%
IMG OID: 641277212
Product: TPR repeat-containing protein
Protein accession: YP_001542870
Protein GI: 159896623
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
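
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (the convention that reproduces the listed gene length) and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(114888, 118160)
print(length_bp)                  # 3273
print(protein_length(length_bp))  # 1090
```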


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.763002
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGGGA ACCAAGCAAT TTTTGACCGT GCGTTAGAAC AATATCAGCT TGCCTCACGT 
GCTGGCGATT GGAACAAAGC TTTAACCGAA GCTGCTCGGG CCATGACCGA ATTTCCGACT
CACGAGCAGG CTCGGATTGC TGTCGCCACC GCGTTGTTTC ATACCAATAA GCTTCCACAA
GCGCTGCAAT TGTGGCAGGA GTTGCTCAAG CGCAATCCTG ATAATCCGAT TTTTACCGAA
TATATTGCCA AGATTTATCG CGCCCAAGGC GATACCGACC AAGCGATCGA GTTGTTTATG
CAATTGGCTG AACGCTTGAT CGCCCAAAAA TTGCCTTTGA AGGCAGTGCG AGCTTATCGC
GATATTTTGG ATATTCACCC CAGCCACGAA GAAGCCCGCG TGCGCTTGGC GATGGTACTG
GCTGAATCGG GCGATAAGCC TGGCGCAATC CGTGAATATT TGCATTTGGG TCAGCAATTT
TATGATGCTG AACAATACGA TGCTGCTGAT GAAGCCGCCG TCGAAGCCCT CAATGTTGAC
CCTGCTAGCC TCTCGGCCCA AGAATTCAAG AAGAAAGTTG ATTTGGCACG GCAGGCCTTA
ATCACTGCTG CCACAGGCGA TCAATTGCCC GATCAAGGTC GTTTGCAAAC CAAAGAACAA
ATCGAACGCG CGGTCAATGA GGCGATTGAG TATCAAAATA ATGGGTTGCT GGAGCAAGCC
GCCCAAATGT ATGAACGGGT GGTTGAAGTG CTGCCCGAAC GCGCCGACAT GCAGTATAGC
TTGGGCTTGA TTTACGATGA GCTTGAGCGG CATAAAGATG CCTTGCGGGT GCTCGAAATT
GCCAGCCATA ACGATGAATA CGCCCTCTCT GCCCACTTTG CAATTGGTAC GATCTACCAA
AAGCTGGGCC AAATGGAACG TGCTGCCCAA GAGTTTGAGC AAGCGATTCG CTATGTCGAT
TTGCAATCGA TCGGCAAAGA AGAAGCCAAC GATTTGATCG ATATGTACGA AGCGACCTCG
GCGATTTATC TTGAACTAGG CGATGTGGCG CGGGCAGCCT CGTTGTATTC AACCTTGGCA
GGCTTTTTGC AGGGCAAACG TTGGGGCGCG GAACAAGCGG CGCAATACAA TGCCAAGGCT
AAAGAGCTGA CCGACCGCAG CATGATGTCG AAATTGCGCA TGCTTGGCAC GGGTATTTTG
AATGCGCCGC CACCTGAAGA GGCTGTGCCT GAACCACAGA CCGAAACTTG GGGCAAGATG
CCCTCGATCA CCGATTTCCT CAGTCCCAAA AGTCGCGTCG AAGCCAGCCC AGTCGATGAT
ATGGCCAGCC TTGAGGCCTT GCTTAGCGCT AATACCGGCC CAGCACCCCA AACGATGCTT
ACCGATATTG ATCCGCTTGA TGTGCTGGCC GCCAATTTGC CACCTGCTGG CGAAGTACAT
TTTGCGCCGC TCACGCCAAT TGATACCGAA GGCCGTAGCG AGCGGATTCA GCGTTTGGTT
GAGGCCAGCG AATCGTTTGC CGAGCAAAAT TTCTTGTATG CGGCGCTTGA TGCCTGTCAT
GAAATTATTC GCTACGATCT GGAGTTTTAT GCTATTCACC TGCGGATGGC TGAAATTCTC
GAACGTTTGG GGCGTATGGA TCAAGCGCTC GCCAAACTCA ATTTGTTGAT CGAAACCTAC
AAAGCGCGGG GCGAAACCCA CAAGGCGATT GGGGTCTATC ACAAACTGAT CGATCTTTCG
GCTGATAGCA CGATCATTCG GGCTGAATTG GCTGAGGTGC TACGCAAACA AGGCCGTAGC
GATGAGGCCG CTGAGCAGTT GGCCTATGTT GCCAACCAAC AATTCCGCCA AGGCCAAACC
GTCAAAGCCT TGGAGCAATT TCGCAAGTTG TTGGAATGGG CACCTGATAG CGTCAATTTA
CGTGCTCAAT ATGGCCAAGT GTTGCTCAAA CTCGAACGTT GGGAAGCAGC CTTGGAAGAA
TTCCGCCGAG TTATCGTGCA AAAACCCGAT GATTTGGTGG TGATGGCCCA AGCCAATATT
GCCCTGGCCA TGATGGGCGA ATTCCCCGAT GCGATTTGGG ATTCAGTCGC AGCCTTGATT
CAAAAGCTGG CTAGCAATCC GCAACAGCTT AACGATGTGC AAGCCGAATA TCGCGCCGTC
ACCCTGATTA CTGATCGGGC GATTATTCAG TTTCTCTTGG GCTTGATTCA GCAATCAACC
AAGCAACATC CGGCGGCGAT GCACTCATTC AATCAAGCGC TCGAATTGCT CGAAATTGAT
GCTGATCCCT TGGTTCCGCC AGTGCTGGTC TATCAAGCGA TTGCCGATAG CTACATTGCC
GAGGGTAATG CTGCTGGCTC AATCGAACAG TTGCGCAAAA TCGAGGCGAT TTTGGTGACT
GGCACTGTGC CAGCCATGAG CACCAAGCAT GCCTTTGCCC AACCCTTCAA CGAGGGTGAG
TTGCAACGGC GTTTGGCCGA AGCCTATGCC GCTAATGAGA ATTTCGAAGG CGCAATTCAG
GCCTTGCAAC GGGTCAAGCA ATTGTTGCCC TACGACCGCC AAGCTTACAC CAAATTGGCT
GATATCTACT TCCGCCAAGG CCGCCTAAAC GAAGCCTTGA CCCAACTTGA TGAGTTGGCA
ACTTACTACG AAAGCCAAAG CCAGCTTGAC CGTGCCTTGG AGATTTTGGC CACGGGCTTG
CAATTAGCAC CCAAGAATAT CCCAATCAAA TCGCGCCATG CCCAGCTGAT GATGCGGCGC
GGCTATCTTG ATGAGGGCTT GGTAGGCTTG GATGAATTGG CTGAGTTGCA GCGCAAACAA
GGCTTGGTCA AGGATGCAGT GGCGAGCATT CAGCAAGTCG CTGATGTGTA TTGGACGTTG
GGCAAGGTTG ATAAAGCTGC CGAGATGTAC AACCGAATTG TGCAGATCGC GCCCAACGAC
ACCGAAGCTC GTCAACACTT GGTCAGCTTT AACATTCTTT CGTTGCGCAC CAAGGATGCA
ATTATCCAGT TGCGTGAAAT TGCACGGCTC TCAATTCAGC AACGCAACTA CGAAGAAGCG
ATTGCTTCCT ATCACCAAGT GATTGCGCTC GATCAAAAAG ATGCCGATGC CTACGAGCAA
TTGGCCGATG TGCTGATGCG AGCACAAGAG TATGGTCAGG CCGTGCGTAC CTATAAACAA
TTGGCGAAAT TGTTGCCCGA TGATGATCGG GTCGAAGCTT TGCAAAGCGC CGCCCAACGC
ATGCTTGATC AGCAACAGGT GGCCAAAGGC TAG
 
Protein sequence
MAGNQAIFDR ALEQYQLASR AGDWNKALTE AARAMTEFPT HEQARIAVAT ALFHTNKLPQ 
ALQLWQELLK RNPDNPIFTE YIAKIYRAQG DTDQAIELFM QLAERLIAQK LPLKAVRAYR
DILDIHPSHE EARVRLAMVL AESGDKPGAI REYLHLGQQF YDAEQYDAAD EAAVEALNVD
PASLSAQEFK KKVDLARQAL ITAATGDQLP DQGRLQTKEQ IERAVNEAIE YQNNGLLEQA
AQMYERVVEV LPERADMQYS LGLIYDELER HKDALRVLEI ASHNDEYALS AHFAIGTIYQ
KLGQMERAAQ EFEQAIRYVD LQSIGKEEAN DLIDMYEATS AIYLELGDVA RAASLYSTLA
GFLQGKRWGA EQAAQYNAKA KELTDRSMMS KLRMLGTGIL NAPPPEEAVP EPQTETWGKM
PSITDFLSPK SRVEASPVDD MASLEALLSA NTGPAPQTML TDIDPLDVLA ANLPPAGEVH
FAPLTPIDTE GRSERIQRLV EASESFAEQN FLYAALDACH EIIRYDLEFY AIHLRMAEIL
ERLGRMDQAL AKLNLLIETY KARGETHKAI GVYHKLIDLS ADSTIIRAEL AEVLRKQGRS
DEAAEQLAYV ANQQFRQGQT VKALEQFRKL LEWAPDSVNL RAQYGQVLLK LERWEAALEE
FRRVIVQKPD DLVVMAQANI ALAMMGEFPD AIWDSVAALI QKLASNPQQL NDVQAEYRAV
TLITDRAIIQ FLLGLIQQST KQHPAAMHSF NQALELLEID ADPLVPPVLV YQAIADSYIA
EGNAAGSIEQ LRKIEAILVT GTVPAMSTKH AFAQPFNEGE LQRRLAEAYA ANENFEGAIQ
ALQRVKQLLP YDRQAYTKLA DIYFRQGRLN EALTQLDELA TYYESQSQLD RALEILATGL
QLAPKNIPIK SRHAQLMMRR GYLDEGLVGL DELAELQRKQ GLVKDAVASI QQVADVYWTL
GKVDKAAEMY NRIVQIAPND TEARQHLVSF NILSLRTKDA IIQLREIARL SIQQRNYEEA
IASYHQVIAL DQKDADAYEQ LADVLMRAQE YGQAVRTYKQ LAKLLPDDDR VEALQSAAQR
MLDQQQVAKG
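
Both sequences are listed in 10-character blocks, so the blocks need to be joined before any analysis. As a rough sketch, the snippet below strips the whitespace and translates the first line of the gene sequence. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in the start codons it permits). `clean_sequence` and `translate` are hypothetical helper names.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGCAGGGA ACCAAGCAAT TTTTGACCGT GCGTTAGAAC AATATCAGCT TGCCTCACGT"
print(translate(clean_sequence(first_line)))  # MAGNQAIFDRALEQYQLASR
```

The printed residues match the first two blocks of the protein sequence. Running the same helpers over the full gene sequence should reproduce the rest of the listing, provided the blocks are pasted in order.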