Gene Haur_0136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0136 
Symbol 
ID5732031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp168061 
End bp170391 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content44% 
IMG OID641277260 
ProductTPR repeat-containing protein 
Protein accessionYP_001542916 
Protein GI159896669 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.955072 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGGGAA ACTCGCACGT CCGGTTCTTA GGGGGCGGGA GTGTAGTAAT ACGCTCCTGC 
TACCCGACAG ATCATTTGCC TGAACGCAGC CAATTGCCAC CACATTCGGT TATGCCCTAT
CAACCACTGA GCGATTTTGT AGGGCGTGAG GCCCAACTAT ATCAATTGGC TCAAGCAATG
TTGCGCTCCG ACCCAACCCT GATTACACCA ACTGCTCTTG CCACAGGTAT GGGCGGGATT
GGCAAAAGTA GCCTAGCCCT AGAATTTGCC CATCGTTATG GTAGCTATTT TGCGGGCGGG
GTATTTTGGC TGTATGCCGC TACCAACGAA ACCCTCCAAG CCAGCCTTGA TCGTTGTTGG
GATAGCCTGA AACCAGACGA GTGGCGTTAT GAAGTTAAGC CTGAAACGCG GCTGCGGGTG
GTGCGTGAAT TATTTAATCA ACCTATACCA CGCTTGTTGA TTTTCGATAA TTGTGAAGAT
CCAGCCTTGC TCACGGCCTA TCGTCCCCAG GCTAGCAGTG GTTGCCAACT GTTGGTTACT
AGTCGGCGCA GCCAATGGCA AGGCACAAAT TTAATTACAC TTGATACCTT GCCGCCGCTT
GAAAGCCGCC AATTGCTACA ACAACTAGCT GCTCAGCCAA ATATCAACAA CTATCTCAGC
GATACCGATG CTGATCAGTT GGCTGAATTG GTGGGGCATT TGCCTTTAGC TTTGCATTTA
GTTGGCTCAA GCTTGAAATT TTATTTTCGC AAGCCTGCGG CTGAGTATAT TGCAGCGCTG
CAAAACCAAC GGATTGCCAG CTTGCAAGCG ATGGTTAAGC CAACCAGTAA GCTCCATCAA
AATACAATTA ATAATTTTTG GAGCGTGCGC GATACAGTTG AAGTGAGTTA TGGGTTGTTG
CCTGCCGAAT TAGGCCAAGC CTGTCGGCGT TTATTGTTGA TGATGGCCTA TTGTGCGCCG
AATGTGGTTA TTCCATGGGA GTTATTGCAA GCTGCTAGCG GCTACGATGA TGATAGCCTG
ACCGAATATC TATGGGAATT AACCCAAGCG GGCTTTTTCA ATGACCCAAC TCAACCGCGT
TTACATCCGC TGATGGCTGA TGTAATTATT GATCTTGATG CGGCGAATGA GCCTGAAAAT
TATAGATCGC TTGAACAAGC ATTGATAATG CTCAGCAACC GTTATCATGA TCAATGGGCA
ATGCGGGAAA TTGAAATGTT GCTGCCCCAT CTTGAATATT CGGATCGCCA AGCGAGCCAA
CATCCAGATT ATGCTGGCGA ATTAGGCTAT CAAGCAGGAC TTTGTTTATA TCGTCAAGGT
AAGTATGTTG ATGCAGAACA CATTTATCGA GAAGTTTTAT CAACCCAAAA CCAACTATTT
GAAACAGAAA ATCCAATCAT TCTTAATACA AAACATGCCC TGGCTGATGT ACTTGGCGAT
CAAGGGTTAC TTCAAGAGGC TGAACAACTT TTTAACGAAG TATATAGGTT ACGTAAAAAA
GTTTTAGGCC AATATCATCC CCATACCCTT AAAAGTAAAA AAGAATATGC AACTGCCATG
TTTTTGCGAG GAAATTATGC TGATGCAGAA CAAATTCTGC GCGAGATATT AACGATTCAA
GAACAGAGTC TAGGAAAAGA ACATTGGGAT AGCTTATTGA CAAAACATAA TTTAGCTTCA
ATTCGCAGTA AACAGGGATA TTATGCCTTA GCAGAGCGGA TGTATCGTGA AATATTGAAG
GATCAAGAGC AAATATTTGG TGTAAATCAT CCTGATACTC TAGCAACCAA GCGTCAAATT
GCTAACAACG TAGGTTATCA GGGTCGATAT GCCGAGACCG AACGTATCTA TCGGGAAGTT
CTGCCAATCT ATGAGCTTAT TTTAGGCTTG AACCATCCAT ATACGTTAAC AACAAAACAT
GGAATCGCCT GGGCATTAAA TGGACAAGGG CTTTATAAAC AGGCTGAATA TATGTACCGT
GAGGTATTAC TCATCTGTGA ACAAACGCTT CGGATTAATC ATCCTGAGAT TATTACTACT
AAACATAATA TTGCTTGGAT ATTGAGTAAA CAGCAACATT ATCTTGAAGC AGAAGTAATC
TATCGCGAAG TGCTAGAAAT TCGTGAGCAA AGCTTAGGAA CTAATCATCC TGACAGTTTA
TCAACAAAAT ATAACTTGGC AGCTACACTG TATCATCAAT CTTGCTATCG TGAAGCAGAA
TTATTATTTG ATCAAGTATT GAGCATCCGT GAAAAGGTTT TGGGAACTCA ACATGCAAGT
ACACAGGCAA CGCAAGAGTG GCTTGAACTG GTGCGCAGTA AATTGTGTTG A
 
Protein sequence
MRGNSHVRFL GGGSVVIRSC YPTDHLPERS QLPPHSVMPY QPLSDFVGRE AQLYQLAQAM 
LRSDPTLITP TALATGMGGI GKSSLALEFA HRYGSYFAGG VFWLYAATNE TLQASLDRCW
DSLKPDEWRY EVKPETRLRV VRELFNQPIP RLLIFDNCED PALLTAYRPQ ASSGCQLLVT
SRRSQWQGTN LITLDTLPPL ESRQLLQQLA AQPNINNYLS DTDADQLAEL VGHLPLALHL
VGSSLKFYFR KPAAEYIAAL QNQRIASLQA MVKPTSKLHQ NTINNFWSVR DTVEVSYGLL
PAELGQACRR LLLMMAYCAP NVVIPWELLQ AASGYDDDSL TEYLWELTQA GFFNDPTQPR
LHPLMADVII DLDAANEPEN YRSLEQALIM LSNRYHDQWA MREIEMLLPH LEYSDRQASQ
HPDYAGELGY QAGLCLYRQG KYVDAEHIYR EVLSTQNQLF ETENPIILNT KHALADVLGD
QGLLQEAEQL FNEVYRLRKK VLGQYHPHTL KSKKEYATAM FLRGNYADAE QILREILTIQ
EQSLGKEHWD SLLTKHNLAS IRSKQGYYAL AERMYREILK DQEQIFGVNH PDTLATKRQI
ANNVGYQGRY AETERIYREV LPIYELILGL NHPYTLTTKH GIAWALNGQG LYKQAEYMYR
EVLLICEQTL RINHPEIITT KHNIAWILSK QQHYLEAEVI YREVLEIREQ SLGTNHPDSL
STKYNLAATL YHQSCYREAE LLFDQVLSIR EKVLGTQHAS TQATQEWLEL VRSKLC