Gene Haur_5147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5147 
Symbol 
ID5737105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp211406 
End bp212815 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content37% 
IMG OID641282312 
ProductTPR repeat-containing protein 
Protein accessionYP_001547903 
Protein GI159901657 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0779428 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGCGGC CTGGTGTGCG CCTGGCATTC CAATTCCACG CGAGTTGGTG CTGGCGTTTG 
TTCCCGATGC CCGATGATAC CGATATCGAT GATGCCGTTG ATGCATTGCG TCGGTTACAG
CAGTTGGGTT TGCTGGATGG AACGGATGCC GTGGTGTTGC ATCGCTTGCT GGCCCAGGTT
GTTCAGGTAC AGTTGGGATC GACCGAAACG CTGGCCGTGG CGGAGGAGCG GATTGGTAGA
GTTGTTACTT GGGGCAATAT AAGAGACGAA CCGCATACTT TATATCCGTT TGTTATTCAT
TTACGCCATG TTGCATTACG AGCATTAGAT CGTGCAGATA AAATAGGACT CATTCTTGTT
AATAGTCTTG CACTTTATGA AAGAAATATA GGATCATATT CAACAGCACG ATCTCTACTA
GAGAAAATAA AAGAAACACA TTCGAATTTG TTGGTGAATG ATGCGAATAT TATGGAAATA
ATTACAAATA ACCTAGCAGC GGTATTATGC CATCAAGGAT TATACAGTGA GTCACAAGAA
TTATATGAAG GGATTTTGAG GAATTTAGTA AATTCAAACG TCAATAAAGA TATATATATT
GTCGATATTA TGAATAATTT GTCAATAGTA TTATCCTATC AATACAAATA TTTAGAGGCA
AAATTATTAC TTGAAAAAGC AATAAGACTA TATAGAAAAG AGCTGGGAAA ATCACAAGAT
AAACTATTAG CAACAATGAT GAATAATCTT GCTAGTATTT TAGAAAAACA AGGCATTTAT
CATGATTCAC AGTCTCTATA TGAAGAAGCA TTAATAATCC AAGAGCAAAT ATTTGGTCCA
GAAAATCCTA ATACATTATC AACTAAAAAT AATTTGGCCT TAGTCTTAAG CCTTCAAGGG
CTTCATGGGG AGGCAGAGAG TATCTGGAAA CATGTCTTAG AGATTAGAAA AAAAGTTTTG
GGATTAGATC ATGTTCATAC AGCTCAAAGC ATGAATAATT TAGGAGTAGC GTTCGAAGAA
AAAGGATCTT ATAACGAAGC ATATGAACTC CATATCAGTG CATTAAATAT ACGAAACCAT
ACATTAGGCC AGAATCATCC TGATACCATC CAAAGCATGA ATAACTTAGC ATTAGTGTAT
GCAAGATTAG GTCAGTATAA GGAGTCACAG CAGTTATATA CACAAACTCT TAAATGGTAT
CAACAACAAC CTTCGTTGAT GTCAACGGAT GATGTTCGAA TAATGAATAA TTTGGCATAT
GTTCTTGAAA ATCAGAAGCT TTACTCTGAG GCTTATGATT TATATCAACG AGTGCTAGTT
CTTACGACAC ATAGATTTGG TATACATAGC GATTATAGCC AGCTAATAAG AGAAACCTTA
GACCAATTAC GACCTAAAAT ATCAAAATGA
 
Protein sequence
MVRPGVRLAF QFHASWCWRL FPMPDDTDID DAVDALRRLQ QLGLLDGTDA VVLHRLLAQV 
VQVQLGSTET LAVAEERIGR VVTWGNIRDE PHTLYPFVIH LRHVALRALD RADKIGLILV
NSLALYERNI GSYSTARSLL EKIKETHSNL LVNDANIMEI ITNNLAAVLC HQGLYSESQE
LYEGILRNLV NSNVNKDIYI VDIMNNLSIV LSYQYKYLEA KLLLEKAIRL YRKELGKSQD
KLLATMMNNL ASILEKQGIY HDSQSLYEEA LIIQEQIFGP ENPNTLSTKN NLALVLSLQG
LHGEAESIWK HVLEIRKKVL GLDHVHTAQS MNNLGVAFEE KGSYNEAYEL HISALNIRNH
TLGQNHPDTI QSMNNLALVY ARLGQYKESQ QLYTQTLKWY QQQPSLMSTD DVRIMNNLAY
VLENQKLYSE AYDLYQRVLV LTTHRFGIHS DYSQLIRETL DQLRPKISK