Gene Haur_5158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5158 
Symbol 
ID5737116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp230372 
End bp232330 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content59% 
IMG OID641282323 
ProductTPR repeat-containing protein 
Protein accessionYP_001547914 
Protein GI159901668 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.455207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTGC CAGCCCTCAT TGCCGTTCTG GTCGATGCGA TTCTTGCCCA ATGCCCGCAG 
TATGATCGGG CGAGCGTTAC TGTGGCGCTC GAATCGGTCT TTGCAGGCCA TCCCACGCTG
CTTGGTGGCA ATTCGATCTC CATGCTGTTG GGCCAGAACA ATGATTTTAC CAATGCCACA
ATCACGATTG GCGATGTGCA TGCTGGCAAT CAGGTGCATG TGACGGTACA ACTCCCGCAG
CCGATTGACC CCCTGCCCGC TGCGCTTGCC GCCCTTGCCT CGATTCCCTT GAAGGATGTG
CCCGCGCCCC GTTCCGATCT CCCCCAAGCC TCACGGCTGC CGTTTGAAGC CAGTCCCCAT
TTTGTTGGGC GCGAGACGGA ATTAAAAGCG CTCGCGCAGG CGATTGGCAC GGCGCAGCCA
GCGGTGGTGA TGCCAGCGGT GGCCACGGGG CTTGGCGGGA TCGGCAAAAC GAGCCTGGTG
ACGGAGTTTG CCTATCGCTA TGGCGTGTAT TTTCATGGTG GGGTATTTTG GTTGAACTGT
GCCGATCCTG ATCAGGTGGC CAGTCAGATT GCGGCCTGTG CGGTTGGCTT GAAGCTTGAT
ACCACGGGCT TGTCGCTTGA TGAGCAGGTG CAACGGGTTT TGAATGCCTG GCAATCCTCG
ATGCCGCGCT TGCTGATTTT CGACAACTGT GAAGATCCAG CCATTCTTGA GCGATGGAAG
CCCACGGTCG GCGGCTGTCG GGTGCTGGTG ACGGCACGGT CGGATCAGTG GCCAACGTTG
ACGCAGATTC GCTTAGGGTT GCTCTCTTCT GGCGAAAGTC GGGATCTCTT GCAGCGCCTC
TGTGCGCGGT TGAGCGATGC CGAGGCTGAT GCGATTGCTG AGGATTTGGG GCATTTGCCG
TTGGCCTTGC ATTTGGCGGG GAGTTATCTG GCGACCTATC CCCATCATAC GGTCGAACAC
TACCGCACGG ATTTGACGAT TGCCCACCGG TCACTCAAGG GCCGTGGCGC ATTGCCCTCA
CCCACGCGCC ATGAACAGGA TGTGGAAGCC ACCTTCATGC TGAGTTTTAA GCAGCTTAAT
CCGACCGATG CCCTTGATGC CTTGGCCTTA GGCATGCTCG ATGGCGCGGC GTGGTGTGCG
CTAGGCGTAC CGATTCCTCG TGCCTTGATA CTGGCATTCG TGCCGGATGG GACGGATGCC
GATGATGCCG TTGATGCATT GCGGCGATTA CAACAGCTTG GGCTTCTTGA TGGCGTAGAT
GCGGTGGTCT TGCATCGCTT GCTGGCTCAA GTTGTTCACG CACGATTAGG ATCAATGGAC
ATGCTGGCCG TGGTCGAAGA TACGATTACC ACGATGGCAT CGCAGATTAA TGATAGTGGG
ATACCGACCC GCATGCTGCC GCTTGCGCCG CATCTGCGGT ATGTCACCAT GCGGGCGTTG
GATCGCGGTG ATGAACGTAC CGTCCGTTGC GCATATACCT TGGGGATGTT CGCGTATCTG
CGAGGCGCGT ATGGAGAGGC GCAGCCACTG TTCGAACGGG CGTTGTGGCT CTGCGAGCAG
CGGGCGGAGG TATCGCTGGT GACCGTCGGG TTGCTCAATC AGATGGGGCT GGTGCTGACT
CACCAAGGGC ACTATGGGGA AGCCCAACAG TGTTATGAAC GTGCTGTCGC CATCGGCGCA
GCGCTCATGG GGGACGATCA TGTCCATGTT CATGGGATTC GCCTGAATCT CGCCCAAGCC
CTGCATGCCC AAGGACAGGT GCAGGCCGCC CAGGCACTCG TCGCGGACGC GGCGATGAGC
GACCGCATGG ATGCCGCAAC GCGGGCAGGT TCCCTGAATC AGCGTGCGTT GCTGCTGGAG
CAACAAGGCC AGTATCCCCA GGCCCAAGCA CTCTATGAAC ACGCCGTTGC CCTGGCGACC
TCCGTGTTTG GCGCGGCTCA TCCCACCACG GCGAAGTAG
 
Protein sequence
MELPALIAVL VDAILAQCPQ YDRASVTVAL ESVFAGHPTL LGGNSISMLL GQNNDFTNAT 
ITIGDVHAGN QVHVTVQLPQ PIDPLPAALA ALASIPLKDV PAPRSDLPQA SRLPFEASPH
FVGRETELKA LAQAIGTAQP AVVMPAVATG LGGIGKTSLV TEFAYRYGVY FHGGVFWLNC
ADPDQVASQI AACAVGLKLD TTGLSLDEQV QRVLNAWQSS MPRLLIFDNC EDPAILERWK
PTVGGCRVLV TARSDQWPTL TQIRLGLLSS GESRDLLQRL CARLSDAEAD AIAEDLGHLP
LALHLAGSYL ATYPHHTVEH YRTDLTIAHR SLKGRGALPS PTRHEQDVEA TFMLSFKQLN
PTDALDALAL GMLDGAAWCA LGVPIPRALI LAFVPDGTDA DDAVDALRRL QQLGLLDGVD
AVVLHRLLAQ VVHARLGSMD MLAVVEDTIT TMASQINDSG IPTRMLPLAP HLRYVTMRAL
DRGDERTVRC AYTLGMFAYL RGAYGEAQPL FERALWLCEQ RAEVSLVTVG LLNQMGLVLT
HQGHYGEAQQ CYERAVAIGA ALMGDDHVHV HGIRLNLAQA LHAQGQVQAA QALVADAAMS
DRMDAATRAG SLNQRALLLE QQGQYPQAQA LYEHAVALAT SVFGAAHPTT AK