Gene Information |
Locus tag | Haur_5158 |
Symbol | |
ID | 5737116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 230372 |
End bp | 232330 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641282323 |
Product | TPR repeat-containing protein |
Protein accession | YP_001547914 |
Protein GI | 159901668 |
COG category | |
COG ID | |
TIGRFAM ID | |
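
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies three bases but does not encode a residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Haur_5158).
length = gene_length_bp(230372, 232330)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1959 652
```

Both results agree with the Gene Length (1959 bp) and Protein Length (652 aa) fields listed in the record.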
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.455207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTGC CAGCCCTCAT TGCCGTTCTG GTCGATGCGA TTCTTGCCCA ATGCCCGCAG TATGATCGGG CGAGCGTTAC TGTGGCGCTC GAATCGGTCT TTGCAGGCCA TCCCACGCTG CTTGGTGGCA ATTCGATCTC CATGCTGTTG GGCCAGAACA ATGATTTTAC CAATGCCACA ATCACGATTG GCGATGTGCA TGCTGGCAAT CAGGTGCATG TGACGGTACA ACTCCCGCAG CCGATTGACC CCCTGCCCGC TGCGCTTGCC GCCCTTGCCT CGATTCCCTT GAAGGATGTG CCCGCGCCCC GTTCCGATCT CCCCCAAGCC TCACGGCTGC CGTTTGAAGC CAGTCCCCAT TTTGTTGGGC GCGAGACGGA ATTAAAAGCG CTCGCGCAGG CGATTGGCAC GGCGCAGCCA GCGGTGGTGA TGCCAGCGGT GGCCACGGGG CTTGGCGGGA TCGGCAAAAC GAGCCTGGTG ACGGAGTTTG CCTATCGCTA TGGCGTGTAT TTTCATGGTG GGGTATTTTG GTTGAACTGT GCCGATCCTG ATCAGGTGGC CAGTCAGATT GCGGCCTGTG CGGTTGGCTT GAAGCTTGAT ACCACGGGCT TGTCGCTTGA TGAGCAGGTG CAACGGGTTT TGAATGCCTG GCAATCCTCG ATGCCGCGCT TGCTGATTTT CGACAACTGT GAAGATCCAG CCATTCTTGA GCGATGGAAG CCCACGGTCG GCGGCTGTCG GGTGCTGGTG ACGGCACGGT CGGATCAGTG GCCAACGTTG ACGCAGATTC GCTTAGGGTT GCTCTCTTCT GGCGAAAGTC GGGATCTCTT GCAGCGCCTC TGTGCGCGGT TGAGCGATGC CGAGGCTGAT GCGATTGCTG AGGATTTGGG GCATTTGCCG TTGGCCTTGC ATTTGGCGGG GAGTTATCTG GCGACCTATC CCCATCATAC GGTCGAACAC TACCGCACGG ATTTGACGAT TGCCCACCGG TCACTCAAGG GCCGTGGCGC ATTGCCCTCA CCCACGCGCC ATGAACAGGA TGTGGAAGCC ACCTTCATGC TGAGTTTTAA GCAGCTTAAT CCGACCGATG CCCTTGATGC CTTGGCCTTA GGCATGCTCG ATGGCGCGGC GTGGTGTGCG CTAGGCGTAC CGATTCCTCG TGCCTTGATA CTGGCATTCG TGCCGGATGG GACGGATGCC GATGATGCCG TTGATGCATT GCGGCGATTA CAACAGCTTG GGCTTCTTGA TGGCGTAGAT GCGGTGGTCT TGCATCGCTT GCTGGCTCAA GTTGTTCACG CACGATTAGG ATCAATGGAC ATGCTGGCCG TGGTCGAAGA TACGATTACC ACGATGGCAT CGCAGATTAA TGATAGTGGG ATACCGACCC GCATGCTGCC GCTTGCGCCG CATCTGCGGT ATGTCACCAT GCGGGCGTTG GATCGCGGTG ATGAACGTAC CGTCCGTTGC GCATATACCT TGGGGATGTT CGCGTATCTG CGAGGCGCGT ATGGAGAGGC GCAGCCACTG TTCGAACGGG CGTTGTGGCT CTGCGAGCAG CGGGCGGAGG TATCGCTGGT GACCGTCGGG TTGCTCAATC AGATGGGGCT GGTGCTGACT CACCAAGGGC ACTATGGGGA AGCCCAACAG TGTTATGAAC GTGCTGTCGC CATCGGCGCA GCGCTCATGG GGGACGATCA TGTCCATGTT CATGGGATTC GCCTGAATCT CGCCCAAGCC CTGCATGCCC AAGGACAGGT GCAGGCCGCC CAGGCACTCG TCGCGGACGC GGCGATGAGC 
GACCGCATGG ATGCCGCAAC GCGGGCAGGT TCCCTGAATC AGCGTGCGTT GCTGCTGGAG CAACAAGGCC AGTATCCCCA GGCCCAAGCA CTCTATGAAC ACGCCGTTGC CCTGGCGACC TCCGTGTTTG GCGCGGCTCA TCCCACCACG GCGAAGTAG
|
Protein sequence | MELPALIAVL VDAILAQCPQ YDRASVTVAL ESVFAGHPTL LGGNSISMLL GQNNDFTNAT ITIGDVHAGN QVHVTVQLPQ PIDPLPAALA ALASIPLKDV PAPRSDLPQA SRLPFEASPH FVGRETELKA LAQAIGTAQP AVVMPAVATG LGGIGKTSLV TEFAYRYGVY FHGGVFWLNC ADPDQVASQI AACAVGLKLD TTGLSLDEQV QRVLNAWQSS MPRLLIFDNC EDPAILERWK PTVGGCRVLV TARSDQWPTL TQIRLGLLSS GESRDLLQRL CARLSDAEAD AIAEDLGHLP LALHLAGSYL ATYPHHTVEH YRTDLTIAHR SLKGRGALPS PTRHEQDVEA TFMLSFKQLN PTDALDALAL GMLDGAAWCA LGVPIPRALI LAFVPDGTDA DDAVDALRRL QQLGLLDGVD AVVLHRLLAQ VVHARLGSMD MLAVVEDTIT TMASQINDSG IPTRMLPLAP HLRYVTMRAL DRGDERTVRC AYTLGMFAYL RGAYGEAQPL FERALWLCEQ RAEVSLVTVG LLNQMGLVLT HQGHYGEAQQ CYERAVAIGA ALMGDDHVHV HGIRLNLAQA LHAQGQVQAA QALVADAAMS DRMDAATRAG SLNQRALLLE QQGQYPQAQA LYEHAVALAT SVFGAAHPTT AK
|
| |
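
The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is supplied as plain text in the space-separated blocks of ten bases used above; `gc_fraction` is a hypothetical helper, not a function of the source database, and the rounding the database applies to reach its reported percentage is not known.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces, line breaks) between blocks is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence above, used only as a small demo.
demo = gc_fraction("ATGGAATTGC CAGCCCTCAT")
print(demo)  # 0.5
```

Applied to the full 1959 bp gene sequence, the result can be compared with the 59% listed in the GC content field.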