Gene Information |
Locus tag | Haur_3321 |
Symbol | |
ID | 5735191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4186112 |
End bp | 4189537 |
Gene Length | 3426 bp |
Protein Length | 1141 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280468 |
Product | TPR repeat-containing protein |
Protein accession | YP_001546085 |
Protein GI | 159899838 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
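The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4186112
end_bp = 4189537
gene_length_bp = 3426
protein_length_aa = 1141

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon, which adds no residue.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 3426 0 1141
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.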
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTTGA AGGAGTCTGC CTTGGATTCA CGCGAGACAA CGATCAGCCT ATGGTCGCGG CGGGTGATGG AAGCCTGCTG GCTATTGGCT TTAGCAATGA TCCCGGTCTA TTTTAGTTTG CTCAGTGACC GCCACTTTGA GCCAGATAAA GCGGTAGCCT TGCGCTCAAT CGTGATGATC CTTGGCGGTG CGTGGATTAT CAATTGGCTC GAACGTGGCC AAGTTTTTCG TTCATGGCCG CGTTGGCGCG ATTGGTGGCG CTCGCCGTTG GTTGCTCCGG CTATCGTTTA TGTTGGGGTC TTTTTCTTTA CCACGCTCAC CTCGGTGTTG GTGTTTACCA GCTTTTTTGG CGGCTATAAT CGGCTTCAAG GCTTTTACAC CAATTTCTCG TATGTCGTGG TGTTTGGGGC GATGCTGGCG CATGTGCGTC GCCGCGAGCA ACTTGAGCGG ATTATTACGG TGATTATTGC CACAACCTTG CCAACCTTGG GCTATGGTTG GGTGCAATAT CAGCGCAGCG ACCCCTTGCC GTGGGCTGGC GATACTGCGG CACGGGTTGC CTCAAGCATG GGCAACTCGA TTTTTGTGGC AGCCTATTTG ATTTTAACCC TGCCCTTTAT GTTCTATCGC TTGATTACCA GCGTGATGGC TACCAAACGT GCTGAGAGCG AAGCTACAAA CTCGGTTGGC TTAGATGCAG CGTGGTTTGT TACCTTGGGC TTGATTCCGC TTGGTCAATT AAGCCTCTTG TATGCCACCC TCAAGTTGGG AGCATTGCTG CAAGCGCCCT TGTTGGGCAT CGGCCATTGG TGGATTTTCC CAATGTCGGT GATTGTAGTA GGCAGTACCT TGCCCTTAAT TTCGTGGGTA ACCAGCACCC GCAGTCGCAG CGATTGGCGC TTATATCTGC CCGGCGGCTT GATTTTGCTG TATATGCTGA GTTTAGTGTT GGGTGGGTAT TTAACTGCCG ACCAATGCCA AGGCACAATT AGCGAAACCT GCTACAACCT TGATATGGCA ACTGCTAAGC GTGCTAGCGA CTTTCGTACC TGGTTCTTGT TGGCGATGGC GGCCTATTTT GGTTTTTATG GCTTAGTGCT AGCCTTGCCG CGCCGCAGCG AAGCAGCAGC CCATGCAATT GTGCAATGGC TCAGTGCTGC AATCTATGCT GGGTTAAGCC TGTTCACGAT TGTCATTATC TTCTTTACCC AAAGCCGTGG CCCACAAATC GGCATGTTCG TAAGCATTTT CGTCTTCTTC ACCCTCTTTT TGTTGCAAGG CCTACGCAGC ACCAGCTTCA AACGCATTTT TGGCGCGGCG CTGAGTGCAT GGGTTGTGCT GGCCTTAGCA GGAGCCGCCT TCTTGGTGGT GCTTAATACA GACTCAAGCA GTTTTAGTGG GCTACGCCAA AGCAACCGCT ATATCAGCCG TTTGGGCAAC TTGCTTGAAA CTGATGGCGG CACTGGTTTG GTACGGGTGT TGATTTGGCG TGGCGATGAG CATACCCAAG GCGCAGTTGG TTTGGCCTTG AGCGATCCGT TACGCACAGT AATTGGTTGG GGGCCAGAAT CGATGTTTGT GGCCTACAAC CCATTTTATC CACCACGGTT GGCCAATTAT GAATCGCGCG GGGCTTCGCC CGACCGTTCG CACCAAGCCT TGTTGGATGA ATTGGTGACC AAAGGCGCGA TTGGCTTGTT TAGCCATTTA TTCTTATTTG GCTCATTCTT AATTATTATG CTGCGGCTAC TGTGCATTCC ACGACTGATC AATTTAGGCC TAACCAGCCT TTTGATGCTT 
GGAATTGGCA TCTTCTTTGC CGTCTTTTTG AAGAGCCTCG CACTTGGTTT GATCGCTGGT AGTGTGGGTT TGTTGGTGGT TGGCCTCGCC ACGTGGTTGG GCTATGCCAA GCCGCTCGAA GCATCGCTGA GCTTTACCTG GCAATTGTTG ATTATCACGG TGCTTTCGGC GGTTGCCGCC AATTTTGTCG AAAACCTGTT TGGGATTCCA ATTGTTTCGT CGTTACTCTA TACCTGGGTG ATTATGGCGG TTGGCATACT TGCCGGGGCA CACGCTGGAG CCTATCAACT TGGCACAAAG CCAGTGGTTG TGGCAGCACC AGTTGTCGAA GAAGCCGCCG AATCAACCCC AGCCAAAGCT GGCACCAAAC GCCAAGCAGC CCAAAATGCT CGTCGCACCC CTGCTGGCCG TGGTCGTACC AGCAGCGGCG TAACTGGCGC TGCACCTGCT CGCATTTTGT ATGCAGTCGT TGTGCCAATT GTCTTGCTGT TGGTCTGGTT CCTCAACCTC GATAATATTT TTGCTGATAT GCGCTATTTG CAGGGCAAAC AATTTATCGA CCAAGGCCAA GGCCTTGATC AACATCTTTT GGGCTTTGCC GCGATTCAGG ATGCAGTTGA GCATGCACCC AACGAAGATT TATATTTCTT GATGTATGGT CGGGCCTTGA TGACCTTGGC AACTGATTTG AGCATTGAAC AAAACAAATT GGTTAGCGAA AATCAAAATG CGGCGATTGC CCAAGTACTG AATAGTCGCC CACGGCCTGA TGCTGAACTC GCCGATTTGC CCGATGCTGA ATATAGCGTG GCTGGTTTGC AAACCGTTGC CCGCGATTTC TTGACCAAGT TTGGGCCATT GCAAGTGCTC GATTATGCCC GCTTGGCCTT GGAAGAAGCC CAGCGGCTCA ATCCCCAGAA CAAAGATCAT CCGGCTAACT TAGGCCGTTT GCATTCATCA TGGTTCCGTA ACACCGAGCA AAGCGACCCT GAAGGCGCAC GGATGCACCT TGATGCCGCG ATTGAGGCCT ACAAACAAGC GCACACGGTC GCGCCCCAAG ATGTTGAGTT GACTGGTCAA TGGGCTATGT TGTATTTGTA TCGCCAAGAA TACGATACAG CGATTGCCGA ATTGACCAAA GCGACTACGC TTGATCCATT ATGGTCGCTC AATTTCATTC GCTTGGGCGA GGCCTACCGC CGCAAGGGCG ATTTGCCCAA CGCAGCTTTA GCCTTTGCCA ATGCCTTGGC GCTTGATCCA CGGGCGCTCA GCAGCAGTGG CTTGGTTGAC GTAGCCGAAT TGCCAGCCGA ACGCACAGCC CGTGTCCAAG CAACCTTTGC CTCGATGCAA AGCGATCCTG CGGTGTTTGA TAGCTTCTTG ACAGGCTTTG AGCGAGCGAT TGCCAGCAAG CCAGGCGATA TGTCGTATCG CCAGATTTAC ACCCAAGTGT TGAGCGATAG CCAACGCTAT GATGCAGGCT TGACCCAAGT CCAACTAGCT TTGGCCGAAA TGGACAAAAT GGGTGCAGCT GATCCGACAT TCAACACCAC CTATGCTGAT ACCCGAACGG CCTTTGAAAA GCTAGTTAGC TTTTTTCAAA GCCAACTCGG CCAAAGTAAA CCATAA
|
Protein sequence | MPLKESALDS RETTISLWSR RVMEACWLLA LAMIPVYFSL LSDRHFEPDK AVALRSIVMI LGGAWIINWL ERGQVFRSWP RWRDWWRSPL VAPAIVYVGV FFFTTLTSVL VFTSFFGGYN RLQGFYTNFS YVVVFGAMLA HVRRREQLER IITVIIATTL PTLGYGWVQY QRSDPLPWAG DTAARVASSM GNSIFVAAYL ILTLPFMFYR LITSVMATKR AESEATNSVG LDAAWFVTLG LIPLGQLSLL YATLKLGALL QAPLLGIGHW WIFPMSVIVV GSTLPLISWV TSTRSRSDWR LYLPGGLILL YMLSLVLGGY LTADQCQGTI SETCYNLDMA TAKRASDFRT WFLLAMAAYF GFYGLVLALP RRSEAAAHAI VQWLSAAIYA GLSLFTIVII FFTQSRGPQI GMFVSIFVFF TLFLLQGLRS TSFKRIFGAA LSAWVVLALA GAAFLVVLNT DSSSFSGLRQ SNRYISRLGN LLETDGGTGL VRVLIWRGDE HTQGAVGLAL SDPLRTVIGW GPESMFVAYN PFYPPRLANY ESRGASPDRS HQALLDELVT KGAIGLFSHL FLFGSFLIIM LRLLCIPRLI NLGLTSLLML GIGIFFAVFL KSLALGLIAG SVGLLVVGLA TWLGYAKPLE ASLSFTWQLL IITVLSAVAA NFVENLFGIP IVSSLLYTWV IMAVGILAGA HAGAYQLGTK PVVVAAPVVE EAAESTPAKA GTKRQAAQNA RRTPAGRGRT SSGVTGAAPA RILYAVVVPI VLLLVWFLNL DNIFADMRYL QGKQFIDQGQ GLDQHLLGFA AIQDAVEHAP NEDLYFLMYG RALMTLATDL SIEQNKLVSE NQNAAIAQVL NSRPRPDAEL ADLPDAEYSV AGLQTVARDF LTKFGPLQVL DYARLALEEA QRLNPQNKDH PANLGRLHSS WFRNTEQSDP EGARMHLDAA IEAYKQAHTV APQDVELTGQ WAMLYLYRQE YDTAIAELTK ATTLDPLWSL NFIRLGEAYR RKGDLPNAAL AFANALALDP RALSSSGLVD VAELPAERTA RVQATFASMQ SDPAVFDSFL TGFERAIASK PGDMSYRQIY TQVLSDSQRY DAGLTQVQLA LAEMDKMGAA DPTFNTTYAD TRTAFEKLVS FFQSQLGQSK P
|
| |
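The relationship between the gene and protein sequences above can be spot-checked by translating the first three 10-base blocks of the gene sequence. This is a minimal sketch, not the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names, and the table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons. Table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: internal codons of translation table 11 use these
# standard amino-acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record)
    is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_blocks = "ATGCCTTTGA AGGAGTCTGC CTTGGATTCA"
print(translate(first_blocks))  # MPLKESALDS
```

The result matches the first 10-residue block of the protein sequence. Passing the full gene sequence to the same function would allow the whole translation to be compared against the listed protein.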