Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5126 |
Symbol | |
ID | 5737084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 173372 |
End bp | 177967 |
Gene Length | 4596 bp |
Protein Length | 1531 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641282291 |
Product | hypothetical protein |
Protein accession | YP_001547882 |
Protein GI | 159901636 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGATC GCGCCCTGCG TCTTGCCCTG GAAGCCTTGC TGGATGAACC CACCCTGAAA TCAACGCATC GCGCGTCGAT TGAGCGTGGC CTTACCGATC TCAAGACGCA GCGCATAACT CCCGATGAGT GGGAATATCT TTGCTATCTC ATCGAGCACT ATGGAGCATC AACGCCGCTA CCTGCCGATC CCGTGGCCGC ATTGATCGCC GCCTTTCAAG ACAAATTACC GTCTGTTGAT CCCATAGCCT TGAGTACGGC GATCCACGCC CTCGTGGGGG GTGATACCGC CCAGCTTGAT GGCAAGTCGC TCGCGCTTAC GATTGAACGC CGCGGGCAAA CCCAGATTGC GAACTTGGCG GAACGCAATG TCGTGATGGG GAATCTGAAT CAGCTTGATA TTTCCGTACA TATCCCAATC CCTGATGCCC GCTTTGATAG CCTGATGCAG GCCGCGTTCC AGCGCCAAAC GCATGAACGC ATCACCAACG CTGCCCAACA AACCGAGAGC GATCGCCTGC TGAATCTGCG CCTTCAGGGC TTTGTTGGTC GGGACAATGA ACTAGCAGCC ATTCGTGCGC AGATTGCAAC GGTACGACCC ACGGGCGGCT ATGTATTAAT CAAATCAGAA GCAGGTGAAG GTAAGAGCAG CATCATTGCT AAACTGATTC AGGATGCGGG GTTTGCGCAA ACCCCGCATC ACTTTATTGC CCTGACTACG GGCCGAGACT ATCAATTGAG CTTGCTCCGC GCGGTGGTGG CGCAATTGAT TCTGAAGCAT AGGCTGCCTG TTTCATACTT CCCCGAAGAA AGTTATTCCA CAATGAAGGG GGAATTTGCC CGTATTCTTG ACGAGTTATC CAAACACGGC ATTCAGGAAA CAATTTATCT CGATGGCCTC GATCAACTTC CACCGGAGGG GGATAGGTTG ATCGATCTAT CGTTTTTGCC ATCACAGCCG CCACCAGGGA TCGTGATTGT ACTTGGGTCG CGACCGGATG AAGCCTTCAA ACCGTTACAC CACTTGAACA AAGCTGCGTA TCATTTACCA CCAATGAGCG AAATAGATGC ATTCACCGTA TGGCGATCAG TCCAGTCTGG CGTGGCAGAT ACTTTGTTTC ATGACCTCTA TACCGCATTA AAAGGCAATG CATTGTTTGT CTATTTAGCG GCGGATACGA TACGCAATCA ATCTGTGGTC GATATGGCCA GTTTAATTCG ACAAATTGAG CAGAATCCAA AGGACATATT TGGTATTAAA CTGGAGCGGA TCAAAAACCG TTCTGAGCAT CGTTGGCCAA CGATATGGAG GCCCATGTTG GCGCTCTTAG TCGTCACACA AGAGCCATTA CCTATGAGGG TTATTGGGCG GTTATTGGGG CAACCACGGG CCGTGATTCA GGATGCCGTA TGGGTAATGG GTGGCTTGGT CAGCCAAGGC AGCGATCAAC GGGTGGCGCT GCATCATCTG CTGTTTCGTG AGTATATCCA AGCACACGAG TTTGATAGTG AGGAACTTCA TGACATACAT CAACGGCTGG CCGACTGGTG TGCGCAGGAT GGGGATGCGA TTTGGGCCGA TCATCGTGAT GCATTAGAGC AAGCGCGTCG GGTCTATGCG CGGCATCACT ACATCACCCA TCTTGCACTG GCCGAGAATT GGCCAACACT CTGGCGGGTT TTGGATGCAG GTGACTATGG CGCACAGAAA ACGCGCTTTG ATCCCAGTAT GCGGCTCTAT GCGCTGGATT TGGATCGGGG GCGTGAGAGT GCCATTAAGG CTGGGCAATC GACCGAAGAA CATATCCAGC ATTTACCACG CCTGTGGAAG TATAGTTTAT TGCGGACAAG TTTAACCAAT CGCGTCGATC AGTGGCCAAA TAAAGTGTTC GCGATTTTGG CAACGATTGG GCAAACACAC GAGGTATTAG AACGAATAGA GCTCCTTTCG AGCACGAGGA AGCAGGTTCA GTGTTGGGAT ACGATTCTTC CATGGTGTGA TACACAACAA CAGAAGACAA TGCTTGTCCG ATTGTATCAA GTAGCTGGAC ATCTTTCGGG TATCGATGAG ATTTTCACCC TCAGTATCAT TGCGAAAATA ACCATCATGC TTGGAGAAAA TGAACAGGCG ATAGGGATTC TTGACCATGC CCTTGCCATT GCCCATACCC TTAAAACTCC AGAACAATGT AACGAGGCAA TCCGTGCCAT TACGAAAATA ACGACAATGC TTGCTACGAA TGAACATGTA CTAGGGATTC TTGACCGATC CTTCACTATC GCTCAATGCA TTGACCATGC AAACTATCGT TCAGAGGCCC TTAGTGTCAT TGCAGCAGCT ATTGCTGTTC ATGGTGATAG AGGCAGAGTT CTGACACGCA ACCAGATGAT TGATACTTCA GGACGACGCA ACGAAACCAT CAGTGTCATT GCAGCAGCTA TTGCTGTTCA TGGCGATATC GACTATGCTT TGACCATCAC CCAATCTATT GATAATCCAG TACAATATAT CTTGGCCATT GCTTTCATCG CAAACATACT GACCATATTG GATAAACCTG AACACGTACT AGGGATTCTT GACCGTGCCC TGTTTATCGC CCAGTCCATT GATAATCCGG AACAACGTGA CAAAGCCATG GGTGCCATTG CAGCAGTGAT TGTGACCCAT GGCGATATCG ACCGTGCCCT GTCTATCGCC CAATCCATTG ATAATGTATG GCAGCGTGAC AGCAGCCTTG CTGAAATTGC GGTAGTCATC GCTACTCATG GCGATATCGA CCGTGCCCTA TCTATCGCCC AATCCATTGA TCCTCCAAAG GGGCATACAT TGACTCTCGC TGCCATTGCG GTAGCCATCG CTACCCATGG CAATATCGAC CGTGCCCTGT CTATCACCCA ATCTATTGAA CACACAAAGG AGCGGGCTGA GACCATGAGT AGCGTTGCAG TAGCCGCTGC TACCCATGGC GATATCGACC GTGCCGTTAT TATTGCTCAA TCTATTAACT CATGGCAATA TTCTGAGACC ATGAGTAGCG TTGCAGTAGC CGCTGCCACC CATGGCGATA TCAACCGGGC CCTCTCCATC ACGCAATCTA TTGAGTCCAC AAAGGAGCGG GCTGAGACCA TGAGTAGAAT TGCAGAAGTT GCTGTTACCT ATGGACAACA TGAACAAGCA CTGGCGATTC TCAACATGGC CCTATCATCC ATCACCCAAT CCGTTAACGA GGCATGGCAA CAACGTGATG AAGCTATCAA GGCTATTGCC GTAGCCATCG CTACCAATGG TGATATCGAC CGTGCCCTGA CCATCGCTCA ATCCATTGAT AATATATGGG AACGTTATCA GACACTTGGT ATCCTTGCTG AAGCTATTGC TGGCTATGGC GATATCGACC ATGCCCTGAC CATCGCTCAA TCCATTGACG ATCCTGTGCA ATGTACCTGG GCTCTCGCTG CCATTGCAGA AGTCACTATT ACTCGTGGGC AGGATCAACA AGCATTAGCG ATTCTCAACA TGGCCCAGAC CATTACACAC GCCATTGACA ACGCAAAGGG ACGGGCCATG GCATTCCCTG CCATCGCAGA AGTCACGGCC ACCCATGGCG ATATCGACCA TGCCTTGTCC ATCACTCAAT CTATCGATGG GTCATGGGAA CGTAATAGGA CTCTTGCTGC CATTGTGGAA GCCATCGCTT CCCATGGCAA TATCGACCGT GCCCTGTCTA TCGCCCAGTC CATTGATAAT CCGGAACAAC GTAACAAAGC CATGGGTGCC ATTGCAGCAG TGATCGTGAC CCGTGGCGAT ATCGACCGTG CCCTGTCTAT CGCCCAATCC ATTGATAATG TATGGCAGCG TGACAGCACC CTTGCTGAAA TTGTGGTAGT CATCGCTTCC CATGGTGATA TTGATCGTGC CCTCACCATC GCCCAATCCA TTGATCCTCA AAAGGAGCGG GCTGAGGCCC TTGCTGCCAT CGCAGAATTC ACTGTTACCC CTGGGCAGCG GAAACAAGCG TTAGAAATAC TCAACACGAC CCTAACTATT TTCCACACTC TTGACGATCA ACTAGGATTA CACGATGAGG TTATCGGTGC CATTGCGGTA GCCATCGCTT CCCATGGCGA TATCGACCGT GCCCTGTCCA TCACCCAATT TATTGGCAAT ACATGGCCAC GACGTGACAA TACCGTGGGT GCCATTGCAA AAGTCGCAGC CACCCATGGC GATATCGACC GTGCCCTGTC TATCGCCCAA TTCATCGATA ATCTCGATCG ACGTGATAAT GCCGTGGGTG CCATTGCAGA GGCTACTGCC ACCCGTGGCG ATACCGATCG TGCCCTGACA ATCGCTCAAT CCATTGATGA TCCAACTCTA CGGGCTCAAA CCTTTCACGT GATTCTTCAA AAAGCTCAAT CAGTCATAGG GATTTTAAGG ATACTTCATC ATGAATGGTT TCGGAGCCTA AGATCTACCG ATGTATGGGC AATGACAGTG ATTTCGACTC CATTAATAAA CGAATATCCA TGGCTTGGTG TAACCATGCT TGATGCAGAG GAATGGGTGA ATGTACAGTT GAAACGGCTG GAGTAA
|
Protein sequence | MDDRALRLAL EALLDEPTLK STHRASIERG LTDLKTQRIT PDEWEYLCYL IEHYGASTPL PADPVAALIA AFQDKLPSVD PIALSTAIHA LVGGDTAQLD GKSLALTIER RGQTQIANLA ERNVVMGNLN QLDISVHIPI PDARFDSLMQ AAFQRQTHER ITNAAQQTES DRLLNLRLQG FVGRDNELAA IRAQIATVRP TGGYVLIKSE AGEGKSSIIA KLIQDAGFAQ TPHHFIALTT GRDYQLSLLR AVVAQLILKH RLPVSYFPEE SYSTMKGEFA RILDELSKHG IQETIYLDGL DQLPPEGDRL IDLSFLPSQP PPGIVIVLGS RPDEAFKPLH HLNKAAYHLP PMSEIDAFTV WRSVQSGVAD TLFHDLYTAL KGNALFVYLA ADTIRNQSVV DMASLIRQIE QNPKDIFGIK LERIKNRSEH RWPTIWRPML ALLVVTQEPL PMRVIGRLLG QPRAVIQDAV WVMGGLVSQG SDQRVALHHL LFREYIQAHE FDSEELHDIH QRLADWCAQD GDAIWADHRD ALEQARRVYA RHHYITHLAL AENWPTLWRV LDAGDYGAQK TRFDPSMRLY ALDLDRGRES AIKAGQSTEE HIQHLPRLWK YSLLRTSLTN RVDQWPNKVF AILATIGQTH EVLERIELLS STRKQVQCWD TILPWCDTQQ QKTMLVRLYQ VAGHLSGIDE IFTLSIIAKI TIMLGENEQA IGILDHALAI AHTLKTPEQC NEAIRAITKI TTMLATNEHV LGILDRSFTI AQCIDHANYR SEALSVIAAA IAVHGDRGRV LTRNQMIDTS GRRNETISVI AAAIAVHGDI DYALTITQSI DNPVQYILAI AFIANILTIL DKPEHVLGIL DRALFIAQSI DNPEQRDKAM GAIAAVIVTH GDIDRALSIA QSIDNVWQRD SSLAEIAVVI ATHGDIDRAL SIAQSIDPPK GHTLTLAAIA VAIATHGNID RALSITQSIE HTKERAETMS SVAVAAATHG DIDRAVIIAQ SINSWQYSET MSSVAVAAAT HGDINRALSI TQSIESTKER AETMSRIAEV AVTYGQHEQA LAILNMALSS ITQSVNEAWQ QRDEAIKAIA VAIATNGDID RALTIAQSID NIWERYQTLG ILAEAIAGYG DIDHALTIAQ SIDDPVQCTW ALAAIAEVTI TRGQDQQALA ILNMAQTITH AIDNAKGRAM AFPAIAEVTA THGDIDHALS ITQSIDGSWE RNRTLAAIVE AIASHGNIDR ALSIAQSIDN PEQRNKAMGA IAAVIVTRGD IDRALSIAQS IDNVWQRDST LAEIVVVIAS HGDIDRALTI AQSIDPQKER AEALAAIAEF TVTPGQRKQA LEILNTTLTI FHTLDDQLGL HDEVIGAIAV AIASHGDIDR ALSITQFIGN TWPRRDNTVG AIAKVAATHG DIDRALSIAQ FIDNLDRRDN AVGAIAEATA TRGDTDRALT IAQSIDDPTL RAQTFHVILQ KAQSVIGILR ILHHEWFRSL RSTDVWAMTV ISTPLINEYP WLGVTMLDAE EWVNVQLKRL E
|
| |