Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5011 |
Symbol | |
ID | 5736970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 17604 |
End bp | 20459 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641282178 |
Product | hypothetical protein |
Protein accession | YP_001547769 |
Protein GI | 159901523 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.261042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCGAC GTTGGATACT TCTATGTTTG GTTATTGGCA TTGGGTTGTC GGTAGTGCCG CAGGATACCT CTGCCCAATC AGCGGTCTAC TATACGTTTG ATGCACATTC GTTATTGATG AATCAATCTG ATGAGAATTT TGATAAAGTC GCCTTTTTGT TATCGGTACA AGGGAATGTG AATCGAGAAA GTCCTCGACT CTTCATTAAG CAACCACAGA TGATCAGCGT GAAAGGAAGT GATTATAGCC CTGATACACT ATGGAAAAAC GCTATCCAAA GTTCATTCAT TTGGCTTAAC CCGCGACAGT ATCAGGAAAC AGTGTTAACC GATATTAATC AGGTTATTAC CACCTTCGCG GCATCATCCT ATGGCATTGA TGGAAGTGTC GTATGGGATA AGGATCGTCC TTGGACATTG AATATTGCCG CATCAATTGC TGGTGCTCGA AATCTCGCTA TTGTCCGCAA GCAAAGCCCG ATCTATGCGA CGCTTACCGC ACACTATCCT GTCGTGGTTG ATTTAACGAC GGATCCGATT CAGGGTTTTA TGACGACGAA ATCGAGTGCG TATCAATGGC TCCTCCAAGA GTACCTGTTG AATTCTGCGA ACCCGCATCG ACTGGAACCA ACGCTCGCAA GCCTCAAAGA TGGCTATGCC CTCAAAAATC GGGCAATGAA TACCCTTTAT TTTGGCCGAT GGATTAGTCT GGAAATGGAT GCTGCCATTG CTCGAAAGGC ATTAATTTTT GATCTTTCCC CTAATGCCGA TCTGGGATCG CCAGAAGAGC AGTGTCAAGC ACCAAGTAGT GATTATACAA CGTTGACGAC CTTGCTTCAG AATGTTCGTA ATCGCAAAGG AACCCAGCCT ATCGAAGTGA TGGGATTTTT AGATTTTCGC TATATCTATT GTGAAGGCAA TGGTCAGCCC CATCCAACGG ATGAGCAGCG ACGGTTACAA AATGTTGTGA CAGCGTTAGA GCATCCATTT GCAAAAGTGA TTTCCCAGTA TGGTGGCATG ATGACGGTTG GTGGGATTGG TCCAGCTGAT GCTGCGAATG GATCATTTTT TCGTCATAGT CCTGGTGTGC GGTATATCCC CCAATCCCCA GCAATGACTC CTGAAACCTT GCTCAGAAAT GATTATGCTG ATGGCTATCC CCTCAATTTT TCATTTGAAA AAGGTGGCGT AAGTCAATGG ACGATGTGGA CGACTAATTA TGCCACCTAC GCTGGAACAA ATCTCCCCCA CGGATCAACG TTTCTTGAAA TGAATACCAG TACGACTGAC TGGCAGAATG GGAAGAATAC GCTGTATCAA GATGTGCCGA TTGCGCTCTT ACGCGGATCA CGGTATCAAC TCCGGCTTAG TGCTCGGCGG AATCCGAGCG AGGTGGGTAG TATCCAAGGT GGAATTGCCT TGTGGGGGAA ACGCGCTAAT GGAAGTTATA CCCAACTCAA TCACTGTCCC TTCACGCTTA CAAGTGGATC ATGGGTTCCC ATTGCATGTG ATACCGATAT TCGTGAAGAT GGACTCCATG GGATACGACT TCAAATTGCC CTTTATACAC CGGATAAGAA CTATGATTTT GATGCGATCA CCTTCCTTGG TCCGAATACC TTGCGCGTGA ATCCGACGAA GACCTTTGGC TTATTCTACA TGGGCGATTA TGATGGACCT GGTGCGGCTT ACAGTACCTT GATGGCCGAT GTGAACGATG AGAAAAGTTA TAATGAACTG ATTTGGACGA GCAAAATAAC GACCACCGTT CCCGTTGCGT GGGCAATTGC ACCGAGCTTT CGTGATGCCT ATCCCAGTGC CTATGCCTAT CTTGCAAAAA CCAAAGGGCC ATATGATTAC TTCATGATGC CCAACTCAGG GCCAAATTAT ACCAATCCCC TGCATTTTGA TAGCCTAGCG AGTGGTCGCC CACGGGTTGG GCCTTTCAAA CAGCAAACGG CAACCCTCAA TCGCGAGGTG GGGTATCGCG TGGGATGGGT CTTGGATGGT GCAGAAAGTC ATCTCAGTTA CAGTGATCCC ACGGTGCGGA GTATTTTTAA GATTGCTACC CCCGATGGCT ATATTCACAA TAGCAATGTT CCACCAATCG GGCCAACTGA TCCGTCGGTT GCAACCTATG ATGGACATGC GGCGATTTTA CGCAAAACAA CCGATTTAGT GGATCATGCA ACCACAGACA CAGATGGTGC TGATCGCTTA ATTGCACACG TCCTCGCACC ACAAGCCGCA CAATTTCAAG TCTATCGAAG TATTTTTGTC TCATCAAACT TCATTTCGCA CGTCGTGTCC ACCGCGAAGA CAAAAAACCT TCAGTTTGCG AATCGATTTG CCGCCCTTGA TCCGATGAGT TTCTTTGGTT TGTATAAGAG CCAGCATGGA CTGTATCCAC GCTTACGGAT GAGTATGGTC AGTGATACCC TGCCGCAGGT GATGTACACT GGTCAATCGT ATGCCGTCCA GGTCACGATT CGGAATGATG GATGGGATAT CTGGCGACCA AAGCCCTCAG GCGCAACCGA TTGTGATGGG AGTGGGCTGG CATATAAAGG ATGTGATCGC TTTGTTTGGA CATTCCAACC GCCAACCAAT CCCATTATTC CGACTGGACC AGGTGCGATT CCAACCGTTA CCTATCCATC GGGAAATCGG ATTGATTTTG GAACAACAAT TGCTCCAGGT GCAACGACCA CGGTGAACCT GATGTTGACC ATTCCTGCGA ATGCGACACT TGGCTATCAC ACCTTCCAAG CAGATCTGGT TCAAGAAGGG TATGGATTTG GGGAAACCTA TGGGAATCAG CCATGGCAAG GACGTGTCCT CGTGGCTACA CCCTAG
|
Protein sequence | MLRRWILLCL VIGIGLSVVP QDTSAQSAVY YTFDAHSLLM NQSDENFDKV AFLLSVQGNV NRESPRLFIK QPQMISVKGS DYSPDTLWKN AIQSSFIWLN PRQYQETVLT DINQVITTFA ASSYGIDGSV VWDKDRPWTL NIAASIAGAR NLAIVRKQSP IYATLTAHYP VVVDLTTDPI QGFMTTKSSA YQWLLQEYLL NSANPHRLEP TLASLKDGYA LKNRAMNTLY FGRWISLEMD AAIARKALIF DLSPNADLGS PEEQCQAPSS DYTTLTTLLQ NVRNRKGTQP IEVMGFLDFR YIYCEGNGQP HPTDEQRRLQ NVVTALEHPF AKVISQYGGM MTVGGIGPAD AANGSFFRHS PGVRYIPQSP AMTPETLLRN DYADGYPLNF SFEKGGVSQW TMWTTNYATY AGTNLPHGST FLEMNTSTTD WQNGKNTLYQ DVPIALLRGS RYQLRLSARR NPSEVGSIQG GIALWGKRAN GSYTQLNHCP FTLTSGSWVP IACDTDIRED GLHGIRLQIA LYTPDKNYDF DAITFLGPNT LRVNPTKTFG LFYMGDYDGP GAAYSTLMAD VNDEKSYNEL IWTSKITTTV PVAWAIAPSF RDAYPSAYAY LAKTKGPYDY FMMPNSGPNY TNPLHFDSLA SGRPRVGPFK QQTATLNREV GYRVGWVLDG AESHLSYSDP TVRSIFKIAT PDGYIHNSNV PPIGPTDPSV ATYDGHAAIL RKTTDLVDHA TTDTDGADRL IAHVLAPQAA QFQVYRSIFV SSNFISHVVS TAKTKNLQFA NRFAALDPMS FFGLYKSQHG LYPRLRMSMV SDTLPQVMYT GQSYAVQVTI RNDGWDIWRP KPSGATDCDG SGLAYKGCDR FVWTFQPPTN PIIPTGPGAI PTVTYPSGNR IDFGTTIAPG ATTTVNLMLT IPANATLGYH TFQADLVQEG YGFGETYGNQ PWQGRVLVAT P
|
| |