Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1831 |
Symbol | |
ID | 5733719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2128839 |
End bp | 2130368 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641278974 |
Product | hypothetical protein |
Protein accession | YP_001544602 |
Protein GI | 159898355 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAG ATTTCCTCCG CTACAATGAT TGGCCCTTGG TTTTAGCAAC TGCCCGTGAC GAGCAATGGA CGAACTTATT GCTGAGCTGT TGTCCACTTG AAACAAACCC TGAACTTCTG AATCTCCTGG GCCAATGTCA TAGCCTGAAT AGCCTTGGTT TATTTCAGTG TAAGTTAACC GAAGTACCAA CAGTATTACG CCAGTTACCA CTAAAGGTAT TAAGCTTGAG TTGTAATAAC TTACGCCAGT TACCCGATTG GTTAATTGAA TTGCCGCTCG TTGAATTGTC GCTTTTTGGT TGCCCTCACG TTGAATTGCC AGCAAATTTT GATCAAAGCT CAATCGAAAT CCTCGATCTA AGCTCCAACG AAATGACCGA GTTGCCAGCT ATTGTTAGGC GCATGCCGAA ATTGAAACAG CTTTTGTTGC GGGAGAATCA TTTTCAGAGC ATTCCCCGCC TGATTGCCGA TACTGCCATC GAACGGCTTG ACTTGGCTGA AAACCCAATT CGCGATTTTC GTTTGCTTCC TGCTACCCGC TTACGCATTT TGAATCTTAG CAACACTGGA TTGACAATGC TGCCGATCGA GTTGCGTCAG CACCCACTCC AAGAGCTTGA TCTTTCGAGG CTTGATGATT TACATATTCC ACCCTGGTTC GCAGAACTAA GCAGTCTTCG CATTTTAAAT TTATTCTATA GCAACTTCTC TGAGCCAGAA CTGCATCTGC CAACAAACTT GGCTGCTGTT TGGATGCAAC ATTGCAACTT AACCTCTATT CCTCAGGCGA TTCAATCCAA CCCGCAATTA CGCTCGCTCA ACCTTAGTGA AAACAATTTT GCCGATGCTG AGTTGGTGGT CAATGGCCCA TGCGAATTTA TCAACTTATC GGAATCATCG TTAGGTGAGC TTATTTTAGC CGATCACGCT CTATCAAGCC TTAAGAGTCT TGATCTTCAA ATGGCAAACG TTAAGCAGGT TCGGAATTTA GCGAAGTGTC GAAATTTAGG CGGATTGCGT TGCGTTGATG CCTATTTACT TGCAATGCCC ACTACGCCTG AGTGGCTGAA GAGTCTTCGT TACTTGTGGC TCAATGGCGA CTTTTCTAAC GAACATATTC CGTCGTGGTT TTGGCAGCTT GAGCAACTCC AATCACTCCA TTTGCAATCG CCAGATTGGA CACTGCTTGA CCCGCGGATT GGTCAATTGA GTGAACTCCA AGATTTGTCA ATTTATAGTA CACGGTTTGA GCATGTTCCA ATCAGTTTAT TACAATTAAC CAAATTGCAT AGGCTACAGC TGCATCTTGC AGATCCAAGC CATTTTAATT GGCTAGCAAG TTTGCCAGCA TTACACGAGT TGGGTTCGTA TCCTTATGGT CAGAAACCAC CGTTGGCGAT TCAAGCACGA GCTGCTGACG CAGATTTTCG TTATCATAAC CAATTTGCAG AGAGTAACGA TGACGACGGG TTGTATCCTC ATGCTGATTT GGCTTGGCTT GATCGGTATT TTCCCCAAAA CGATCGTTAA
|
Protein sequence | MTEDFLRYND WPLVLATARD EQWTNLLLSC CPLETNPELL NLLGQCHSLN SLGLFQCKLT EVPTVLRQLP LKVLSLSCNN LRQLPDWLIE LPLVELSLFG CPHVELPANF DQSSIEILDL SSNEMTELPA IVRRMPKLKQ LLLRENHFQS IPRLIADTAI ERLDLAENPI RDFRLLPATR LRILNLSNTG LTMLPIELRQ HPLQELDLSR LDDLHIPPWF AELSSLRILN LFYSNFSEPE LHLPTNLAAV WMQHCNLTSI PQAIQSNPQL RSLNLSENNF ADAELVVNGP CEFINLSESS LGELILADHA LSSLKSLDLQ MANVKQVRNL AKCRNLGGLR CVDAYLLAMP TTPEWLKSLR YLWLNGDFSN EHIPSWFWQL EQLQSLHLQS PDWTLLDPRI GQLSELQDLS IYSTRFEHVP ISLLQLTKLH RLQLHLADPS HFNWLASLPA LHELGSYPYG QKPPLAIQAR AADADFRYHN QFAESNDDDG LYPHADLAWL DRYFPQNDR
|
| |