Gene Information |
Locus tag | Haur_5099 |
Symbol | |
ID | 5737057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 126357 |
End bp | 128429 |
Gene Length | 2073 bp |
Protein Length | 690 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641282264 |
Product | hypothetical protein |
Protein accession | YP_001547855 |
Protein GI | 159901609 |
COG category | |
COG ID | |
TIGRFAM ID | |
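
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 126357
end_bp = 128429

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2073, matching "Gene Length"
print(protein_length)  # 690, matching "Protein Length"
```

Both results agree with the Gene Length and Protein Length rows, and the gene length is an exact multiple of three, as expected for an unspliced CDS.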
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGCA TCCGTATCGT CGCGTTGGTG TTCATTGTCG TGCTCCTTGG ATGGTTCGCT CCACTTTATG CCCAACAATC AAATGATGTC ATCATTCAAC CGCATGCCTT CGTTGGTGAC ACGTTTACCT ACCAAGGTCT CCTTATGCAA GGAACCACCT ATCCCAGCGG AACCTTTGAT TTCCAATTTA GCCTCTATGA TGATCCGACC GCAGGTACGC TGCTTGGGCA GCTTCAACAG GACGACGTTC CCGTTGAAGC TGGCCAATTT ACCGTCGCCT TGACCTTCCC CGAAGGCAGC GTTACCGGTC ATCAACGCTG GTTGGCGATT GCCGTCAAAA CCCTGAATGG CAGCGCCTAT GTTCCCTTGA ATCCACGCTC CGCCGTCAGC GCAGCGCCGA TCGCCCTCAG TCTGCCCGGC CTCTGGACAC GGCAAAACGA TACCAGCCCC AACCTGATTG GGGGCTATAG CAGCAATACC GTTCCAGCGA ATGGCGTTGG GATGACGATT GGTGGCGGGG GAGCCTTTGG GAATCTTCAA CAAATCTATG ATCACTATGG CGTGATTGGC GGTGGGGCTA ATAATCGCGT GGGCAGCGAT GATGGCACTG TCACCAACGA CGGCTATGCC ACGATTAGCG GTGGGTTTGG TAACACGACG ACCCAAGAAT ATACCGTGAT TGGCGGTGGG CAGGCGAATA CCATCACGGG CGCATTCTCG ACCATTAGCG GTGGGACGAC CAATACGATT GCGCATATCT ACGCGACCAT TGGCGGCGGG ATGAACAATC GCGTTTCTGC CCAATTTGGC ACGATTGGTG GCGGTGGCAG TTCGGCCAGT GCGACTGGCA ACCGCGTCTA TGATACCTAT AGTACGATCA GTGGCGGCTA TAACAATGTT GCGGGAGTCG ATGACACGGG CAACCAACCA TTTGCGACGG TCGGCGGTGG GTCGAGTAAT AACGCGAATG CGCTTGGGAG TATGGTTGGT GGCGGTCGTT CAAATAGCAT CAGTGCCATC GCTGACTACA GTGTGATTAG TGGCGGCTAT AACAATGTTG CCACAGGATT GTATGCGACG GTGAGTGGGG GAGGAAGTGC CTCAAGTGGC CAAGGAAATC GCGCCTACGA CAACTACAGT ACCGTTGCTG GGGGCTATAA CAATGTTGCA GGGATCGATG ATTCAATCGG GCAACCTTTT ACCACGGTTG CTGGTGGTGG CTCGAATACG GCCAGCGGTT ATGCGAGCGC AATTGGGGGT GGGCGCTTGA ATCAGGCGAG TGGTCAGTAT GCCTTTATTG GGGGTGGTGA ATCGAATACC GCTACGGGAG ATCACACGAC GATCGGCGCA GGCCGACAGA ACACGGCCAA CGGCAACTTT TCGTCGATTC TCGGTGGGAG TGGCAATAGT ACGTCTGCTG ACTATAGTGT AGCCGCCGGG GAAAATGCGG TTGCCGCCCA TCGTGGCAGT TTTGTCTGGG CCAGTACCCA AGCCGCGCCG GATGCCACGA TCACCAGTAC CGCCCCGGGC CAATTTATTG TGCGTGCCCC CGGTGGGGCG TGGTTTGGCA GCAGCACGCA GGTGGACATG CCGAATGGAG CCATCCTTGC GACGGATAGC GGAGCCTTCC TCAGCAAGGG GGGAACGTGG TCAAATTCGT CGGACAAACA TCGCAAAACC CAGTTTGCGG CGATTGATCC TCATGCCCTC CTTACCAAAC TGGCAGCGAT CCCGATGCAG TCGTGGAGCT ACATCAATGA AGATCCGCAG ATTCGCCACC TTGGCCCGAC GGCGCAAGAT 
TTTTATGCAG CCTTTGGTTT GGGCACGGAC GATCGGCATA TTGCGACCGT CGATGCGGAT GGAGTTGCCT TGACCGCAAT CCAAGGACTG TATCAGCTGA ATCGGGAGCA AGCGGCAGTG ATCACCGATC TCGAAACCCG CTTAGCCGCG CTGGAAACAG CCACCCCCTC GCCAGCGCGT TCTGTATGGC TGCTCGCTGG TGGGTGGGGC AGTCTGCTGC TGATCGTTGG CTGGCTCGTT GGCCGCCGGA TGCGACGTGG AGGGACGGTA TGA
|
Protein sequence | MGRIRIVALV FIVVLLGWFA PLYAQQSNDV IIQPHAFVGD TFTYQGLLMQ GTTYPSGTFD FQFSLYDDPT AGTLLGQLQQ DDVPVEAGQF TVALTFPEGS VTGHQRWLAI AVKTLNGSAY VPLNPRSAVS AAPIALSLPG LWTRQNDTSP NLIGGYSSNT VPANGVGMTI GGGGAFGNLQ QIYDHYGVIG GGANNRVGSD DGTVTNDGYA TISGGFGNTT TQEYTVIGGG QANTITGAFS TISGGTTNTI AHIYATIGGG MNNRVSAQFG TIGGGGSSAS ATGNRVYDTY STISGGYNNV AGVDDTGNQP FATVGGGSSN NANALGSMVG GGRSNSISAI ADYSVISGGY NNVATGLYAT VSGGGSASSG QGNRAYDNYS TVAGGYNNVA GIDDSIGQPF TTVAGGGSNT ASGYASAIGG GRLNQASGQY AFIGGGESNT ATGDHTTIGA GRQNTANGNF SSILGGSGNS TSADYSVAAG ENAVAAHRGS FVWASTQAAP DATITSTAPG QFIVRAPGGA WFGSSTQVDM PNGAILATDS GAFLSKGGTW SNSSDKHRKT QFAAIDPHAL LTKLAAIPMQ SWSYINEDPQ IRHLGPTAQD FYAAFGLGTD DRHIATVDAD GVALTAIQGL YQLNREQAAV ITDLETRLAA LETATPSPAR SVWLLAGGWG SLLLIVGWLV GRRMRRGGTV
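
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The sketch below, under stated assumptions, shows one way to strip the block formatting and compute a GC percentage that could be compared with the GC content row. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short example string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# Illustration only: the first two ten-base blocks of the gene sequence.
example = clean_sequence("ATGGGTCGCA TCCGTATCGT")
print(len(example))         # 20
print(gc_percent(example))  # 55.0
```

Applied to the full gene sequence, `len(clean_sequence(...))` should equal the listed gene length, and the cleaned string should begin with the `ATG` start codon and end with the `TGA` stop codon seen in the record.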