Gene Information |
Locus tag | Haur_5250 |
Symbol | |
ID | 5737208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 20302 |
End bp | 21900 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641282414 |
Product | hypothetical protein |
Protein accession | YP_001548005 |
Protein GI | 159901760 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.879154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGCC GTTGGCTTGG GGTTGGTATG CTCGTGTTCT TGGCAGGCTG TAGCCGCCTT GCGCCAAGCC AGCTTGCCGG AACCGCAGCG GTCTTGGGCT GTTGGCCCTA TGGGTTTGAG CCGCCGCCAC CGACGGCGAC CAATCCGGTG ATGCTGTCAC CACCGCCGAG CGGCACGGGA ACGCCATTGC CCACGTTGAC CGCTAGCCCG ACCGCCGCGC CAACCATGCC TGTCTGTACC CCCGCCCCGA ACACACCAAC CCTGACCCCC AGTCCTACGC CCTCGCCAAC CCCATGGACA CGACCCACCG CCGCCCCACC TGGGGGCAAA GGCATGCCGC CGCTGAATCT CTCGAACATG CCAGGCTACG ACCAAGACCC CGCGATTGCC GTCCACCCCA CGCAGGGCTG GGCCGCTGTG GTCTGGTCAA ATTGGCTGAA TGAGTTTCCG CAGGAGGCGA CGGTGCTGGT CAAGGTGCAA GACCCACAGA CCAAAACATG GCGGCAGGGG ATTGGGGTGA ACACGGCAAC CGTCACCAAA GGCGCAGGCG CACCCGCAAT TGCCATCGAT GCCCAAGGTC GCATCCATGT GGTCTTTGCG CAAAATGGGC AGGTGGTGAT CACGAGCAGC AGTGACGCGG GCCGAACGTG GACACCGCCC GAACCCATCC CACTGCCCAG TGGCAGTCAG GGCGGGCGGA TGTTTCAGGT GGCAGTTGAT GCAGTGGGCC AGCTGCATGT GTTTTTTATC AGTGCGGATG CCTGTTTTGA TTGCTTTCAC GCGGTGCATG CCCAACGCGC CAGCGATGGC AGCGGGCCAT GGGTATGGCA GGATTGGCTG ATCAATGACT CCAAACAACT TTATGGTGAC ATCGCCACCG TGCCGTTGGC CAATGGCACG ATCCGCACGG TCGTGGCGAT TGGGGTGGGC GATGGGGTGC GCATCGTGAC GCAGGACGGA CGCAATGGCC CATGGGTGGC GCGACCGTTG TCATTTGGCG GGTTGCCAAT CCAGCCGCAG GTCGTGGCAT GGATTGACCT GGTGGCCTTT ACCGATCAGG CGGGTCAGGC CCAGGTCTGT GTCAGTTGGG GCCAATATAG CAAAAGCGGG GTGTTTGTCG CCTGCTCGCG CGACGGTGGC CAAACGTGGG ATGTGCCGGA GATTCTGGCC ACCCACGCGG CACCAGGAGC CGCGCCAACG CCCGATCCCG CCGCGCCAAC GCCCGCCTTG GAAGACAATC CCAGTCCCAG TGAGGGCAGC GGCCAGCGCG GCTTTCACCC CGAACTCTTG TATGAACCCG CGACGGATAG CCTGATGGCC GTGTGGAGTC TGCTCGATGG CAGCGCCTCA ACCATTGTCT ACAGCTATCG ACCTGCCCAG GGCGGGGCAT GGTTGCCCGT GATGAATACC ATCACCACGG AACCCGCATT GGGCGTGTTT GGCGCAACCC GCCGCAGTGC CGCCCGCAAT CCGCGCTTAG CCTTTGCTGG ACAGGGCGTG GCGATGGTCG CGTGGATGGA AGTGGAGCGC GATGAAAACC TTGAGGTCTA TGTCGGCGGC TTTCTGCCCG CCACCCTCTT AACCCGCGCC GAGAACTAA
|
Protein sequence | MSRRWLGVGM LVFLAGCSRL APSQLAGTAA VLGCWPYGFE PPPPTATNPV MLSPPPSGTG TPLPTLTASP TAAPTMPVCT PAPNTPTLTP SPTPSPTPWT RPTAAPPGGK GMPPLNLSNM PGYDQDPAIA VHPTQGWAAV VWSNWLNEFP QEATVLVKVQ DPQTKTWRQG IGVNTATVTK GAGAPAIAID AQGRIHVVFA QNGQVVITSS SDAGRTWTPP EPIPLPSGSQ GGRMFQVAVD AVGQLHVFFI SADACFDCFH AVHAQRASDG SGPWVWQDWL INDSKQLYGD IATVPLANGT IRTVVAIGVG DGVRIVTQDG RNGPWVARPL SFGGLPIQPQ VVAWIDLVAF TDQAGQAQVC VSWGQYSKSG VFVACSRDGG QTWDVPEILA THAAPGAAPT PDPAAPTPAL EDNPSPSEGS GQRGFHPELL YEPATDSLMA VWSLLDGSAS TIVYSYRPAQ GGAWLPVMNT ITTEPALGVF GATRRSAARN PRLAFAGQGV AMVAWMEVER DENLEVYVGG FLPATLLTRA EN
|