Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0022 |
Symbol | |
ID | 5736856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 26163 |
End bp | 28562 |
Gene Length | 2400 bp |
Protein Length | 799 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277143 |
Product | hypothetical protein |
Protein accession | YP_001542802 |
Protein GI | 159896555 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTCGC AGCAACAGTT CACCGCTGCC TCGCATTTGA CTGCCGATAA TTTGCGTAGT ATTGCGCATT CGATCTCGAT CAGTCAGGTG TCGCGGTTGT CGTTGCCGGA GATCGATGCC GTTGTTGACC AAATCTCGCG GGTCGTGCCT GCTGGTAATG TCCCCGGGGT GATTTTAAGT GGCTTGGCGA AGTTGACGGG CCGTCGTCCA GCAGGCAATG TCATTAAGCG CGATGTAAAT TTGCTGTTTC GTGGGGTTGA GCAAGCCCTC GATAAAGCGG TGTTTAGCAC GTTCTTTGCT GGGCCGGCTG CGGTTATTTG GGGTTACCAA AAACTGCTCG AATTGGCTGG CAAAGATCCG CAAGATGCGT TTCCGGAAGG CACATGGCAG TTTTATGTGG GCTATGCCTT GCGCGAAGAT ACTGCCCGCC ATGCCAACGA AACGATTGGC TTCGATGAAA CCCTGAACGA TCATAAAATT AATTTGCCAC CAATCGACCG CATGACCGCT TGGGTTATGG CGGCGATTCA TATTCTGCAT AGTTACCCCG ATTTGCTCGA AAACGAATGG CGTGAGCGCG TTTCGTTGGC CTTGCTGCGC GATCTGACCA AGGTTAATCC TGAAACTCGT CAGTTTGCCG ATTTATATAA TCAATGGGAG CGCCAACGCC CTTATGGCCG TGGCCCCGAT GTTCAATCGC AAGAAAATTA TGCGCTGTAT CGCAAACGCA AATTCGATGA ATTTATGGCC GAATCTACCC GCGATTTGCC CAAAGAAATT CGCGAACGTT GGGGCAAACA GTTTCAGCGT GCTCGCGAAA TCGCCTTGCC AGCCTATCAA CGCCAAATGG CCCTGGCAGC CTACCTTGAT CCCACACCCT ACAACGAAAA TATGGTAGCT TTGCCACGCC AAAGTTGGCA TATTGGCTTA ATTTGGCGCG GCCATTATTA TTTGATTCCT GCTTGTGCGC CCAATAGCAC TCGACCCAAT GATGTAAGCA GTGTGCGCAG CCAAATTGCT GCCTTGTTAG CCAGCCCAGC CAACCATGCC CCAACTTCAT TAATTCCTTT GGCAACCACC AAACGCACAA TCTTGCCCAG CATTTTGGGT AAGTTGCGGC CTGAAACCAG CCAACAACTT GAGGCCTTGC GTTGTGCACC AATTTGGTTT AATGCTGATG GCCGACCGCG CCATTTGCCC TTAGCCGAGT TGCGCCTGAC CGAACGTGGA CTAGGCGACC ACGGCCTGAC CTTGATCGAT ACTGGCTCAA GTATGGTCTT TGATCAATCG CATATTTTCT TCGATGGCGC TTGGGGTTCA GCCGTCGCCG AAATTATGAC CCTCGAAGCC TTGGCTTGGG CGGTCTATTT GCGTGGCCAA CCAGCGCCAG TCGCGGGTAC GGTACGCCCC TATGCACCCA ATATTGAACT CAACGACGAA GAAAAACAGA TTCTCGCTGA TAGCCCCAAG ATTGTGGCTG AAGCCAGCGC CGAATCAATC GGGGTTGATC TGAAGAAGAT TTTGGAATTG CGCAAGTTGT TCAAGCAGCG TAACGACCAA ATTCGAATTA CAGTTAACGA TATTTTGGTG CTCTATCGGG CAATTCATGC GGTTTCCTAC AAGCCTAATC CAGAGTTGCA AGCGTCATTG CAAGAAGCCT CCAACGATGC CAATCTCAAA GCAGCGGTCG AAGCAACGAT CACGGCGTTC GAGGAATCGC TGGCTAATCC GGCAATTCTC ATTCCGGTTG ATGGCAGCAT TCCCAATCCC AGCGATCGAC TGCACCCCAT GACCTTCGAG GTTCCGCTGG AAGAGCTGGA AATTAGCAAA TTGCATGAAC GCGCATTAAG TTTGCTCGAT CAAGCGCGGC AAGAGTGGCG AGCGGAAGTC TGGGATACCT TTGAGGCAAC CCAAAAGCAT TATTTGGCGA CGATTGCTGG CTTTGGCGAG GTTTCAGCCC GCGCCAAAGA TATTGCCCAA TCGGGCGAAA GCACTTCGTC AGGAGCCTTG CGCTTGTTGG CCCACGTACC GATGGCTTTG CAACGCTTGC TCGATGCGAT TCCTGGTAAG TTCGATGTGC TCAACGATTT GATCAAAGGC CGCGAGGTGC TTTCAAATGT GGGCGCGGTG GCCGATACCA GCTCATTAAC CCGTTTTATC ACTGCCAAAG ACGATAACGA GAAAAAAACC TTGGCTTGGG GTGTTATCAC CGATGCCAAT GGGGTAATGC ACGTTTCGCT ACGCGACTTT CGCCCACATG TTGGCTTGTT TGTGGCTGCT GGACGACGCG ATTTGGCCCG CCGGATCGCC AACGATTATC TTGAAAGCTA TGTTAATGGT CTCAATCGCT TTATCAGTGA ATTGACCAAA ATTACCCAAG GCCGCCATAG TCGCCAATAA
|
Protein sequence | MDSQQQFTAA SHLTADNLRS IAHSISISQV SRLSLPEIDA VVDQISRVVP AGNVPGVILS GLAKLTGRRP AGNVIKRDVN LLFRGVEQAL DKAVFSTFFA GPAAVIWGYQ KLLELAGKDP QDAFPEGTWQ FYVGYALRED TARHANETIG FDETLNDHKI NLPPIDRMTA WVMAAIHILH SYPDLLENEW RERVSLALLR DLTKVNPETR QFADLYNQWE RQRPYGRGPD VQSQENYALY RKRKFDEFMA ESTRDLPKEI RERWGKQFQR AREIALPAYQ RQMALAAYLD PTPYNENMVA LPRQSWHIGL IWRGHYYLIP ACAPNSTRPN DVSSVRSQIA ALLASPANHA PTSLIPLATT KRTILPSILG KLRPETSQQL EALRCAPIWF NADGRPRHLP LAELRLTERG LGDHGLTLID TGSSMVFDQS HIFFDGAWGS AVAEIMTLEA LAWAVYLRGQ PAPVAGTVRP YAPNIELNDE EKQILADSPK IVAEASAESI GVDLKKILEL RKLFKQRNDQ IRITVNDILV LYRAIHAVSY KPNPELQASL QEASNDANLK AAVEATITAF EESLANPAIL IPVDGSIPNP SDRLHPMTFE VPLEELEISK LHERALSLLD QARQEWRAEV WDTFEATQKH YLATIAGFGE VSARAKDIAQ SGESTSSGAL RLLAHVPMAL QRLLDAIPGK FDVLNDLIKG REVLSNVGAV ADTSSLTRFI TAKDDNEKKT LAWGVITDAN GVMHVSLRDF RPHVGLFVAA GRRDLARRIA NDYLESYVNG LNRFISELTK ITQGRHSRQ
|
| |