Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1365 |
Symbol | |
ID | 5733257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1577207 |
End bp | 1579984 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641278503 |
Product | hypothetical protein |
Protein accession | YP_001544138 |
Protein GI | 159897891 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACCTC TTGAGTTTGT GCGAAAATGG CGAGGTATTC AGGTCAACGA ACGACGTGCA TATTATGAGC ATTTTACCGA CCTCTGTTCT TTAGTTGGTG CTAAAACCCC GCTTGAGGAA GATCCCACAG GAACATTTTA TACCTTTGAG GCCGGTGTTA CAAAACTCAA TGGTGGAAAA GGATGGGCTG ATGTTTGGAA AAAAGGGTAC TTTGCAATTG AATATAAAGG CAAGCATGGG AATCTTAACC GCGCTTATGA TCAACTTCTC CAATATCGTG AGGCTCTATT AAATCCTCCA CTCCTAATCG TCTCCGATCT TGACAGTCTC GTTATTCATA CAAACTTTAC CAATACAGTC AAAAAGGTAA CAACACTTAC TTTAGATGAA ACATTAACAA GAAATGGCCT TGATACAATT CGAAGTATTT TTTATGCTCC TGATACCTTT CGATCTCCAG TCACACCAGA GCGGGTAACC GAAGAAGTTG CTGCAAAGTT CGCCCGCTTA GCCCAGCTTA TTACCCGCTA TGAAAAACAC ACTTCTCCGC AAGAAATTGC ACATTTTTTA ACACGATTAT TATTTTGTTT ATTTGCAGAA GATGTCAATT TACTTCCAAA GGATATCTTC TCTCGTTTAG TGACTCAAAC ACGTGGGAAA TCCTCTGCGT TTGCAGCCCA ACTCAGCCAG CTTTTCAATG TCATGACTAC AGGAGGATGG TTCGGCATTG AGGAGATACG GCATTTTAAC GGGTCGCTCT TTGATAATGC AACGGTTTTA CCCATGGATA GTGAAGCACT TGATATTCTA GTGGATATCT GCTCCTATGA TTGGTCATCG ATTGAACCAG CAATTTTTGG AACTCTGTTT GAACGTTCAC TCGATCCGGC GAAACGGAAG CAACTGGGTG CACACTATAC CAGTAAAGAT GATATTTTAC TGTTAGTCGA ACCAGTTGTT ATTCAACCTT TAAGGGAAGA ATGGAGTAAG CAAGAACAGG TTATCGAGGG ATTAGTTACC CAGCGTAATG AAACAGTGGG TAATGATGTA ACCAAAATTA ATCGGCAAAT TGAATTCCAT ATCAATACCT TCTTGCATAA ACTGAGATCA ATAAAAGTCC TTGATCCTGC CTGTGGAAGC GGCAATTTTC TCTATATCGC ATTAAAATTG TTGCTCGATT TGGAAAAGGA TGTTATTCGT TTTGGTGCTG ATGCAGGCTT GCCGTTACAA ATCCCACAGG TCAATCCAGA GCAATTTCTT GGGATGGAAG TTAATGCTTA TGCTCATGAG TTAGCTCAAA TTACGATTTG GATTGGCTAT ATTCAGTGGA TGAAAGAAAA TGGATTCGGA AATTTATCGG AACCTATCTT GAAGTCCCTC AAAACAATTC ATCGTATGGA TGCGATTTTA GCTTTTGATG CTGACGGGAA TAGCATTGAG CCTGCCTGGC CTCAGGCGGA TTACATTATT GGGAATCCAC CGTTTTTGGG TGGCAATAAG ATTCGGCAAG AGCTGGGTGA TGGCTATGTT GACGCATTGT TTAGCCGTTA TGCGGATCGT GTTCCTGCAT TTGCTGATCT GGTGTGTTAC TGGTTTGAAA AGGCTCGCGC AATGATTGGC AGTGACATAA CTCGACGAGC TGGTTTTATT GCAACCAATT CAATTCGGGG TGGGTCGAAT CGAAAAGTTT TAGAGCGTAT CAAAACGTCT GGTGATATTT TTATGGGTTG GTCAGATCGC CCGTGGATTC TTAATGGAGC GGCAGTACGG GTGTCAATGG TGGGATTCGA TAAAGGCGAA GAAATAATAC ATAGTTTGAA TGGGATGATA GTTCAATCCA TTAATGCAAA CCTAACCCAA GACATTGATA CAACTAAGGC TTTAATCCTT CCTGAAAATA AAAATATTAT TTTTGGAGGA ACAAAGAAAG GTGGAAAATT TGATATTACA CAAGGTATAT ATGATGTATT GATGCAGAGT CAAAATAATC CCCATGGCCG TCCCAATAGC GATGTAATTA AGCCATGGGT CAATGGACAA GCGTTATTAG GAAAGGGAGA AAAACGATGG GTTATTGATT TTGGCGTAGA TATGTCTCTG GAGGATGCAA GTCAGTATGA AAAAATATTT GAGTATATCA AAAAAGAAGT TTATCCTATT CGTATAAATA ATAGGATGGA GAGCAGAAGC AAGCATTGGT GGCTTCATTC TTTTACAGCC CCATCAATGC GGCATGCAGT CGCCACTATC TCAAGATATA TCGCTACGCC TCGGGTATCA AAATACCGAC TTTTTGTATG GGTTGATAGC AACACAATCC CAGATGATGG GACATATATT GTTGCCCGCG ACGATGACTA TTTTATGGGT GTGTTGCATT CAAAAATCCA TGAGTTATGG GCATTGCGGC AAGGAACATT TCTTGGGGTT GGAAACGATC CACGCTATAC CCCAACCTCA ACCTTTGAAA CCTTTCCCTT CCCATGGCCA CCAGCGAAGG AACCCAAGGA TTCACCGCTG GTTAACGCCA TTGCCGAGGC AGCAAAAGAG TTAGTTGAGA AGCGGGATCG GTGGCTGAAT CCGGCTGGAG CAACCGAGGC CGATTTGAAA AAGCGCACGT TGACCAATCT CTATAACGAG CGGCCAACGT GGCTCGATTT GGCGCACAAA AAGCTGGACA AAGCGGTGTT TGCGGCCTAT GGGTGGCCGG ATACCTTGAC CGATGATGAA ATCCTTGGCC CTTTGCTGGT CCTCAATCAC GATCGGGCGG CAGGCTAG
|
Protein sequence | MQPLEFVRKW RGIQVNERRA YYEHFTDLCS LVGAKTPLEE DPTGTFYTFE AGVTKLNGGK GWADVWKKGY FAIEYKGKHG NLNRAYDQLL QYREALLNPP LLIVSDLDSL VIHTNFTNTV KKVTTLTLDE TLTRNGLDTI RSIFYAPDTF RSPVTPERVT EEVAAKFARL AQLITRYEKH TSPQEIAHFL TRLLFCLFAE DVNLLPKDIF SRLVTQTRGK SSAFAAQLSQ LFNVMTTGGW FGIEEIRHFN GSLFDNATVL PMDSEALDIL VDICSYDWSS IEPAIFGTLF ERSLDPAKRK QLGAHYTSKD DILLLVEPVV IQPLREEWSK QEQVIEGLVT QRNETVGNDV TKINRQIEFH INTFLHKLRS IKVLDPACGS GNFLYIALKL LLDLEKDVIR FGADAGLPLQ IPQVNPEQFL GMEVNAYAHE LAQITIWIGY IQWMKENGFG NLSEPILKSL KTIHRMDAIL AFDADGNSIE PAWPQADYII GNPPFLGGNK IRQELGDGYV DALFSRYADR VPAFADLVCY WFEKARAMIG SDITRRAGFI ATNSIRGGSN RKVLERIKTS GDIFMGWSDR PWILNGAAVR VSMVGFDKGE EIIHSLNGMI VQSINANLTQ DIDTTKALIL PENKNIIFGG TKKGGKFDIT QGIYDVLMQS QNNPHGRPNS DVIKPWVNGQ ALLGKGEKRW VIDFGVDMSL EDASQYEKIF EYIKKEVYPI RINNRMESRS KHWWLHSFTA PSMRHAVATI SRYIATPRVS KYRLFVWVDS NTIPDDGTYI VARDDDYFMG VLHSKIHELW ALRQGTFLGV GNDPRYTPTS TFETFPFPWP PAKEPKDSPL VNAIAEAAKE LVEKRDRWLN PAGATEADLK KRTLTNLYNE RPTWLDLAHK KLDKAVFAAY GWPDTLTDDE ILGPLLVLNH DRAAG
|
| |