Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0544 |
Symbol | |
ID | 5732402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 630304 |
End bp | 633585 |
Gene Length | 3282 bp |
Protein Length | 1093 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641277671 |
Product | hypothetical protein |
Protein accession | YP_001543320 |
Protein GI | 159897073 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATGCTC AGCGTGTGGC TAATTTTGTT CAGGAATATC AACGATATAT CAATAATTCA GCGAATGAAG AGGAGCTTCG GAGCGGGTTT TATCTGGCTG CTACTAATGC TTTGGGTATT CCAAATTTTA CCCTAGAACG AGGCCGGCAG GATATCCGCC GGAATCGTGT CATTCTAGAG TTTAAGAATA AAGGCCTATT TCGAGGTCAA TCCACCAGCC TTAAATTTAA AGAGGCGCAG GATCAACTCA TTAATAGGTA TATCCCTCAG CAATCGCTTC GTGATGGGAG AAGGCCCTCA GATTATATTG GCGTGTGTTT TGATGGAGAA CATCTTGCCT TTACCTTTGT TGAACCAGAT CAAACGGTTC GGGTCACGAA ACTATTGCCA TTTGATGAAC ATAGTGCTGG GGCTTTGGTT ATGGCGCTTG ACGTTGATGA TCGGCGTGAA CTAACCCCAC AAAATGTTAT TGATGATTTT GGTCCATCTT CCTTTATTGC GCGACAGACC CTACAAGCAT TATGGCATCA CTTAAATGTA TCTCTTGATA TGGGTGTCAA GCGGATACAC ATGCTCTTTA CCGAATGGAA AAGTCTTTTC CAACAAGGAA CAAGCATTGG TACACGAGGG CGACAGAAAC TTCAGGCGTA TCTTCAGTCA GTGGGTTTAC CTGAATCTGC TGATATCACC CGTCTCCTTT TTGTCCTTCA TACCTATCAT GCACTTTTTT TCAAATTGCT TGCGGCTGAG GTTGTGCTGA CAAATGCTAT TATCCCGGGT ATGACCCAAA CCGATTTTTG TTTTTCTACC GCAGGACTTC CTGATCGATC CCTTATTAAG TTGTTGGTAG ATGATATCGA GGAATCAAAA GTATTTCGCC GTGTCAACAT CCTCAATTTT GTTGAGGGGA CTTTTTTTAC CTGGTATACA CATGAGGCTC CCGCAGGATT GATTGAAGCA ATTCGACAGA TTATGCAGCG GCTTAACCTT TATCGGCTAA GCGATCTTCA ACTGGAACGA ACACGCGATA TTGTGAAGTA TGTTTACGAG CAAATAGTTC CTGAACCGCT TCGTCATAGT CTGGGGGAAT ATTTCACGCC AGAATGGCTG GTTGAGTTTA CTCTTGATCG TGTTGGATAT CAAGGCTCCC AAATTTTGGA TCAGAAGATT CTCGATCCAT GCTGCGGCTC AGGCAATTTC CTTATTCATG CTATTGAGCG CTATAAACAG GCGGCTCACG CCCAAGGATG GGATGATTCT GCTATTCTTC ATGGCATTAC CAATCACATC TTTGGATTTG ACCTAAACCC TCTTGCCATG TTAACTGCAC GGGTTAACTA TCTTATTGCG ATTTCGGATC TCTTGAAAAC TTCATCTGCG GTTGAAATCC CTGTATATCA AGCCGATGCC GTTTACATTC CAAAGCCCCG ACCTGATGAT CCGACTATTT ATCGCTATGA TATTTCAACC CGTCTAACAG GGTGTCCTAT ACTCGCCCTA GATATCCCTG TATCCCTAAT TCACAAACAA CATTTGTTTG CACGAGTGCT TGAGGAGATG GAAGAGTCGA TAAATGAGCA CCAGCATACC TCTGCATTTA TTACACATCT TAAGAGAAAT CCTGAATTCT GTAGAGTAGA GGATCATCTT GACTGGATTC CATATCTGGA AAAAATGTTT AATGACATCC AGTTTCTTGA ATCACGCTCA TGGAATCGGA TATGGTGTCG GATTGCGCGA AATTACTTTG CCTCCGTTGC AATAGGGTCT TGCGATATCG TTGCTGGTAA TCCACCCTGG GTTCGTTGGT CTGAATTACC CCAGCTTTAT GCGGAGAAAA TTAAACCGAT TTGTGATGAA TATGGAATTT TCTCTGATGA ACGTTTTTTT GGGGGGAATG AGCTTGATAT ATCGGGGATG ATAACCTATA CCGTTATTGA CCAGTGGTTG GATGAGGGTG GTAAGCTGGC CTTTATTCTT CCACAAAATC ACCTGCAATC TCAGTCATCT GGTGGATTTC GTCAGTTTCA GATTCGCGAA ACCCCGTTAG AGGTTTTACA GGTTGATGAT TTTAGCGAGG TTAAACCATT CCGCCGCGTA GGTAATCGTC CCGCAGTGAT AACCATCTAC AAGGGGCGAG CTACAACCTA TCCTGTTCCC TATAACATAT GGAAACGTAT AACCCCAACA ACTATCTCTG AAAATGTAAG CTATGCGACT GCGGCTGAGA ATCTTCTATC GATACCTCAT GAAGCTTATG CATTAGAGGA CGCAGGAAAA CGTTGGAGTA TTCTTCCGCT AGGACGATTT CCTCTCCTTT CCATGTTGAA TGGTGAGGAC AGGTCAATTG AAGGCCGTAA GGGTATTGTT ACCGATTTGA ATGGGGCATA CTTTATTCGA ATTCTTGGCC ATGGTTATCG GACTGGAACT CTTCGTTTTA AAACTTCTCC AGATCAGGGT CAAAAACCTG TACCAGAGCG AACATATGAG ATTGAGGCTG ACCTTGTATA TCCTCTGATA AAAGGGGCAA AAAATGTTCA ACCATTTTAT GCCACCACAA GCGAGTTAGC CGTAATTGTT CCTAATAAGG GGATTAATTC ATCCTCTATG CCATCCATGT CTCGACTGAC TTATCAGGGG TATCCACTAG CAGCACGTTA TTTCCAGTCT CTGAATCGCG ATGTGCTCAT TGATGGTGTT GGCCTACTCG ATCAACGATC AACTTGGCGA ACCCGAATGC GGCCATTTTT GGAAAAACAA TATGCCGATA ATCTTGCCGA TATACCCTTT TATGCAATCT ATAACGTAGG AGACTATACA TTTGCTCCCT ATAAGGTTGT ATGGGCAGAA ATGTCTGGAA GTCTTAAGGC GGCAGTGATT TCTGATGGTC TTGTACCCTA TATTGAGCAG CGAAAGATTA TTATCCCTGA TCATAAAATA TACTTTGCAT CCTTTTCGAG TGAAAAATAT GCTCACTTTA TTTGTGCGCT ACTCAATTCA TCAATTATAC GTGAATTTGT GGATAGCTTC ACTATAAAAT TGCAGGTTGG AACAATATTT AAGCATCTTC GATTGCCTAA GTTTGATTTA GAAAATGCCG AACATCTATC ACTCGTTGAC TTATCCATTA CGGCACATAA CACTATGAAT AAAACAAATG GCAGTGGTAA TATTGAAGCA TTTATAGAAC AAATTGATAT CATTGCTGTA AGATTATTAG CTAAATTTGC TGCCACTATT ACCGATCAAC TACAACAAGA AGGGTTTAAT TTAGAATTTT AA
|
Protein sequence | MYAQRVANFV QEYQRYINNS ANEEELRSGF YLAATNALGI PNFTLERGRQ DIRRNRVILE FKNKGLFRGQ STSLKFKEAQ DQLINRYIPQ QSLRDGRRPS DYIGVCFDGE HLAFTFVEPD QTVRVTKLLP FDEHSAGALV MALDVDDRRE LTPQNVIDDF GPSSFIARQT LQALWHHLNV SLDMGVKRIH MLFTEWKSLF QQGTSIGTRG RQKLQAYLQS VGLPESADIT RLLFVLHTYH ALFFKLLAAE VVLTNAIIPG MTQTDFCFST AGLPDRSLIK LLVDDIEESK VFRRVNILNF VEGTFFTWYT HEAPAGLIEA IRQIMQRLNL YRLSDLQLER TRDIVKYVYE QIVPEPLRHS LGEYFTPEWL VEFTLDRVGY QGSQILDQKI LDPCCGSGNF LIHAIERYKQ AAHAQGWDDS AILHGITNHI FGFDLNPLAM LTARVNYLIA ISDLLKTSSA VEIPVYQADA VYIPKPRPDD PTIYRYDIST RLTGCPILAL DIPVSLIHKQ HLFARVLEEM EESINEHQHT SAFITHLKRN PEFCRVEDHL DWIPYLEKMF NDIQFLESRS WNRIWCRIAR NYFASVAIGS CDIVAGNPPW VRWSELPQLY AEKIKPICDE YGIFSDERFF GGNELDISGM ITYTVIDQWL DEGGKLAFIL PQNHLQSQSS GGFRQFQIRE TPLEVLQVDD FSEVKPFRRV GNRPAVITIY KGRATTYPVP YNIWKRITPT TISENVSYAT AAENLLSIPH EAYALEDAGK RWSILPLGRF PLLSMLNGED RSIEGRKGIV TDLNGAYFIR ILGHGYRTGT LRFKTSPDQG QKPVPERTYE IEADLVYPLI KGAKNVQPFY ATTSELAVIV PNKGINSSSM PSMSRLTYQG YPLAARYFQS LNRDVLIDGV GLLDQRSTWR TRMRPFLEKQ YADNLADIPF YAIYNVGDYT FAPYKVVWAE MSGSLKAAVI SDGLVPYIEQ RKIIIPDHKI YFASFSSEKY AHFICALLNS SIIREFVDSF TIKLQVGTIF KHLRLPKFDL ENAEHLSLVD LSITAHNTMN KTNGSGNIEA FIEQIDIIAV RLLAKFAATI TDQLQQEGFN LEF
|
| |