Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1958 |
Symbol | |
ID | 5733847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2383832 |
End bp | 2387833 |
Gene Length | 4002 bp |
Protein Length | 1333 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279102 |
Product | hypothetical protein |
Protein accession | YP_001544729 |
Protein GI | 159898482 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATA CGGCGATTAT CACCGAGGGC GGGTTGTTGC CTGCCGATAT GCTTCAAGCG ATTGCTGAGG GTGAGCAGGG CAGCCTCGCT GGTCAGCGCC CAGCCGATTT TGGCTTGCCT GCCAATCGCC GTATGAGCGA TGATATTGCC GCTGCTTGGG GCCAGGTGCG GGCGCAATGG CAGATTTTTC AGGCGGCAAT CGAGCGCCGC CCCCAAGATT CGCACACAAC CTTGACCCGC CGTTATTGGG TCGAGCCGTT TTTGCAGTTG ATCGGCTATG AGCCAACCAG CACCAAAAGC GCTCGCCGCG TTGATAATCG CACCTATGCT ATCAGTCATT CCGCCGATGA GCACGATGAT TCGCCGCCTA TTCATATCGA GGGGATTAAG ACTGATATCG ACCGCCGCCC CGAAAGTGGT CGCCCACGGA TTTCGCCCCA CGCTTTGATG CAGGAATATC TCAATAGCAC TGAACATACT TGGGGGATTG TGACCAACGG CAGGCGTTTG CGTTTGCTGC GTGATTCGTC GCAAACTACC CGCCCAAGTT TTGTCGAGTT CGATTTGGAG TCGCTGGTGA CTGGCCAGTT GTTCAATGAG TTTGCGCTGC TTTATCGGAT TTTGCATCGC ACGCGCTTGC CAATTACCAG CGCCGATACC GCCCAATCGT TGCTCGAACA GTATCATCAG CAGGCGCGTG AGGCTGGCGG TCGGGTGCGC GAGGGCTTGC GCGAAGGGGT TGAACGGGCG CTCAAGTTGC TGGGCCAAGG CTTGTTGCGC CATCCACGCA ACAGCGATTT GCGCCAGCGC TTTGCCACCA ATCAATTAAC TCCGCTGGAA TATTATCGCC AGTTGCTCAA GTTGGTCTAT CGCTTGCTGT TTTTGATGGT CGCCGAGGAT CGCGGGTTGA TCGAGGCCGA AACTGCTAGC GATAAATTAT CGGAGTTGGC GCAGCGTGGC ACGCCCAGCG AACGGCTGAA ATTGTATTAT GAGCATTATA GCGTTGGCCG TTTGCGGCGT TTGGCCGAGG TGCGCGGCGC TGGTCGCGGC CCTTACGATG ATATTTGGAT GGCGCTGCAA CAAACGTTTC GGATTTTTGA GGGTACTGAT CTTAAAGCCA ATCGCTTGGG CATCGCAGCG CTCGATGGCG ATTTGTTTGG CGAAGGCGCG ATTGGGGCGC TCGAAACTGC TCATTTGCGT AATGCCGATG TTTTAGCGGC GCTGCGGGCG CTCTCGATCT ATGCCGATCC GCAGTCGCGG GCTTTGCGGC GGGTCAATTA TGCGGCGCTC GATGTCGAGG AGCTGGGTAG TGTCTATGAG TCGTTGCTCG ATTATCGTCC GGTGGTCGCT GGCACAAGCT TCGATTTGGT CGCTGGCACC GAGCGCAAAA CCACAGGCTC GTATTACACT CGCCCCGAAT TGGTGCAGGA GCTGATCAAG AGTGCGCTTG AGCCAATTAT TGCTGAACGC TTGCGCGATA AAAACCCGGA ACAGGCACTG CTCTCAATTA CGGTCTGCGA CCCCGCTTGT GGTTCGGGCC ACTTTTTGCT GGCCGCTGCT CGCCGCATTG GGCGCGAACT GGCACGGGTG CGCTCCGGCG AGGATCAGCC AACGCCCGAT CAGTTTCGCC ATGCGGTGCG CGATGTGATT ACTCACTGTA TTTATGGAGT CGATTTCAAT CCGTTGGCGG TCGATTTGTG TAAATTGGCG CTGTGGATCG AGGGCCATTG CGCGGGCATG CCGCTTTCGT TTATTGATTA TCATATTCGT TGGGGCAATA GTTTGGTTGG CGCAACTGAA GAACTGGTTA ATCAAGGGAT TCCCGATGAT GCGTTTAAAC CGGTGACTGG CGACGATAAA ACGATCGCGA GCAATTTGCG CAAACGCAAC AAGCGCGAAC GTGAGGATAT TGCCAGCGGC CAAATTACCA TGAATCTTGC GCCCAGCCAG CTTGATCATG CGACGCTCGG TCGGGCCACA CGCCAACTTG AGGCCTTGCC CGATGATAGT GTGGCGGCAG TGCGGGCCAA AGCGGCTCGC TACGCCCGCA TGCGCGAGCA AGAACGCCCA AACTGGACGC GCTACAATCT TTGGACGGCG GCTTTTTTCC AGCCGATTAC CAAGGATACG CTGCCGCTGA TTCCAACTAG CGCTACCTTG CACGCCTTTG ATACGGCCCG CCAAAGCGTC AGCGCTGGCC TGCTGGCTTG GGTCGATGGC CTTGCCGACC AGCCTGAAAT GCGCTTTTTT CATTGGGAGT TGGAGTTTCC GCATATTTGT GGCGAAGGTA GCCCGCGTGG TTTTGATGTG ATTTTGGGCA ACCCGCCGTG GGAGCGGATT AAGCTGCAAG AGCAAGAGCA TTGGGTCGAT GTAGCCGAGA TTCGCGAGGC GGCCAATAAA GCGGCGCGTG AGAAGTTGCT CAAGGCGTGG GCCAGCAGCA GCGAACCAAG CAAGCAACAG CGTTATGCCA AATTTGAGCA TGCCAAATAT ATTGCCGAGG CTGCTAGCCG CTTTATTCGG GTTTCGCAGC GCTACCCACT GACGGCGGTT GGCGATGTTA ATACCTATGC CTTGTTTTCC GAGCTTGATC GCGATTTGAT CAATCGTAAA GGCCGCGCAG GCATTATTGT GCCAACTGGC ATCGCCACCG ATGATACAAC TAAAGCCTTT TTTGGCGATT TAATCAAGAA ACAATCATTA GAAAGGTTGA TTGGTTTTGA AAATGAAGCA TTTATTTTTC CTGAAGTACA TAACGCTTTC AAATTCTGTG CACTTACAAT GGTAGGAAAT GATATTTCTA GCGAAACCCC TGACTTCATT TTCTTATGTA GGTATTTTAG TGATATAGAA CAGGATGCTA GACATTTTAA TATGACTAGC GATGAATTCG CCTTAATTAA TCCCAATACT TTGAATTGCC CTATATTTCG TACTAAAACT GATGCACAGT TAACGAAAAA AATTTATCGG ATTGCCCCAA TCTTAGATAA CCAAAAAACG AAACGAAATC CTTGGAATAT ATCGTTTGGT ACAATGTTTC ATATGGCAAA TGATAGTGGT TTATTTAAAA ACGAATCCTC ACGCGATAGA ATGCCTTTAT ATGAAGCAAA AATGATATGG CAATTTGATC ATCGATTTGC ATCACTCATA GGCAAAGAAA ATGCAGGCAA CAGATTATCC AGAAAATATG AAGGCTGGTA TGGTGCAGAT TATGGCAACC CAGAAGATCT TCCAATTCCT ACATACTGGA TTGATAGAGA GAGTATAGAG GATCGTATTC CAAGTAAGCA TCAAAATAAG TGGTTATTGG TATTTCGTGA TATTACTAGC AGTGTTGTTG AACGAACGGC GATTTTTAGC CTGATTCCAC GAGTGGCGGT AGGCCATACC GCACCTTTGA TTTTCCTAAC AGATATTAAT TCTAGTTTGT TTTCCTGCTT CTTAAGCATA GTTAATAGCC TTTGCTTTGA CTACATAGTA CGACAGAAGA TTGGTGGCAC ACATTTAACC TTTGGCTATG TCAAACAACT GCCCGTGCTG CCACCCGAAC GCTTTGATGC AGCCCAGCTA GCCTTCATCG TGCCACGGGT TTTAGAGCTG GTCTATACCG CGTGGGATCT GCAACCGTTC GCCGCAGATG TTTGGGCCGA ACTTGATGAA ACGGGGCGGC AGGCACTTTT AGCCCAAAAC GCCGAGTGCA ACCGAGATGC GCCGCCGGAG TGGTTCAGCC CACGTGATGG TTTTGCTTTG CCACCCTTCC GCTGGAGCGA CGAACGGCGG GCGGTGTTGC GAGCCGAGCT TGATGCGCGG ATTGCGCGAT TGTATGGGCT AAGCCGCGAC GAACTGCGCT ACATCCTCGA TCCGGCTGAA GTCTATGGCC CCGACTTCCC GGGCGAAACC TTCCGCGTGT TGAAAGAAAA AGAGCTGAAA CAGTATGGTG AGTATCGCAC GCGGCGTTTA GTGCTCGAAG CGTGGGATGG GGAGGATGCC CTCACCCCCT AG
|
Protein sequence | MKYTAIITEG GLLPADMLQA IAEGEQGSLA GQRPADFGLP ANRRMSDDIA AAWGQVRAQW QIFQAAIERR PQDSHTTLTR RYWVEPFLQL IGYEPTSTKS ARRVDNRTYA ISHSADEHDD SPPIHIEGIK TDIDRRPESG RPRISPHALM QEYLNSTEHT WGIVTNGRRL RLLRDSSQTT RPSFVEFDLE SLVTGQLFNE FALLYRILHR TRLPITSADT AQSLLEQYHQ QAREAGGRVR EGLREGVERA LKLLGQGLLR HPRNSDLRQR FATNQLTPLE YYRQLLKLVY RLLFLMVAED RGLIEAETAS DKLSELAQRG TPSERLKLYY EHYSVGRLRR LAEVRGAGRG PYDDIWMALQ QTFRIFEGTD LKANRLGIAA LDGDLFGEGA IGALETAHLR NADVLAALRA LSIYADPQSR ALRRVNYAAL DVEELGSVYE SLLDYRPVVA GTSFDLVAGT ERKTTGSYYT RPELVQELIK SALEPIIAER LRDKNPEQAL LSITVCDPAC GSGHFLLAAA RRIGRELARV RSGEDQPTPD QFRHAVRDVI THCIYGVDFN PLAVDLCKLA LWIEGHCAGM PLSFIDYHIR WGNSLVGATE ELVNQGIPDD AFKPVTGDDK TIASNLRKRN KREREDIASG QITMNLAPSQ LDHATLGRAT RQLEALPDDS VAAVRAKAAR YARMREQERP NWTRYNLWTA AFFQPITKDT LPLIPTSATL HAFDTARQSV SAGLLAWVDG LADQPEMRFF HWELEFPHIC GEGSPRGFDV ILGNPPWERI KLQEQEHWVD VAEIREAANK AAREKLLKAW ASSSEPSKQQ RYAKFEHAKY IAEAASRFIR VSQRYPLTAV GDVNTYALFS ELDRDLINRK GRAGIIVPTG IATDDTTKAF FGDLIKKQSL ERLIGFENEA FIFPEVHNAF KFCALTMVGN DISSETPDFI FLCRYFSDIE QDARHFNMTS DEFALINPNT LNCPIFRTKT DAQLTKKIYR IAPILDNQKT KRNPWNISFG TMFHMANDSG LFKNESSRDR MPLYEAKMIW QFDHRFASLI GKENAGNRLS RKYEGWYGAD YGNPEDLPIP TYWIDRESIE DRIPSKHQNK WLLVFRDITS SVVERTAIFS LIPRVAVGHT APLIFLTDIN SSLFSCFLSI VNSLCFDYIV RQKIGGTHLT FGYVKQLPVL PPERFDAAQL AFIVPRVLEL VYTAWDLQPF AADVWAELDE TGRQALLAQN AECNRDAPPE WFSPRDGFAL PPFRWSDERR AVLRAELDAR IARLYGLSRD ELRYILDPAE VYGPDFPGET FRVLKEKELK QYGEYRTRRL VLEAWDGEDA LTP
|
| |