Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5278 |
Symbol | |
ID | 5737236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 65626 |
End bp | 68325 |
Gene Length | 2700 bp |
Protein Length | 899 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641282442 |
Product | type III restriction protein res subunit |
Protein accession | YP_001548033 |
Protein GI | 159901788 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACTTA ATGAAGCGGA TACCCGCGCC CAGCTTATTG ATCCGAAATT AAGTATTGCA GGATGGACTC GCACGCAAGT CACTCGAGAG CAATACTACT TGACTGATTG GATATATACT GCTGGTAGAG TTGTTTTGCG AGGAGAGCGA GCGGAACGCT TACAACCTAG GCGGGTTGAT TATGTATTAC GCTATACGGA TAGCTTCCCT TTAGCAATCG TTGAAGCAAA GGATGAGGGC AAACCTGCTG TGGCCGGACT AGAACAGGCC AAGCGATACG CTCGTGAATT GGGACTTATG TTCGCCTATG CTACCAATGG ACACGAAATT ATAGAGTGGG ATAATTTTAC CAATACTTCT ACATTGGTTG AGTCATTTCC TAGTCCAAGC CGTCTTTGGG ATCGTTGGTG TCGCAACATA GGGATTGAAG ACCCTACCCT CCAACCAAGA TTAACCAATG ATCTTAGAGA ACTACGACCG CTTTATAGTG CTGATGATGC TCAGGTACGA CGCAGAAATC CGCTCCTTCA CCCCTATGCC CCTGAAGATG TGACGCGTGG CAAAATACCT CATTACTATC AAGAAACGGC TATTCGAGAG ATTCTGTTGC GCATTATTCG AGGTCAGCGC CGTATTCTCC TTACAATGGC TACGGGGACT GGAAAAACGC ATACTGCTGT TCAGCTTATG TGGAAACTTC TTCAGTCAGG TTGGCTTACG GGACGGCAAA GTTCACAGCA GGGCCGTATG TTATTTCTCG CAGATAGGGT GATCCTCAGA GATCAGGCAT ATAATGCATT TAGTCCGTTT GCAAGTGGTG CCAGCGAGCC TCGCTTTTTA TTAGATGGGC AGCGTCCGCT ATCACTGAAT AGAGATTTAT ACTTTGGAAT TTATCAAACT TTATGGAATG AGAATGATCA AGGAAAACGA TTATTTGAGT TGTTTCCTCC GAATTTTTTC GATGTAATCA TTATTGATGA GGCTCATCGT TCTGGATTTG GCACATGGCG CGATATTCTT GATCATTTCA CCAGCGCCAT CCAGTTGGGG ATGACCGCGA CCCCGAAACA GGATGAAAAT ATCGATACCT ATGCCTATTT CTGTGCTGAA GAGGTATTGA CCTCGGTTAA CCCAGATAAC CCTGATGGCG ATCAAATTCG CCAAGCTGCA TATACATACA GCCTTGGTCA AGGTATTGAA GATGGGTTTC TCGCAACCTA TAAAATTCAT CGAGTAAGAA CCAGCTTAGA TCGTGATGGG TTTCGATTAC AAGATGCGAT TGAACAAGGT GCGGAGATAG TTATCCCTAA TGGAGTTGAA CCACGCGATC ACTACTTAAC ACCACAGTTT GAACGTGAGA TCCGTCTGCC TGATCGAACC AAGGTTATCG TCAATCATCT TTCCCAACGG CTACGGCTCT TTGGTCCATT GCAAAAAACG ATGGTTTTTT GTGTAGATAT GGAGCATGCG CAGGAAGTTG CAAGGCAGCT CAATAATGAA TTTGCTGATC TGGGATATGG TGACAACTAT GCAGTCCCTA TTGTCAGTGA AGAAGGTGAT CAAGGCCGAC GTTGGTTGAG TTTGTTCCAA GATAGCGATC GCCAGCTTCC AGTTGTTGCA ACAACAGCTG AACTCCTTTC AACTGGTGTT GATGTTCCTT CAGCTCGCAA TATTGTTTTT ATGAAGACAG TTGCTTCACC GATTGTATTT AAACAGATTA TTGGGCGTGG AACCCGGATT GATAGCAGTA TCGATAAATT GTGGTTTCGT ATTATTGATT ATACCGGAGC TACGCACTTA CTTGATCCTT ATTGGGATCA TCCACCCTCA GCGGTAATAC CCTCAACAAC CCAGCCTATG ACTTCAATTG TTACTGGAAC TGTGACATTG GCTGGGTCAG GTAACCCTGT TGTTGGGGCG GCGATTGCAA TTCAGGTTGG GCCAAATGAC CAACGTGGAC CAATTTTATC AGATGCCAAT GGATGTTTTT CTTTTACGAG CCTGCCTGCT AGTAACATGA CGCTTGTTGC TAGTAAACCA GGATTACACC GTCGGCAAAT AAGCCTAATG ACAGAACCTA ATCTTGCAAC ACAATGTGCT ATTGAACTGA AGCCAATTGG GGAAAGTGCT GGTAAAATTG AGGCACATGG TCTGCATGTC GCGATTGCAG ATGAAGCAAC ATTTATCGTC GAGGGCATGA ATGAACCAAT GACCTTGGAA CGCTACCTCG ACTATAGCCG ATCTAAGATT ATCGGCTTTG TATCAGAGCG GAGCAAGTTG CAAGTAATCT GGCAAGATCC TACGCAGCGT CGAGTATTTA TTGAACAGCT TTCACACCAG AGCGTCCATT TAGAAGTCCT TGCCGATATT TTCAAAGCAC AGGAAGCAGA TCAATTTGAT CTCCTTAGTC ATTTGGCCTA TAGAACACCG CTCCAAACCC GAGTTGAACG TGCCACCGCT TTCCACAGAC GCGAGCAAGC GTGGTTGGCA GCTCAATCTG AACCCATTCG TGAGGTTATA ATAGAACTTC TCGCCAAGTA TGAGCTTGGC GGACTTAACC AGATTAGTGA CCCAAGTATC TTTAGAGTTA GTCCTTTCCG TGAGATGGGT GAAGTGCGCG GAGTTATTGC GCGATTCGGT GATGCTCAGC GGTTGCGCGA AACCATTGAT GAAATTCAGC GACGTTTATA CGCCGCATAG
|
Protein sequence | MPLNEADTRA QLIDPKLSIA GWTRTQVTRE QYYLTDWIYT AGRVVLRGER AERLQPRRVD YVLRYTDSFP LAIVEAKDEG KPAVAGLEQA KRYARELGLM FAYATNGHEI IEWDNFTNTS TLVESFPSPS RLWDRWCRNI GIEDPTLQPR LTNDLRELRP LYSADDAQVR RRNPLLHPYA PEDVTRGKIP HYYQETAIRE ILLRIIRGQR RILLTMATGT GKTHTAVQLM WKLLQSGWLT GRQSSQQGRM LFLADRVILR DQAYNAFSPF ASGASEPRFL LDGQRPLSLN RDLYFGIYQT LWNENDQGKR LFELFPPNFF DVIIIDEAHR SGFGTWRDIL DHFTSAIQLG MTATPKQDEN IDTYAYFCAE EVLTSVNPDN PDGDQIRQAA YTYSLGQGIE DGFLATYKIH RVRTSLDRDG FRLQDAIEQG AEIVIPNGVE PRDHYLTPQF EREIRLPDRT KVIVNHLSQR LRLFGPLQKT MVFCVDMEHA QEVARQLNNE FADLGYGDNY AVPIVSEEGD QGRRWLSLFQ DSDRQLPVVA TTAELLSTGV DVPSARNIVF MKTVASPIVF KQIIGRGTRI DSSIDKLWFR IIDYTGATHL LDPYWDHPPS AVIPSTTQPM TSIVTGTVTL AGSGNPVVGA AIAIQVGPND QRGPILSDAN GCFSFTSLPA SNMTLVASKP GLHRRQISLM TEPNLATQCA IELKPIGESA GKIEAHGLHV AIADEATFIV EGMNEPMTLE RYLDYSRSKI IGFVSERSKL QVIWQDPTQR RVFIEQLSHQ SVHLEVLADI FKAQEADQFD LLSHLAYRTP LQTRVERATA FHRREQAWLA AQSEPIREVI IELLAKYELG GLNQISDPSI FRVSPFREMG EVRGVIARFG DAQRLRETID EIQRRLYAA
|
| |