Gene Haur_5278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5278 
Symbol 
ID5737236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp65626 
End bp68325 
Gene Length2700 bp 
Protein Length899 aa 
Translation table11 
GC content46% 
IMG OID641282442 
Producttype III restriction protein res subunit 
Protein accessionYP_001548033 
Protein GI159901788 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACTTA ATGAAGCGGA TACCCGCGCC CAGCTTATTG ATCCGAAATT AAGTATTGCA 
GGATGGACTC GCACGCAAGT CACTCGAGAG CAATACTACT TGACTGATTG GATATATACT
GCTGGTAGAG TTGTTTTGCG AGGAGAGCGA GCGGAACGCT TACAACCTAG GCGGGTTGAT
TATGTATTAC GCTATACGGA TAGCTTCCCT TTAGCAATCG TTGAAGCAAA GGATGAGGGC
AAACCTGCTG TGGCCGGACT AGAACAGGCC AAGCGATACG CTCGTGAATT GGGACTTATG
TTCGCCTATG CTACCAATGG ACACGAAATT ATAGAGTGGG ATAATTTTAC CAATACTTCT
ACATTGGTTG AGTCATTTCC TAGTCCAAGC CGTCTTTGGG ATCGTTGGTG TCGCAACATA
GGGATTGAAG ACCCTACCCT CCAACCAAGA TTAACCAATG ATCTTAGAGA ACTACGACCG
CTTTATAGTG CTGATGATGC TCAGGTACGA CGCAGAAATC CGCTCCTTCA CCCCTATGCC
CCTGAAGATG TGACGCGTGG CAAAATACCT CATTACTATC AAGAAACGGC TATTCGAGAG
ATTCTGTTGC GCATTATTCG AGGTCAGCGC CGTATTCTCC TTACAATGGC TACGGGGACT
GGAAAAACGC ATACTGCTGT TCAGCTTATG TGGAAACTTC TTCAGTCAGG TTGGCTTACG
GGACGGCAAA GTTCACAGCA GGGCCGTATG TTATTTCTCG CAGATAGGGT GATCCTCAGA
GATCAGGCAT ATAATGCATT TAGTCCGTTT GCAAGTGGTG CCAGCGAGCC TCGCTTTTTA
TTAGATGGGC AGCGTCCGCT ATCACTGAAT AGAGATTTAT ACTTTGGAAT TTATCAAACT
TTATGGAATG AGAATGATCA AGGAAAACGA TTATTTGAGT TGTTTCCTCC GAATTTTTTC
GATGTAATCA TTATTGATGA GGCTCATCGT TCTGGATTTG GCACATGGCG CGATATTCTT
GATCATTTCA CCAGCGCCAT CCAGTTGGGG ATGACCGCGA CCCCGAAACA GGATGAAAAT
ATCGATACCT ATGCCTATTT CTGTGCTGAA GAGGTATTGA CCTCGGTTAA CCCAGATAAC
CCTGATGGCG ATCAAATTCG CCAAGCTGCA TATACATACA GCCTTGGTCA AGGTATTGAA
GATGGGTTTC TCGCAACCTA TAAAATTCAT CGAGTAAGAA CCAGCTTAGA TCGTGATGGG
TTTCGATTAC AAGATGCGAT TGAACAAGGT GCGGAGATAG TTATCCCTAA TGGAGTTGAA
CCACGCGATC ACTACTTAAC ACCACAGTTT GAACGTGAGA TCCGTCTGCC TGATCGAACC
AAGGTTATCG TCAATCATCT TTCCCAACGG CTACGGCTCT TTGGTCCATT GCAAAAAACG
ATGGTTTTTT GTGTAGATAT GGAGCATGCG CAGGAAGTTG CAAGGCAGCT CAATAATGAA
TTTGCTGATC TGGGATATGG TGACAACTAT GCAGTCCCTA TTGTCAGTGA AGAAGGTGAT
CAAGGCCGAC GTTGGTTGAG TTTGTTCCAA GATAGCGATC GCCAGCTTCC AGTTGTTGCA
ACAACAGCTG AACTCCTTTC AACTGGTGTT GATGTTCCTT CAGCTCGCAA TATTGTTTTT
ATGAAGACAG TTGCTTCACC GATTGTATTT AAACAGATTA TTGGGCGTGG AACCCGGATT
GATAGCAGTA TCGATAAATT GTGGTTTCGT ATTATTGATT ATACCGGAGC TACGCACTTA
CTTGATCCTT ATTGGGATCA TCCACCCTCA GCGGTAATAC CCTCAACAAC CCAGCCTATG
ACTTCAATTG TTACTGGAAC TGTGACATTG GCTGGGTCAG GTAACCCTGT TGTTGGGGCG
GCGATTGCAA TTCAGGTTGG GCCAAATGAC CAACGTGGAC CAATTTTATC AGATGCCAAT
GGATGTTTTT CTTTTACGAG CCTGCCTGCT AGTAACATGA CGCTTGTTGC TAGTAAACCA
GGATTACACC GTCGGCAAAT AAGCCTAATG ACAGAACCTA ATCTTGCAAC ACAATGTGCT
ATTGAACTGA AGCCAATTGG GGAAAGTGCT GGTAAAATTG AGGCACATGG TCTGCATGTC
GCGATTGCAG ATGAAGCAAC ATTTATCGTC GAGGGCATGA ATGAACCAAT GACCTTGGAA
CGCTACCTCG ACTATAGCCG ATCTAAGATT ATCGGCTTTG TATCAGAGCG GAGCAAGTTG
CAAGTAATCT GGCAAGATCC TACGCAGCGT CGAGTATTTA TTGAACAGCT TTCACACCAG
AGCGTCCATT TAGAAGTCCT TGCCGATATT TTCAAAGCAC AGGAAGCAGA TCAATTTGAT
CTCCTTAGTC ATTTGGCCTA TAGAACACCG CTCCAAACCC GAGTTGAACG TGCCACCGCT
TTCCACAGAC GCGAGCAAGC GTGGTTGGCA GCTCAATCTG AACCCATTCG TGAGGTTATA
ATAGAACTTC TCGCCAAGTA TGAGCTTGGC GGACTTAACC AGATTAGTGA CCCAAGTATC
TTTAGAGTTA GTCCTTTCCG TGAGATGGGT GAAGTGCGCG GAGTTATTGC GCGATTCGGT
GATGCTCAGC GGTTGCGCGA AACCATTGAT GAAATTCAGC GACGTTTATA CGCCGCATAG
 
Protein sequence
MPLNEADTRA QLIDPKLSIA GWTRTQVTRE QYYLTDWIYT AGRVVLRGER AERLQPRRVD 
YVLRYTDSFP LAIVEAKDEG KPAVAGLEQA KRYARELGLM FAYATNGHEI IEWDNFTNTS
TLVESFPSPS RLWDRWCRNI GIEDPTLQPR LTNDLRELRP LYSADDAQVR RRNPLLHPYA
PEDVTRGKIP HYYQETAIRE ILLRIIRGQR RILLTMATGT GKTHTAVQLM WKLLQSGWLT
GRQSSQQGRM LFLADRVILR DQAYNAFSPF ASGASEPRFL LDGQRPLSLN RDLYFGIYQT
LWNENDQGKR LFELFPPNFF DVIIIDEAHR SGFGTWRDIL DHFTSAIQLG MTATPKQDEN
IDTYAYFCAE EVLTSVNPDN PDGDQIRQAA YTYSLGQGIE DGFLATYKIH RVRTSLDRDG
FRLQDAIEQG AEIVIPNGVE PRDHYLTPQF EREIRLPDRT KVIVNHLSQR LRLFGPLQKT
MVFCVDMEHA QEVARQLNNE FADLGYGDNY AVPIVSEEGD QGRRWLSLFQ DSDRQLPVVA
TTAELLSTGV DVPSARNIVF MKTVASPIVF KQIIGRGTRI DSSIDKLWFR IIDYTGATHL
LDPYWDHPPS AVIPSTTQPM TSIVTGTVTL AGSGNPVVGA AIAIQVGPND QRGPILSDAN
GCFSFTSLPA SNMTLVASKP GLHRRQISLM TEPNLATQCA IELKPIGESA GKIEAHGLHV
AIADEATFIV EGMNEPMTLE RYLDYSRSKI IGFVSERSKL QVIWQDPTQR RVFIEQLSHQ
SVHLEVLADI FKAQEADQFD LLSHLAYRTP LQTRVERATA FHRREQAWLA AQSEPIREVI
IELLAKYELG GLNQISDPSI FRVSPFREMG EVRGVIARFG DAQRLRETID EIQRRLYAA