Gene Haur_1283 details

Gene Information

Locus tag: Haur_1283
Symbol:
ID: 5733176
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1493683
End bp: 1495980
Gene Length: 2298 bp
Protein Length: 765 aa
Translation table: 11
GC content: 48%
IMG OID: 641278423
Product: EcoEI R domain-containing protein
Protein accession: YP_001544059
Protein GI: 159897812
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
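
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1493683
end_bp = 1495980
gene_length_bp = 2298
protein_length_aa = 765

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop, so N codons encode N - 1 residues.
codon_count = span_bp // 3
residue_count = codon_count - 1

print(span_bp, residue_count)
```

Under those assumptions the computed span and residue count agree with the Gene Length and Protein Length fields.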


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCGAT CTGAGTTAAG CGAACGTGAT ATTCAAACCA AGTTCATTAC TCCAGCATTG 
GTTCAAGCTG GCTGGACAGC CGATTTTCAT CTGCGTGAAG AGGTTCAATT AACTGCTGGA
CAAATTTTAG TCACGAATGG GGTCGTTCAG CGAGCGGCCA AAAAATTTGC CGATTATGTG
TTGTATCACG CACCCAATAT TCCGCTGGCG GTGGTCGAGG CAAAAGATGC CAACCATGCA
GTCGGGGCTG GCATTCAACA GGCTTTGGGT TATGCCGATT TGCTTGATGT GCCCTTTATG
TATAGTTCGA ATGGCACTGG TTTTATTGAG CATGATCGAA CTGCGGCCCA TGGCTTAGTG
GAACGCTATT TGGAGCTCGA TGAGTTTCCT TCGCCCGAGG AGCTGTGGCG GCGTTATTGT
GTTTGGAAGC AGCTAACTCC CAATCAGACT AGCCTGATCA ATCAAGATTA TTACCTCGAT
GGCTATGGCA AGGCTCCACG CTATTATCAA ATGATTGCTA TCAATCGCAC GATTGAGGCA
ATTGCCCGTG GTCAACAACG GATTTTATTG GTGATGGCAA CTGGTACTGG GAAAACCTAT
ACCGCCTTTC AAATTATTTG GCGCTTGTGG AAAGCCAAGG TTAAAAAACG GATTCTTTTT
TTGGCCGATC GGAATATTTT GGTCGATCAG GCTCGCAATA ACGACTTTAA GCCCTTTGGC
AATGCCATGA CCAAAATCAG CAATCGTGAG GCTAATACAG CATTTGAGAT TTACCTCGCG
CTCTATCAGG CTATGAGCGG TAGCGAGGCT AGCCAGAATA TTTATCGGCA GTTTTCGCCG
ACTTTTTTTG ATCTCATCGT CATCGATGAG TGTCATCGAG GCAGCGCCGC CGAAGATTCA
GCTTGGCGAC AGATTTTAGA TTATTTTGCC AGTGCAACCC ATATTGGGCT AACTGCCACG
CCCAAAGAAA CCCAAACCAT CTCAAATAGT GATTATTTTG GCGATCCACT GTATACCTAT
TCACTCAAAC AAGGAATTGC CGATGGCTTT CTTGCACCCT ATAGTGTGCT GCGCGTGGCC
ACCAATGTTG ATCTTGAGGG TTGGCGGCCT CAGCCAGGCC AGCGTGATGC CGACGGTCAA
TTGATTGAAG ATCGGCTCTA CAATAGCCGC GATTTTGAGC GCATGGTTTT TTTAGATGAT
CGGATTCAGT TGGTCGCTGC CCGAATCGCG GAGTTTTTAC GCAATCACGA CCCTTACCAA
AAAACCATTG TCTTTTGCCA AAATATTGAG CACGCCGCCC GGATGCGCAC CGCTATCGCC
CAACATTGTA ATGAGTTTGT TCAAGAAGAT GCACGTTATG TCATGCAAAT TACTGGCGAT
AATCCTGAGG GTAAAGCTCA ACTCGATATG TTTATCTTGC CCGATAGCCG TTATCCAGTG
GTCGTTACAA CCTCGAAGTT GCTTACCACT GGCGTAGATG TGCAAACCTG TCGGTTGATT
GTGATCGACC AATACTTGGA ATCGATGACC GAGTTTAAGC AAATTATTGG ACGGGGAACC
CGCATTCGCG AGGATTATAA TAAATTGTAT TTTACAATTA TGGATTTTCG CAATGTTACA
ACGCTGTTTA ACGATCCGGA ATTTGACGGT GATCCAGTTC AGGTTCAAAA CTTCGGCGTT
GATCAAGCCT TGCCCACCGC AACCGCCGCG CCACCCGCCG AGGCCCAAAC CACCAATATT
CGCTACCAAA TAGGCTCTGG CGAAGTGGTG CATATTTTGG CTGAGCAGGT GCGCTACTAC
AGCAGCGATG GCCGCTTAAT TACCGAATCG GTCGAGCATT TTGCTCGCAA CACCGTGCGC
CAACGCTATA CCACCCTCGC TAACTTTTTG CAGACATGGA GCCACGAAGC GCAAAAACAG
GCAATTGTGC AAGAGCTGGC AGCCCAAGGG ATTTTGTTCG ATAAACTTGC TGAAATTGTG
GGATACGAGT ACGATCCGTT TGATCTGATT TGTCATGTTG CCTTTGATCA GCCAGCCCTG
ACGCGCCGCG AACGGGCCGA ACAGGTGCGT AAACGTTCAT ATTTTGCCCA ATATGGCGAA
AAAGCCCGGG CCGTGATTCA AGCACTCTTG GAAAAATACG CCGATCAGGG GTTAGCTACA
ATTCAAGATC GTCAGGTATT GCAGTTGCCA AGCTTTCAAC AACTTGGCAC GCCACGCGAA
ATTATTCAAG CGTTTGGCAG TCTCGGCCAG TATCAGCAAG CCGTAGACGA GCTAGTGCGG
CACTTATATG CTGCATAA
 
Protein sequence
MDRSELSERD IQTKFITPAL VQAGWTADFH LREEVQLTAG QILVTNGVVQ RAAKKFADYV 
LYHAPNIPLA VVEAKDANHA VGAGIQQALG YADLLDVPFM YSSNGTGFIE HDRTAAHGLV
ERYLELDEFP SPEELWRRYC VWKQLTPNQT SLINQDYYLD GYGKAPRYYQ MIAINRTIEA
IARGQQRILL VMATGTGKTY TAFQIIWRLW KAKVKKRILF LADRNILVDQ ARNNDFKPFG
NAMTKISNRE ANTAFEIYLA LYQAMSGSEA SQNIYRQFSP TFFDLIVIDE CHRGSAAEDS
AWRQILDYFA SATHIGLTAT PKETQTISNS DYFGDPLYTY SLKQGIADGF LAPYSVLRVA
TNVDLEGWRP QPGQRDADGQ LIEDRLYNSR DFERMVFLDD RIQLVAARIA EFLRNHDPYQ
KTIVFCQNIE HAARMRTAIA QHCNEFVQED ARYVMQITGD NPEGKAQLDM FILPDSRYPV
VVTTSKLLTT GVDVQTCRLI VIDQYLESMT EFKQIIGRGT RIREDYNKLY FTIMDFRNVT
TLFNDPEFDG DPVQVQNFGV DQALPTATAA PPAEAQTTNI RYQIGSGEVV HILAEQVRYY
SSDGRLITES VEHFARNTVR QRYTTLANFL QTWSHEAQKQ AIVQELAAQG ILFDKLAEIV
GYEYDPFDLI CHVAFDQPAL TRRERAEQVR KRSYFAQYGE KARAVIQALL EKYADQGLAT
IQDRQVLQLP SFQQLGTPRE IIQAFGSLGQ YQQAVDELVR HLYAA
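
The gene and protein sequences can be compared by translating the blocked DNA text. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 uses the same codon-to-residue assignments as the standard table for internal codons, and it handles only an ATG start. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example. Only the first three and last two sequence blocks of the record are used.

```python
# Codon assignments in NCBI order (T, C, A, G). Assumption: table 11 shares
# these assignments with the standard table for all internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate whitespace-blocked DNA, stopping at the first stop codon."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    dna = "".join(blocked_dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE[dna[pos:pos + 3]]
        if amino == "*":
            break
        residues.append(amino)
    return "".join(residues)


# First three blocks and final two blocks of the gene sequence in this record.
first_blocks = "ATGGATCGAT CTGAGTTAAG CGAACGTGAT"
last_blocks = "CACTTATATG CTGCATAA"

print(translate(first_blocks))  # MDRSELSERD
print(translate(last_blocks))   # HLYAA
```

The two outputs match the first block and the last block of the protein sequence listed above.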