Gene Haur_5275 details

Gene Information

Locus tag: Haur_5275
Symbol:
ID: 5737233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009974
Strand:
Start bp: 60738
End bp: 62609
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 45%
IMG OID: 641282439
Product: N-6 DNA methylase
Protein accession: YP_001548030
Protein GI: 159901785
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
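
The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 60738
end_bp = 62609
gene_length_bp = 1872
protein_length_aa = 623

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not translated,
# so a CDS of n codons yields n - 1 residues.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 1872 624 623
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.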


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGATA TTGCAGCAGG GTTGAACGAG ATATGGAAGA TCTTTCGAGC AAAGGGGATC 
GTTGATGATC TCCTAATTAT CGATCATATC GCGACGTTAC TGCTTGAGCA AAACAGCCTG
TCGCCTCCTT CAGGTTTACA AGGTGAGCCA GTTCTACTTC CTAAGGTTGA TGAAATTAAA
ACGCGGCTGA GCGCTCTTTC AACTCTACTA GAGGGAGGAG CAGCTGAACT CTTCGACCGT
TATATTCTCT TTCGCCTCGA CCAAACCCAT CTTGGCGGAC GTTATCCTAC TCCACGCCAT
CTAGTAAAAT TTATGCGTAC TATTGCTCAT GTGACAGCCA ACGATAGCCT GCTTGACTTG
GCCTGTGGTA GTGGTGGCAT GCTTGCAGGG CGTGCTCAAT CAGCAGAGCA TCCGACCCTT
ACCAACGGAC TTGAAATCTC GCCCCAATGG GCACGATTAG CCTGGGCCAA CTGTGCACTC
CATGGACTCA AAGATTTTAC GATTGAAATA GCTGATGCTT TAACTTATCC GCAAGCCATT
TCCGTTAATC GAATACTTAT GAATCCTCCA TTTGGCACAC AAGTATCCAC AGAAGGTTTA
TCCGGTCGGA GTGAAACCCG TCTGATAGAA CAGGCGATCA AATGGCTAGC TGACAATGGT
CGGCTTTGTG TTCTGGCTCC AGCTGGAATC CTGTTTGGTG GAGGAAGGGA AAAAGAGCTA
AGAAAGAATC TATGCACAAA TCAACAAATT AATGCGATTA TTGCCCTACC GAAAGACACC
TTTCAACCTT TCAGTACACT CCAAACCTAT CTGCTGCTTA TTACGAAATC AGTGCCTCAA
GCTGGAACGT GGTTTATCCG CGCCGAACGT GATGGTTATA TGCGTGGGCG TGGGCGAGAT
CTAACTAAGC AACCGACTGA TGCGAGTGAT TTTCCATTAA TCGAAAGCAT ACTTGGGTGG
GATAACACAT GGAATCTTAC TGACGATCAG CAATTATTAT CATATCGGCA ACTTACTATT
GATGAAGAGC GTGTTTTAAT TATTGGTGCA CCCGCTGGGA GTATATTCAC ACAGGTAGAG
CGTTACAGTC AAGGTTCTAA GCATATCTTT TTAATCAATG TTGGTTTAGA TGCGCAGCGT
AAAAGTTACA TTGTAGATCT TAATGATCCT ATCCCAATTA AATTAATGAC ACAGCAACGT
GAAGATATAA TTACAGAGAA GTTTAGCAAA TCAAAAGAGG AGAAACCAAA ATTAGTAACA
CTTTTGAACG GAGACCATTA TAGTTCAGCC ATTGCAATCA CAACAAGTGG CCGTTTGCTA
GGCACTCGTG TTCTTCAAGA TCAGATTATT AAGCAGGCAG ACTATACATT TAAAATCGAT
CGTTACTTGC CAGCCGAAGA GATGGCGGTT GTCAATCGCC CACCGAGTGA ACTGCTTGTT
GAGATTCGGG CCAATCAAGG TCGTATGGCG CAGTATATTG ATAGTCTTTT AAGAAAACTT
GAAGCACCCC AAATTGGCGA TGGCAGGTTG ATGGCGCAGG TGTGGCAGCT AGAACCGACC
GCGATTGATG TACTTAGTAG AGAACAGCGC CAGATTTGGG ATAGTATCAA GAGCCTAACC
CTGACCGTAC ACTCAGAAAG TGCTAGTACT GGTTTTGAAA CCCCAAACTA CTTTGATGTG
GCATCACTGC ATCAACAACA ACCAAATTTG CCTGAATCAG AACTGGCAAG CATGCTTGAA
TTATTTGAGA AGCTGGGGCT GATCGTTGCC GTAACGCTGA TAGATTCGCA AGACCAGCAT
CTATCTGCCT ATCGATTGTT GAGCGAACGA GATATTTGGC GGGAGCTTCC TAGCTCTGGG
GTTAGCTCAT GA
 
Protein sequence
MADIAAGLNE IWKIFRAKGI VDDLLIIDHI ATLLLEQNSL SPPSGLQGEP VLLPKVDEIK 
TRLSALSTLL EGGAAELFDR YILFRLDQTH LGGRYPTPRH LVKFMRTIAH VTANDSLLDL
ACGSGGMLAG RAQSAEHPTL TNGLEISPQW ARLAWANCAL HGLKDFTIEI ADALTYPQAI
SVNRILMNPP FGTQVSTEGL SGRSETRLIE QAIKWLADNG RLCVLAPAGI LFGGGREKEL
RKNLCTNQQI NAIIALPKDT FQPFSTLQTY LLLITKSVPQ AGTWFIRAER DGYMRGRGRD
LTKQPTDASD FPLIESILGW DNTWNLTDDQ QLLSYRQLTI DEERVLIIGA PAGSIFTQVE
RYSQGSKHIF LINVGLDAQR KSYIVDLNDP IPIKLMTQQR EDIITEKFSK SKEEKPKLVT
LLNGDHYSSA IAITTSGRLL GTRVLQDQII KQADYTFKID RYLPAEEMAV VNRPPSELLV
EIRANQGRMA QYIDSLLRKL EAPQIGDGRL MAQVWQLEPT AIDVLSREQR QIWDSIKSLT
LTVHSESAST GFETPNYFDV ASLHQQQPNL PESELASMLE LFEKLGLIVA VTLIDSQDQH
LSAYRLLSER DIWRELPSSG VSS
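
The relationship between the gene sequence and the protein sequence above can be illustrated by translating a few codons. This is a minimal sketch, not the database's own pipeline: it assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ in permitted start codons), and the helper name `translate` is hypothetical. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
# Build the codon table in NCBI's T, C, A, G base order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # first base T
         "LLLLPPPPHHQQRRRR"   # first base C
         "IIIMTTTTNNKKSSRR"   # first base A
         "VVVVAAAADDEEGGGG")  # first base G
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES),
    AMINO,
))


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the coding sequence
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence (60 nt, 20 codons).
first_line = ("ATGGCAGATA TTGCAGCAGG GTTGAACGAG "
              "ATATGGAAGA TCTTTCGAGC AAAGGGGATC")
# Last line of the gene sequence (three codons plus the TGA stop).
last_line = "GTTAGCTCAT GA"

print(translate(first_line))  # MADIAAGLNEIWKIFRAKGI
print(translate(last_line))   # VSS
```

The two results agree with the first twenty residues and the final three residues of the listed protein sequence.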