Gene Haur_0528 details


Gene Information

Locus tag: Haur_0528
Symbol:
ID: 5732445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 614213
End bp: 615277
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 44%
IMG OID: 641277655
Product: DNA-cytosine methyltransferase
Protein accession: YP_001543304
Protein GI: 159897057
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
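
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the listed length, not something the record states) and that the final codon is a stop codon; the variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 614213
end_bp = 615277

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in codons of 3 bases; the last codon is
# assumed to be a stop codon and contributes no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1065 354
```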


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.555926
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGGCG CAGTCATTGA TCTATTCTGT GGAGTTGGTG GGCTAACCCA CGGGCTTATT 
CTTGAAGGTT TTGGCGTACT GGCAGGGATT GATAACGATC CTTCTTGTAA GTATGCCTAT
GAGCAAAATA ATAGAACTCG TTTTATTGAA AAGTCTATTA CTGAAGTTGA TGGCAGAGAG
TTAAATGCCC TCTATCCCAA TAATCAACAT AAAATCTTAG TAGGCTGTGC CCCATGCCAA
GATTTTTCTC AATATACGAA GAAGAGTCGT ACTGGAACAA AGTGGCAATT ACTCACAGAA
TTTTCGCGCC TTATTAGAGA GATCGAGCCT GATATTATAT CAATGGAAAA TGTTCCTGAG
GTTCGAACAT TCAATAGAGG GGAAGTTTTT AACAATTTTA TTCAAGCACT TGAACAATTA
GGGTATCACG TTTCGCATAG CGTGGTGCAT TGTCCTGATT ATGGAATTCC GCAGCAACGT
GACCGACTAG TCTTATTTGC TGCTAAACAG GGCGTTATTA AGATTATACC CCCAACCCAC
ACTCCTGAGA ACTATCGAAC TGTACGTGAA GTTATTGGTT CTCTGCCACC AATTACTGCT
GGCGGACACT GGGAGGGTGA TAGCATGCAT GCAGCCAGCA GACTAGAAGA TATAAACCTT
CGACGTATTC AGCATTCTGT GCCTGGTGGT ACTTGGGCCG ATTGGCCGGA AGAATTGATT
GCAGAATGTC ATAAAAAGGA AAGCGGTGAG AGTTATGGGA GCGTCTATGG TCGAATGGAA
TGGGATAAAG TAGCACCTAC CATCACTACA CAATGCAATG GGTATGGTAA TGGTCGCTTT
GGTCATCCAG AGCAGGATCG CGCTATTTCG CTCCGTGAGG CTGCCTTACT TCAGACATTT
CCGCGAAGTT ATCAATTTGC CCCTGAAGGC CAACTGAAAT TTAAGACAGT TAGTCGTCAA
ATAGGGAATG CTGTTCCGGT CGCACTAGGT CGTGTTATTG CAAAAAGTAT TAAGCGTTTT
TTGGAGGGTT TACATGAGCG ACAGCGGGTA CGAATTATCA TTTAG
 
Protein sequence
MVGAVIDLFC GVGGLTHGLI LEGFGVLAGI DNDPSCKYAY EQNNRTRFIE KSITEVDGRE 
LNALYPNNQH KILVGCAPCQ DFSQYTKKSR TGTKWQLLTE FSRLIREIEP DIISMENVPE
VRTFNRGEVF NNFIQALEQL GYHVSHSVVH CPDYGIPQQR DRLVLFAAKQ GVIKIIPPTH
TPENYRTVRE VIGSLPPITA GGHWEGDSMH AASRLEDINL RRIQHSVPGG TWADWPEELI
AECHKKESGE SYGSVYGRME WDKVAPTITT QCNGYGNGRF GHPEQDRAIS LREAALLQTF
PRSYQFAPEG QLKFKTVSRQ IGNAVPVALG RVIAKSIKRF LEGLHERQRV RIII
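
The gene and protein sequences above are printed in space-separated blocks of 10 residues. The sketch below shows one way to strip that grouping, translate the coding sequence, and compute GC content. It assumes the standard amino-acid assignments apply, which holds for NCBI translation table 11 (that table differs from the standard one only in its permitted start codons). The helper names `clean`, `translate` and `gc_percent` are hypothetical and are not part of any database tooling.

```python
# Genetic code in NCBI's T, C, A, G base order. Translation table 11
# assigns the same amino acids as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group residues."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence as listed in this record.
first_row = "ATGGTTGGCG CAGTCATTGA TCTATTCTGT GGAGTTGGTG GGCTAACCCA CGGGCTTATT"
print(translate(first_row))  # MVGAVIDLFCGVGGLTHGLI
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.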