Gene Haur_1333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1333 
Symbol 
ID5733225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1543504 
End bp1544751 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content38% 
IMG OID641278471 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_001544106 
Protein GI159897859 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.235826 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATTA CCCTTAAGAC AATAGAATTA TTTGCTGGAG CTGGAGGCCT CGGGCTAGGG 
TTTCTTCTCG CAAATCATCC AGGTGTCAAT TTTAGGCCTT TATGTGCAGT CGATTTTAAC
GTAGACGCAT GTACTAGCTA TAATATGAAT ATGCAATGGC TGCATCAGAA TGCTCCTCAT
TTACAGACAA CACAAGCTTC TAAGGCTTAT CTGCGGAAAG TTGAATCCTT AAACGTTAAT
GCAGTGAAGA GGCTTTTCCA GTTACAACAA GGTGATCTCG ATATTTTAAT GGGTGGTCCT
CCTTGTCAAG GATATTCATC TTCAAATCGC CAGGCATCAA AAGAAACACG CGATGAACTT
AATAATATGG TGAAATCCTT TCTTGATCGA GTTCAAGATT TTTCACCAAA AATGTTTCTC
TTAGAAAATG TCCAAGGAGT CACATGGACT GCCTCGACTG ACGAAATGAG AATACCTAGT
GAGCAATTAT CCTTTATAGA TAATGAAGAG ATTGCTGATG TTAAAGACTA TTTAGTTCAT
AGAGCACGCG AGCTGGGTTA TCACATATGG TATTCGGTGC TTGATGCAGC GGATTTTGGT
GTCCCTCAAC ATAGAAAACG ATTTTTTCTT TTTGGTATTC GTACAGACTT GACAACTGAC
CCAAATATTC GGCTTGAAAA ATTTATCAAT CCTTATAGAA CGAGCACACT TACAACAGTT
GCCCAAGCTA TTGAGGATCT TCCTGTTATT AATAATGGCG AGCATTGGAA AGGTAATAAC
TATAATCCGG TGGCGAATGG GTATATCACT ATGATGCGTA GCTTTATGAA TAATAATGTT
TTATTTGACC ACTTTACAAC AAATCATCAA GAATATGTTC TTGAGCGTTT CAGAAATATT
CCTGAAGGCG AAAATTGGAA ATCTATAAAA AATATTATGA ATACGTATAA AAATGTAAAC
AAAACCCATA GTAATATTTA TAGAAGATTA CAACGGAATG CCCCATCGCA TACTATTAGT
CATTACCGCA AAGCAATGAC TATCCATCCT GTACAGAATA GAGGATTATC ATTTAGAGAA
GCCTGTAGAT TGCAGTCTTT TCCAGACTGG TATCGATTTA GTGGAACAAG AGAAAGTGCC
CAACAGCAAC TAGCGAATGC AGTGCCACCT TTGCTTTCGT CAAAGGTGGC ACTGGCTATC
GCAGATTATT GGTTATCTCT GCCACATAAT GCTCTTATGA AAGATTAA
 
Protein sequence
MPITLKTIEL FAGAGGLGLG FLLANHPGVN FRPLCAVDFN VDACTSYNMN MQWLHQNAPH 
LQTTQASKAY LRKVESLNVN AVKRLFQLQQ GDLDILMGGP PCQGYSSSNR QASKETRDEL
NNMVKSFLDR VQDFSPKMFL LENVQGVTWT ASTDEMRIPS EQLSFIDNEE IADVKDYLVH
RARELGYHIW YSVLDAADFG VPQHRKRFFL FGIRTDLTTD PNIRLEKFIN PYRTSTLTTV
AQAIEDLPVI NNGEHWKGNN YNPVANGYIT MMRSFMNNNV LFDHFTTNHQ EYVLERFRNI
PEGENWKSIK NIMNTYKNVN KTHSNIYRRL QRNAPSHTIS HYRKAMTIHP VQNRGLSFRE
ACRLQSFPDW YRFSGTRESA QQQLANAVPP LLSSKVALAI ADYWLSLPHN ALMKD