Gene Haur_5277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5277 
Symbol 
ID5737235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp63865 
End bp65580 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content46% 
IMG OID641282441 
ProductN-6 DNA methylase 
Protein accessionYP_001548032 
Protein GI159901787 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAATGG ATACACTTCC TAAGCGAAGC ACCAATGAGA CGCTTCGTAG TAATATGTGG 
CGGGCATGTG ATATCCTGCG CCGTGATAAT AATGTTGGCG GAGTAATGCA GTATACCGAG
CATCTTGCTT GGCTGTTATT TCTTCGGTTC ATGGATATGG AAGAAAAACG TCGCGTTGAT
TTGGCGCTGC TAAACGAGAT GCCTTATCAT CCAGTACTTC ATGGAGACTT GTCTTGGGAT
TTTTGGGCTA GCCCAGAGGC ATTAGAGCGT CGTTCTGCAC CTGAGTTAAT TCAATTTGTG
CGCGGTCGGC TTTTGCCGGG TCTTGCGACC CTTACCGGAT CATCATTGGC ACGCACAATT
GCAGGCATTT TCTCTGATGA AAGTACTGGT GATCAGAATG TAGTGCGGGC AGTTCCAGTC
TGTGCCTCAG GATATAATCT CAAAGATGTA CTGGAGATTA TCAACAGTAT TCACTTTGAG
CTTGATAGTG ATCTCTTTAC GATCTCGCTT TTTTACGAAG ATCTTCTTGA ACGGATGAGT
AGCGAGAATC GTACTGCTGG CGAGTTCCAT ACACCACGAG CGGTTATTCG ATTTATGGTT
GAGCTGATGG CTCCCCAAAT CGGTGAGACC GTTTACGACC CAGCCTATGG ATCAGCAGGC
TTTTTGGTCC AGGCATTTTT ATTTATGCAG CCCTTTGCCC GCACAATTGA AGAACACACT
AGCTTACATG AACAAACCTT CTTTGGAATC GAGAAGAAAG CACTTTCGGC TTTGCTTGGT
ACCATGAATA TGGTGTTACA TGGTGTCAAT GCCCCCAAAC TCCTTCGAGC CAATACTTTG
GAAGAATCAA TGCAGGGAGA TTCGGGTCAA CGCTATGATG TGGTGCTTAC AAATCCTCCG
TTTGGTGGCA CTGAGGGTGC TCATATTCAG CAAAATTTTG CGGTTAAGGC GAATGCTACT
GAGTTATTAT TTCTTCAACA TATTATCAAA AAACTCAAGC GAACACCCAA TGCCCGAGCA
GCTATTGTTG TGCCCGAAGG AACGCTCTTT CGTAGTGGAG CCTTTGCTGA GGTAAAGCAA
GATCTATTGC AGCAGTTTCA TCTGTTTGCA GTATTCAGCT TGCCTCCAGG CACATTCGCT
CCCTACTCTG ATGTTAAAAC AGCAATTCTA TTTCTTAAGC GGCCTGATTC ACTATTAATT
GCTAATCCAT TGGCACGTGA GGAAACGTGG TTTTACGAAT TGCCGCTTCC TGAAGGACTC
AAGAAGTTTT CTAAAGGCAG TCGCATTAGT GATAGCCATT TCGATGAGGC GCGGCATTTA
TGGCAGGTTT GGAGTGATTA TCTATCTGGT AATGCTGAGC GACCCTTTGT CTACGCCGCT
GATCTACGAC CGCATCAAAC TTCTAACGAG CCAACTCCTA TTCAGGAAAC ATTCTTTGCC
AACAAGCAAA CCAGTTTGCA GCTGTCATCC ATAAACAATC GGCAATTTGA GCCAGTGTTT
GCGCGAAATA TTAACGCTTG GATCGAGACC TACAATGACA TAGCTTCACG CGGTTTTGAT
CTAAGCGCCC GAAATCCTCA TCGGGTTGAA CAAGAGTCCC GCGAATCGGC GTTCGTGCTG
ACAGCCCGAT TGTTAGAGCG TAGCCGCGAA TTACATTCTA TGATCCAGAG TCTGCATGCT
AAGCTGAGTC AAGGCAGAGA GGAGGTGGAA GAGTGA
 
Protein sequence
MVMDTLPKRS TNETLRSNMW RACDILRRDN NVGGVMQYTE HLAWLLFLRF MDMEEKRRVD 
LALLNEMPYH PVLHGDLSWD FWASPEALER RSAPELIQFV RGRLLPGLAT LTGSSLARTI
AGIFSDESTG DQNVVRAVPV CASGYNLKDV LEIINSIHFE LDSDLFTISL FYEDLLERMS
SENRTAGEFH TPRAVIRFMV ELMAPQIGET VYDPAYGSAG FLVQAFLFMQ PFARTIEEHT
SLHEQTFFGI EKKALSALLG TMNMVLHGVN APKLLRANTL EESMQGDSGQ RYDVVLTNPP
FGGTEGAHIQ QNFAVKANAT ELLFLQHIIK KLKRTPNARA AIVVPEGTLF RSGAFAEVKQ
DLLQQFHLFA VFSLPPGTFA PYSDVKTAIL FLKRPDSLLI ANPLAREETW FYELPLPEGL
KKFSKGSRIS DSHFDEARHL WQVWSDYLSG NAERPFVYAA DLRPHQTSNE PTPIQETFFA
NKQTSLQLSS INNRQFEPVF ARNINAWIET YNDIASRGFD LSARNPHRVE QESRESAFVL
TARLLERSRE LHSMIQSLHA KLSQGREEVE E