Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5277 |
Symbol | |
ID | 5737235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 63865 |
End bp | 65580 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641282441 |
Product | N-6 DNA methylase |
Protein accession | YP_001548032 |
Protein GI | 159901787 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAATGG ATACACTTCC TAAGCGAAGC ACCAATGAGA CGCTTCGTAG TAATATGTGG CGGGCATGTG ATATCCTGCG CCGTGATAAT AATGTTGGCG GAGTAATGCA GTATACCGAG CATCTTGCTT GGCTGTTATT TCTTCGGTTC ATGGATATGG AAGAAAAACG TCGCGTTGAT TTGGCGCTGC TAAACGAGAT GCCTTATCAT CCAGTACTTC ATGGAGACTT GTCTTGGGAT TTTTGGGCTA GCCCAGAGGC ATTAGAGCGT CGTTCTGCAC CTGAGTTAAT TCAATTTGTG CGCGGTCGGC TTTTGCCGGG TCTTGCGACC CTTACCGGAT CATCATTGGC ACGCACAATT GCAGGCATTT TCTCTGATGA AAGTACTGGT GATCAGAATG TAGTGCGGGC AGTTCCAGTC TGTGCCTCAG GATATAATCT CAAAGATGTA CTGGAGATTA TCAACAGTAT TCACTTTGAG CTTGATAGTG ATCTCTTTAC GATCTCGCTT TTTTACGAAG ATCTTCTTGA ACGGATGAGT AGCGAGAATC GTACTGCTGG CGAGTTCCAT ACACCACGAG CGGTTATTCG ATTTATGGTT GAGCTGATGG CTCCCCAAAT CGGTGAGACC GTTTACGACC CAGCCTATGG ATCAGCAGGC TTTTTGGTCC AGGCATTTTT ATTTATGCAG CCCTTTGCCC GCACAATTGA AGAACACACT AGCTTACATG AACAAACCTT CTTTGGAATC GAGAAGAAAG CACTTTCGGC TTTGCTTGGT ACCATGAATA TGGTGTTACA TGGTGTCAAT GCCCCCAAAC TCCTTCGAGC CAATACTTTG GAAGAATCAA TGCAGGGAGA TTCGGGTCAA CGCTATGATG TGGTGCTTAC AAATCCTCCG TTTGGTGGCA CTGAGGGTGC TCATATTCAG CAAAATTTTG CGGTTAAGGC GAATGCTACT GAGTTATTAT TTCTTCAACA TATTATCAAA AAACTCAAGC GAACACCCAA TGCCCGAGCA GCTATTGTTG TGCCCGAAGG AACGCTCTTT CGTAGTGGAG CCTTTGCTGA GGTAAAGCAA GATCTATTGC AGCAGTTTCA TCTGTTTGCA GTATTCAGCT TGCCTCCAGG CACATTCGCT CCCTACTCTG ATGTTAAAAC AGCAATTCTA TTTCTTAAGC GGCCTGATTC ACTATTAATT GCTAATCCAT TGGCACGTGA GGAAACGTGG TTTTACGAAT TGCCGCTTCC TGAAGGACTC AAGAAGTTTT CTAAAGGCAG TCGCATTAGT GATAGCCATT TCGATGAGGC GCGGCATTTA TGGCAGGTTT GGAGTGATTA TCTATCTGGT AATGCTGAGC GACCCTTTGT CTACGCCGCT GATCTACGAC CGCATCAAAC TTCTAACGAG CCAACTCCTA TTCAGGAAAC ATTCTTTGCC AACAAGCAAA CCAGTTTGCA GCTGTCATCC ATAAACAATC GGCAATTTGA GCCAGTGTTT GCGCGAAATA TTAACGCTTG GATCGAGACC TACAATGACA TAGCTTCACG CGGTTTTGAT CTAAGCGCCC GAAATCCTCA TCGGGTTGAA CAAGAGTCCC GCGAATCGGC GTTCGTGCTG ACAGCCCGAT TGTTAGAGCG TAGCCGCGAA TTACATTCTA TGATCCAGAG TCTGCATGCT AAGCTGAGTC AAGGCAGAGA GGAGGTGGAA GAGTGA
|
Protein sequence | MVMDTLPKRS TNETLRSNMW RACDILRRDN NVGGVMQYTE HLAWLLFLRF MDMEEKRRVD LALLNEMPYH PVLHGDLSWD FWASPEALER RSAPELIQFV RGRLLPGLAT LTGSSLARTI AGIFSDESTG DQNVVRAVPV CASGYNLKDV LEIINSIHFE LDSDLFTISL FYEDLLERMS SENRTAGEFH TPRAVIRFMV ELMAPQIGET VYDPAYGSAG FLVQAFLFMQ PFARTIEEHT SLHEQTFFGI EKKALSALLG TMNMVLHGVN APKLLRANTL EESMQGDSGQ RYDVVLTNPP FGGTEGAHIQ QNFAVKANAT ELLFLQHIIK KLKRTPNARA AIVVPEGTLF RSGAFAEVKQ DLLQQFHLFA VFSLPPGTFA PYSDVKTAIL FLKRPDSLLI ANPLAREETW FYELPLPEGL KKFSKGSRIS DSHFDEARHL WQVWSDYLSG NAERPFVYAA DLRPHQTSNE PTPIQETFFA NKQTSLQLSS INNRQFEPVF ARNINAWIET YNDIASRGFD LSARNPHRVE QESRESAFVL TARLLERSRE LHSMIQSLHA KLSQGREEVE E
|
| |