Gene Information |
Locus tag | Hhal_0414 |
Symbol | |
ID | 4711541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 481678 |
End bp | 482988 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639854873 |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_001002006 |
Protein GI | 121997219 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
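The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS includes a single terminal stop codon that is not translated; the variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information table.
start_bp = 481678
end_bp = 482988

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the results agree with the listed Gene Length (1311 bp) and Protein Length (436 aa).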
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.596456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACTCTT CCAATGCTGC GACCGCACTG CTTTACTCGG CCTTGGAGCT GGCCACGAGG GCGGATGTGG CCCGTGAACT CGGGGTGGAC GAACGTACGG TCCGGCGGTG GGCGAAAGGC GAGATCCCGA TGCCCAATCG TTGGGAGCCG GCGCTGAGCC AGCTCCTGCT CCGGCGGGCT CCCCTGCGGT CTGACGTTGA CGGGCAGTTC TCCTTCATTG ACCTGTTCGC CGGAGTGGGG GGTATCCGAC AGGGGTTTGA AAGCGTCGGT GGCCACTGCG TCTTTTCCTC CGAATGGGAC CGTTTCGCCC TCCAGACCTA CCGTGCAAAC TTTGGGAACG AGGGCGAAGA GATCCAGACG GACATCCGTC AGATTACGGC GGTCTCAGAT GACGCGGATG AGAACAGCCG CTCTATCGAC GAACGCATTC CTCAGCACGA CGTCCTGTTG GCCGGATTCC CGTGTCAGCC CTTTTCGCTG GCCGGCGTTT CCAAGAAGAA CAGCCTGGGA CGCGAGCATG GGTTCCTGTG CGAGGCGCAG GGAACCCTGT TCTTTGACGT TGCGCGGATC ATCGAGGTAA AGCGTCCACG GGCATTCCTC CTCGAGAACG TGAAAAACCT GCGAAGTCAT GATGGCGGAC GTACCTACGA AGTGATCCGC CGCGTGCTGG AAGAGCTCGG TTATCGCGTG CATGATCGGG TCATCGACGG CAAGGGGTTC GTGCCCCAGC ACCGCGAGCG GATCTACATG GTTGGCTTCC GTAAGGATAC GCCCTTCACC TGGAACCAAC TGGACTTCCC CGCACCGGAC GCCCGCACCC TCCGGGAGGT CCTGCACCCG GAGGACGGAT CGGAAGCCGC GGAACCCCCT TATACCGAGG GTGACTTGGC CACCGTAGGC GACAAGTACG TCCTGAGCGA GAAACTCTGG AAATACCTGC AGGACTATCG GGCCAAGCAT GAGCACGCGG GCAACGGCTT CGGCTACAGT AAGGTCGGCC CGGAAGACAC CGCACGGACA CTGTCCGCCC GGTACCACAA GGACGGCTCC GAGATCCTGG TTGACCGGGG AGCTGGCGAG CGGCCGCGTC GGCTTACGCC ACGCGAGTGC GCCCGACTGA TGGGCTTCGA TGACAGTTTC CGGATCCCGG TGAGTGACAC GCAGGCTTAT CGCCAGTTTG GTAACTCCGT TGTCGTTCCG GTCATCCGTG AGATCGCGTC TGCGATGGCG CCACACGTCC TCGCGGACAT CCGCTCCGAC CAGGATGGCC ATCAGCTGGC GCTGCCGATG GAGTTCAGGG AGACGGCGTG A
|
Protein sequence | MHSSNAATAL LYSALELATR ADVARELGVD ERTVRRWAKG EIPMPNRWEP ALSQLLLRRA PLRSDVDGQF SFIDLFAGVG GIRQGFESVG GHCVFSSEWD RFALQTYRAN FGNEGEEIQT DIRQITAVSD DADENSRSID ERIPQHDVLL AGFPCQPFSL AGVSKKNSLG REHGFLCEAQ GTLFFDVARI IEVKRPRAFL LENVKNLRSH DGGRTYEVIR RVLEELGYRV HDRVIDGKGF VPQHRERIYM VGFRKDTPFT WNQLDFPAPD ARTLREVLHP EDGSEAAEPP YTEGDLATVG DKYVLSEKLW KYLQDYRAKH EHAGNGFGYS KVGPEDTART LSARYHKDGS EILVDRGAGE RPRRLTPREC ARLMGFDDSF RIPVSDTQAY RQFGNSVVVP VIREIASAMA PHVLADIRSD QDGHQLALPM EFRETA
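The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and that the gene sequence is listed in coding orientation, which its leading ATG suggests even though the gene lies on the minus strand. The helper name `translate` and the table layout are illustrative choices, not part of the record.

```python
# Codon-to-amino-acid map, laid out in TCAG order.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
prefix = "ATGCACTCTT CCAATGCTGC GACCGCACTG"
print(translate(prefix))  # MHSSNAATAL
```

The output matches the first ten residues of the listed protein sequence.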
|
| |