Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0531 |
Symbol | |
ID | 4242379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 840599 |
End bp | 841864 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638105842 |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_720456 |
Protein GI | 113474395 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.247848 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACGAC CAATTGGTAT TGATTTATTT GCAGGCGCAG GAGGTATGAC TCTTGGATTT GAACAAGCAG GATTTGATAT ACCTATTTCC GTAGAATTAG ACCCTATTCA CTGTGCTATT CATAAGTTTA ATTTTCCCTT TTGGTCCATA TTATGTCGTA ATGTTGTCGA ACTCACAGGG AATGAAATTA GAGAGAAATT AAATATTCCA AATAGAGAAA TTGATGTAAT TTTTGGAGGT CCTCCATGTC AAGGATTTTC TCAGATTGGA AAACGTGCTT TAGACGATCC TCGAAATGCG CTTATATCTC ATTTTTTACG AATAGTTTTG GAATTAAAAC CCAAATATTT TGTCATAGAA AATGTTAAAG GTTTAACTGT AGGAAAACAT CAAATTTTTC TTGAGGAAGT TATTAATAAA TTATCTAAAA ATAGTTATCA ACTACAACTG CCTTACCAAG TTTTAAATGC TGCTAATTAT GGTGTACCAC AACACCGAGA AAGATTATTT ATACTAGGAT GTAAAAAAGG TTTAAAATTA CCAAATTATC CACAAATTCA AATACATAAA AAATCAGAAG CCTATATAAA TGTCTGGGAT GCAATAGGAG ATTTACCAGA AGTAGAAAAT TATCCAGAAT TATTAGAAAT AGACTGGGTA AAAGCAGAAA ATGACTATGA TAAACCAAGT GAATATGCTA AAAAACTCCG TGGAATTGAA TATTTTAACA ATGATTATTC TTATGAACGT GAATACGACC AGACAATATT AACCTCTAGT TTACGCACAA AACATACACA ACAATCAATA GCAAGATTTG ATGCTACAGC ACAAGGAAAA ACTGAGCCAG TAAGTCGCTT TTATAAACTG AATCCTCACG GTATTTGTAA CACACTCAGA GCAGGGACTC CAAGTAGTAG AGGTTCATAT ACTTCTCCTA GACCAATACA TCCATTGACA CCCAGATGTA TCACAGTTAG AGAAGCAGCA AGACTACATT CATATCCTGA TTGGTTTAGA TTTCATGTGA CAAAATGGCA TGGTTTTAGA CAAGTAGGTA ATTCAGTTCC ACCACTATTA GCAAAAGCAG TTGCTCAAGA AATTATTCAT GCTTTAAATA TTCAACCATT TAAACCTACA AAGACAGAAT ATGAGTTAGA AAATCTCAAA AATTTAGAGC TTAATATGTC TAGAGCCGCC GATTTATATG GTATTGATAG ATATACTATT GCACCCAGAC TAAGAAAGTT AAAATTAAAT AATTAA
|
Protein sequence | MQRPIGIDLF AGAGGMTLGF EQAGFDIPIS VELDPIHCAI HKFNFPFWSI LCRNVVELTG NEIREKLNIP NREIDVIFGG PPCQGFSQIG KRALDDPRNA LISHFLRIVL ELKPKYFVIE NVKGLTVGKH QIFLEEVINK LSKNSYQLQL PYQVLNAANY GVPQHRERLF ILGCKKGLKL PNYPQIQIHK KSEAYINVWD AIGDLPEVEN YPELLEIDWV KAENDYDKPS EYAKKLRGIE YFNNDYSYER EYDQTILTSS LRTKHTQQSI ARFDATAQGK TEPVSRFYKL NPHGICNTLR AGTPSSRGSY TSPRPIHPLT PRCITVREAA RLHSYPDWFR FHVTKWHGFR QVGNSVPPLL AKAVAQEIIH ALNIQPFKPT KTEYELENLK NLELNMSRAA DLYGIDRYTI APRLRKLKLN N
|
| |