Gene Information |
Locus tag | TM1040_3540 |
Symbol | |
ID | 4075218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 579025 |
End bp | 580188 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 638005054 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_611773 |
Protein GI | 99078515 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| |
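The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for TM1040_3540.
length = gene_length_bp(579025, 580188)
print(length, protein_length_aa(length))  # 1164 387
```

Under these assumptions the computed values agree with the reported 1164 bp and 387 aa.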
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCAG TGGCCCTCGG TGAGTTGGTT GAAATCAGAG GCGGCGGCAC GCCGGACAAG AAGGTTCCAG ATTACTGGGA CGGCGATATT CCTTGGGCAT CTGTGAAGGA CTTCAAAAGC ACATCGCTCG CCTCAACCAT CGACCGGATA ACGCAAGCGG GTGTCGCGAA CTCTGCCACT CAAGTTATCC CTGCTGGAAA CATTATCGTC CCGACGCGTA TGGCGGTTGG TAAGGCAGCA ATCAATGAGA TCGACCTAGC GATCAATCAA GACCTGAAGG CGCTGATCCC AAGCCAACGG ATCGACCGCC AATACCTTCT GCATGCCCTT CTAGCCAACG CAAAGACGCT CGAAGACCAA GCCACAGGGG CTACCGTGAA GGGCATCAAG CTTGATGCTT TGAGATCTTT ACAAATCCCC CTCCCTCCGT TGCAGGAGCA GAGGCGGATT GCGGGAATAC TGGATCAGGC AGACGCGCTC CGCCGCTTCC GCACCCGCGC CCTCGACAAA CTGGGCACCC TCGGCCAGGC GATCTTTCAT GAGATGTTTG GGGCAAGCTC TCCTGACCAT GCTGCGTGGG AGAAAATCAA TCTGAGCGAA CTGGTTCTGC CTGATGACCG CATCAATTAT GGTGTCGTAC AGCCTGGCCC TCACGACCCC GAAGGGGTTC CGATCATTCG TGTCGCCGAT CTAGCGAGCC CGGTGGTCGC TTTTGATTCA ATCAAACGGA TCGCCCCGAG CATTGATGCA GAGTACGGGC GTTCAAGATT GAAGGGCGGT GAAGTGCTAA TCGGCTGCGT CGGTTCGATT GGCACGACAA TCATCGCTCC TCCAGAGTTC GCAGGAGCAA ATGTTGCTCG CGCGGTTGCG CGTGTTCCCC TCGACACCAG CAGATGTGAA CCGAGGTTTG TCGCTGAGCA ACTACGATCT CAGCGGATAC AAAATTACTT CACAAAAGAG GTCCGGCTTG TTGCGCAGCC CACCCTGAAC ATCAAGCAAA TTCGCGAGAC AGAAATCATT CTTCCGCCAA AGGAGCTGCA GGTTTCGTTT GTTGAACGTG TTCATGAAAT CGAAGCCCAA AAAGCCCAGC ACGCAGCAGC TCTGACGGCA TGCGACGTGC TCTTTGCTTC CCTTCAGTCA ACGGCTTTCC GGGGGGAAGT GTAA
|
Protein sequence | MSSVALGELV EIRGGGTPDK KVPDYWDGDI PWASVKDFKS TSLASTIDRI TQAGVANSAT QVIPAGNIIV PTRMAVGKAA INEIDLAINQ DLKALIPSQR IDRQYLLHAL LANAKTLEDQ ATGATVKGIK LDALRSLQIP LPPLQEQRRI AGILDQADAL RRFRTRALDK LGTLGQAIFH EMFGASSPDH AAWEKINLSE LVLPDDRINY GVVQPGPHDP EGVPIIRVAD LASPVVAFDS IKRIAPSIDA EYGRSRLKGG EVLIGCVGSI GTTIIAPPEF AGANVARAVA RVPLDTSRCE PRFVAEQLRS QRIQNYFTKE VRLVAQPTLN IKQIRETEII LPPKELQVSF VERVHEIEAQ KAQHAAALTA CDVLFASLQS TAFRGEV
|
| |
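The gene sequence is listed in the coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly and compared with the protein sequence, and its GC content can be compared with the reported 55%. The following is a minimal Python sketch. It assumes the space-separated 10-base blocks are purely a display convention, and it uses the standard codon assignments, which translation table 11 shares for non-initiator codons. The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAGTTCAG TGGCCCTCGG TGAGTTGGTT"
print(translate(first_blocks))  # MSSVALGELV
```

Passing the full Gene sequence field to `translate` and `gc_percent` should allow a comparison with the Protein sequence and GC content fields. Because this sketch does not special-case alternative start codons, it would not be sufficient for genes that begin with GTG or TTG.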