Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1438 |
Symbol | |
ID | 5708061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1664025 |
End bp | 1665752 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641270947 |
Product | type III restriction protein res subunit |
Protein accession | YP_001536328 |
Protein GI | 159037075 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00286846 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCAGCCC GGACGCCAAC GCTCGACACG TTCCCGGCCC TGCGTGCCTG GCAGCGCAGG GCGCTGGTGG AGTACCTGCG CCGGCGGGAG CCGGACTTCA CGGCGGTGGC CACGCCGGGC GCCGGCAAGA CCACCTTCGC CCTTCGGGTC GCCGCCGAGT TGCTCGCCGA CGGGAGCGTC GAGGCGGTCA CCGTGGTCGC GCCGACCGAG CACCTCAAGG CCCAGTGGGC GCAGGCCGCC GCCCGGGTCG GCATCCAACT CGACGCGGCC TTCCGTAACG CCGACCTGCA CTCGTCGGCC GACTTCCACG GGGCCGTGGT CACCTACGCC CAGGTCGGGA TGGCGCCGCA GGTGCACCGG CGGCGTACCC TGACCCGGCG CACCCTGGTC GTCCTCGACG AGATCCACCA TGCCGGCGGT TCCCGGACCT GGGGTGACGG TGTGCAGGCG GCCTTCGAAG GCGCGGAGCG CCGGCTCATG CTCACCGGCA CGCCGTTCCG TTCCGATGAC AACCCGATTC CGTTCGTCAG CTACGAGCGG GGCGGCGACG GCCTGCTGCG CTCCCGCGCC GACTCTGTCT ACGGCTATGC CGACGCGCTG CACGACGGCG TGGTCCGCCC GGTGCTCTTC CTCGCCTACT CGGGGGAGAC CCGGTGGCGG ACGAATGCCG GTGAGGAACT GGCCGCCCGG CTCGGCGAGC CGATGACCCA GGATCTGATC GCGCAGGCGT GGCGGACCGC GCTCGACCCG GCCGGCGACT GGATGCCGCA GGTGCTGCGG GCGGCGGACG CCCGGCTGAC CGTGCTCCGC AACGCCGGGA TGCCCGACGC CGCCGGCCTG GTGATCGCCA GCGACCAGCA GACCGCCCGT TCGTACGCGA AGCTCATCGA GCAGGTGACC GGCGAGAAGG CCGCCGTGGT GCTCTCCGAC GACGCGGGTG CCTCGGCCCG GATCGCGACG TTCGCGACCG CCGAGCAGCG TTGGCTGGTG GCGGTCCGGA TGGTTTCCGA AGGCGTGGAC ATCCCCCGTC TCGCCGTCGG TGTCTACGCC ACCAGCGCCA GCACCCCGCT CTACTTCGCC CAGGCCATCG GGCGGTTCGT CCGGGCACGG CGGCCGGGGG AGACGGCATC GGTGTTCCTG CCCAGCGTCC CACACCTGCT CGGCCTCGCC AGCGAGATGG AAGCCGAGCG GGACCATGTG CTGGGTAAAC CGAAGGACCA GGACGGTTTC GACGACGACC TGCTGGAGCG CGCCCAACGG GACGACCAGG CCAGCGGTGA ACTGGAGAAG CGGTACGCCG CGCTCTCCGC GACCGCCGAG CTGGATCAGG TGATCTTCGA CGGCGCGTCG TTCGGCACCG CTGCCCAGGC CGGTACGCCC GAGGAGGAGG AGTATCTCGG CCTCCCCGGG CTGCTCACCG CCGACCAGGT GGCCATGCTG CTGTCCAAGC GGCAGGCCGA GCAACTGGCC GCGTCGCGGC GCAGGACCGC CGCCCGGCCC GTTGAACCGG CCGCGACGAC CGCGCCACCG GCACCGATGA GTGCGGCGCA ACGCCGGGTG GCACTGCGCC GACAGTTGAA CGCCCTGGTG GCCGCCCGAC ACCACCACAC CGGTCAGCCG CACGGCAAGA TCCACGCAGA GCTGCGCCGC CGCTGCGGCG GCCCGCCCAG TGCCCAGGCG ACGATCGAGC AGCTGGAGGA ACGGATCGCC ACGGTGCAGA CCCTCTGA
|
Protein sequence | MAARTPTLDT FPALRAWQRR ALVEYLRRRE PDFTAVATPG AGKTTFALRV AAELLADGSV EAVTVVAPTE HLKAQWAQAA ARVGIQLDAA FRNADLHSSA DFHGAVVTYA QVGMAPQVHR RRTLTRRTLV VLDEIHHAGG SRTWGDGVQA AFEGAERRLM LTGTPFRSDD NPIPFVSYER GGDGLLRSRA DSVYGYADAL HDGVVRPVLF LAYSGETRWR TNAGEELAAR LGEPMTQDLI AQAWRTALDP AGDWMPQVLR AADARLTVLR NAGMPDAAGL VIASDQQTAR SYAKLIEQVT GEKAAVVLSD DAGASARIAT FATAEQRWLV AVRMVSEGVD IPRLAVGVYA TSASTPLYFA QAIGRFVRAR RPGETASVFL PSVPHLLGLA SEMEAERDHV LGKPKDQDGF DDDLLERAQR DDQASGELEK RYAALSATAE LDQVIFDGAS FGTAAQAGTP EEEEYLGLPG LLTADQVAML LSKRQAEQLA ASRRRTAARP VEPAATTAPP APMSAAQRRV ALRRQLNALV AARHHHTGQP HGKIHAELRR RCGGPPSAQA TIEQLEERIA TVQTL
|
| |