Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1922 |
Symbol | |
ID | 5708275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2218415 |
End bp | 2219620 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641271427 |
Product | integrase family protein |
Protein accession | YP_001536798 |
Protein GI | 159037545 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.612381 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00155149 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAGTC CTCAACACCC GCCAATAGGC GTACGGCTTG CGCCTGATGT GGAGTTCCGG CCTGGCCGAG AAAACTCCTA CCGGGCACGG GTCCGGTGGA TCGATCCGGC CACGAAGCGC CGTCTGTCCA AGTCAACGAG TGTGGCCACG TCGGAGGAAG CGCAAGCCTG GATCGATGGG CTCATGAGTG CCGCGCAAGG TGGCATCGAC CCGACCGCCG CCACCAAGCG GCTAACCGAG TATGGCGAGA GTGTAATGAC GCTGGCCCTA CGCGGGCTGG AAGGCAAGAC GCTCGATCCG TATCTGGCTG GGTGGCGGAA ACGGGTTGTT CCCACGCTCG GTCACATCCC GGTTCGCATG ATTACCAATG GCGCGGTTGA CCGCGCTGTA CATAGCTGGA TTGCCGACGA ATGCAGCCGC TCGACGGTGA AGAACAGCCT CGCCGTTCTG GTTCGCGTGA TGGAACAGGC GGTGCGGGAC GGCATCATCG CTCGCAATCC CGCCCAGGTC ACGGGATGGC AGCGCGAATA CCAGCAAGCC GAGGACGAAT TGGACGATCC CCGCTCGCTG GCGCTCTCCG ATTGGGAGGC GCTAACCGCA CTCGCTGCCG CGTTGGTCGA ACGGTCGGCC AACGCCTTCA CCGGGTGGGC GGACGTGGTG ATTTTCGCTG CCTGCACCGC CGCGCGAATA GGCGAGGTAT CGGGCGTTCG GGCCGAGGAC ATCAACCGGG ATACGTGGAT GTGGACCGTG CGCCGGCAGA CCACGCCCGG CCCCGGTGGC CTGATCGATA AGGGCACCAA GGGCAAGCGC GCCCGGATGG TTCCGCTGAT CGAGGAAGTG CGGCCGCTCG TGACGCACCG CCTGGGGGTG GCGACCAAAC CCGACGCACG GCTGTTTACC GGCCCGCGCG GTGGCCGTAT TTCCACGGCC GTTCTCCGCG ACGCGACTCA TTGGGATGAG GTGGTGACGA AGCTCGGCTA CGAGCACCTA CGCCGACACG ACCTGCGGCA CACCGGGTTG ACCTGGATGG CCGACGCTGG CGTGCCGGTG CACGTCCTGC GGAAAATCGC CGGACACGGG TCGCTCACCA CGACCCAGCG ATACCTACAC CCCGACCGAC AGGCGATCAC GGACGCCGGC ACGGCGCTCA GCGCCCACTT GAAGGCCCGC CGGTCCCCAG GTGGTCCCCA GCTACGCGCC GTCTAG
|
Protein sequence | MASPQHPPIG VRLAPDVEFR PGRENSYRAR VRWIDPATKR RLSKSTSVAT SEEAQAWIDG LMSAAQGGID PTAATKRLTE YGESVMTLAL RGLEGKTLDP YLAGWRKRVV PTLGHIPVRM ITNGAVDRAV HSWIADECSR STVKNSLAVL VRVMEQAVRD GIIARNPAQV TGWQREYQQA EDELDDPRSL ALSDWEALTA LAAALVERSA NAFTGWADVV IFAACTAARI GEVSGVRAED INRDTWMWTV RRQTTPGPGG LIDKGTKGKR ARMVPLIEEV RPLVTHRLGV ATKPDARLFT GPRGGRISTA VLRDATHWDE VVTKLGYEHL RRHDLRHTGL TWMADAGVPV HVLRKIAGHG SLTTTQRYLH PDRQAITDAG TALSAHLKAR RSPGGPQLRA V
|
| |