Gene Information |
Locus tag | Sare_5005 |
Symbol | |
ID | 5705460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5673522 |
End bp | 5674382 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641274398 |
Product | DNA-(apurinic or apyrimidinic site) lyase |
Protein accession | YP_001539739 |
Protein GI | 159040486 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
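
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene span includes the terminal stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5673522
end_bp = 5674382


def gene_length(start, end):
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Codons in an unspliced CDS, minus the terminal stop codon (assumed included)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 861 286
```

Both results agree with the Gene Length (861 bp) and Protein Length (286 aa) fields.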
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.952143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000288771 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCCGAAC TACCGGAGGT AGAGGCGCTC GCCGATTATC TGCGTCGACG CGCCGTCGGC CGGCGGGTCG ACCGCCTGGA GATCGCCGCG ATCAGCGCCC TGAAGACGTA CGACCCGGCA ATCACGGCGG CGGCTGGTCG GCAGGTCACC GACGCCCGGC GGCTCGGCAA GTTCCTCGAC CTGGTGCTCG ACGCCGACCT GCACCTGGTG ATCCACCTTG CCCGCGCGGG CTGGCTACAG TTCCGGGAGG AGTTCCCGTC CCGTGGCCCG CTACGCCCCG GCAAGGGTCC GGTGGCACTG CGGGTACGCC TCGACGACGG CTCCGGCTTC GACCTGACCG AGGCCGGTAC GCAGAAGAGT CTCGCCATCT ACCTGGTGAC CGATCCGGCG GTGGTGCCCG GGGTGGCCCG GCTCGGCCCG GACGCGCTCG CCGTCGACCC GGCCACCTTC GCCGAGCGAC TCCGCGGCCG CAGGGGCCAG GTCAAGGGGG TACTCACCGA TCAGACGGTG CTCGCCGGGG TGGGCAACGC GTACTCCGAC GAGATCCTGC ACACGGCCCG GTTGTCACCG TTCGCGCTGA CCTCCCGGCT GACCGACGAC CAGCTGGCCG CCCTGCACAC GGCGACCCGG GACGTGCTCG GCGAGGCGGT GTCCCGGTCG GTGGGACAAC GGGCTGCGGA GCTCAAAGGC GAGAAGCGCT CCGGGCTACG GGTACACGCC CGAACCGGGC TGCCCTGCCC GGTCTGTGGT GACACCGTAC GGGAGGTCTC CTTCGCTGAT TCGAGCCTCC AGTACTGTCC GGCGTGCCAG ACCGGCGGAA AGCCGCTCGC GGACCGACGG TTGTCCCGAC TTATACGGTG A
|
Protein sequence | MPELPEVEAL ADYLRRRAVG RRVDRLEIAA ISALKTYDPA ITAAAGRQVT DARRLGKFLD LVLDADLHLV IHLARAGWLQ FREEFPSRGP LRPGKGPVAL RVRLDDGSGF DLTEAGTQKS LAIYLVTDPA VVPGVARLGP DALAVDPATF AERLRGRRGQ VKGVLTDQTV LAGVGNAYSD EILHTARLSP FALTSRLTDD QLAALHTATR DVLGEAVSRS VGQRAAELKG EKRSGLRVHA RTGLPCPVCG DTVREVSFAD SSLQYCPACQ TGGKPLADRR LSRLIR
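
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not the database's own method: it strips the spacing, computes a GC fraction, and translates a CDS using the standard amino-acid assignments, which translation table 11 shares with table 1. The helper names are hypothetical, and `START_CODONS` lists only a subset of the table 11 initiator codons. A first codon from that set, such as the GTG that opens this gene, is read as M.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the alternative start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = "GTGCCCGAAC TACCGGAGGT AGAGGCGCTC"
print(translate(prefix))  # MPELPEVEAL
print(round(gc_fraction(prefix), 2))
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` allows the same kind of comparison against the GC content and Protein sequence fields.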