Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3518 |
Symbol | |
ID | 5704646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4059009 |
End bp | 4060766 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641272945 |
Product | hypothetical protein |
Protein accession | YP_001538311 |
Protein GI | 159039058 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.663453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00905238 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCACCGC AGGAGTATGT CCAGGAGTCG TTGGCCGGTA TCGATCCGGC CGTCGGCGGA GTTGACCCGG AGCTACCGCT CTACGCGACC ACCTTCGTGG TGGTCGACCT CGAAACCACC GGCGGTGCGC CGGACGGCGG TGGCATCACC GAGATCGGCG CGGTGAAGGT ACGCGGCGGT GAGCAGCTCG GTGTGCTCGC CACCCTGGTC AACCCCGGCG TGCCGGTGCC ACCCTTCGTC ACCGTGCTCA CCGGCATCAC CCAGGCGATG CTGCTGCCTG CCCCGTCGAT CGCCCAGGTG CTGCCGAGCT TCCTGGAGTT CCTCTCCGCC GACGCGGTCC TCGTCGCCCA CAACGCCCCA TACGACGTGG GGTTCCTCAA GGCGGCGTGT GCGGCGCACG GCCACCGCTG GCCGAATCCG CGGATACTGG ACACGGCGGC GCTGGCCCGG CGGGTCCTCA GCCGGGACGA GGTGCCCAAC CGCCGGCTCG GCACCCTCGC GGCATACTTC CGCACCAGCA CCCAGCCCAC CCATCGGGCA TTGGACGACG CGAAGGCCAC AGTCGACGTG CTGCACGGGT TGATTGCCCG GCTGGGCGGC CACCGGGTGC ACACCATCGG CGAGGCGATC GAGTTCGCCC GGGCGGTCAC CCCCACGCAG CGTCGAAAGC GGCACCTCGC CGAGGGGCTA CCCCGTACGC CCGGCGTCTA CCTCTTCCGG GGCGCCGACG ACCGCCCGCT CTACGTCGGC ACCTCGGTTG ACATCGCCAC CCGGGTCCGC AGCTACTTCA CCGCCGCCGA GAAGCGGGCC CGCATCTCGG AGATGCTCGC CGCCGCCGTG CGGGTGGAGG CGGTGGAGTG CGCTCACTCG CTCGAGGCAG AGGTGCGGGA GCTGCGACTG ATCGGGGCGC ACGCCCCGCC GTACAACCGG CGGTCCAAGT TCCCCGAGCG GGTGGTCTGG CTGAAGCTGA CCGACGAGGC GTACCCACGG TTGTCGGTGG TTCGCAAGCT GACCCCGGGT GACGAGGCAT ACCTCGGACC CTTCACCTCC CGCCGTGCCG CGGAGCTCGC CGCCGCCGGC TTCCACGACG CCATGCCGCT GCGGCAGTGC ACGCACCGAC TGTCGGTGCG GACGGTCACG CCGGCCTGCG CGTTGGCGGA GTTGGGTCGC TGCCCGGCGC CCTGCGAGCA TCGGATCACC CCCGACGAGT ACGCGAACCG AGCGGTGACG CCCTTCCGCA CGGCGTGCCG CGGCGACCCG CAGGGCGTGG TGGATGCGTT GCTTGGCCGG ATCGAGACGC TCTCGACGGG ACAACGCTAC GAGGAGGCCG CGGTGGTGCG GTCCCGGCTC ACCGCCGTGC TCCGCGCCAC GACCCGGATG CAGCGCCTCG CCGCGCTCAC CGGGATCGGC GAGGTGGCGG CTGCCCGGCC GGCGGTGGGC GGAGGCTGGG AGCTGGCGCT GGTGCGGTAT GGACGGCTCG CCGGTGCCGG TGTGTCACCG CCGGGTGTCC ACCCGCGACC GACGATCGCC ACGATTCGGG CTACTGCGGA GACAGTGACC CCGGGGTTGG GACCGACTCC AGCTGCCAGC GCCGAGGAGA CCGAACGCAT CTTGTCCTGG TTGGAACGTC CGGAGACGAG ACTGGTGGAG ATGTCCTCCG GCTGGGCCTC CCCGGCGACT GGGGCGGGCC GGTTCCAGAC TCTGCTGATG AAGGCGACGA ACGCCGCTTC CCACCAACTA TCATCCGAAA GCCCATGA
|
Protein sequence | MAPQEYVQES LAGIDPAVGG VDPELPLYAT TFVVVDLETT GGAPDGGGIT EIGAVKVRGG EQLGVLATLV NPGVPVPPFV TVLTGITQAM LLPAPSIAQV LPSFLEFLSA DAVLVAHNAP YDVGFLKAAC AAHGHRWPNP RILDTAALAR RVLSRDEVPN RRLGTLAAYF RTSTQPTHRA LDDAKATVDV LHGLIARLGG HRVHTIGEAI EFARAVTPTQ RRKRHLAEGL PRTPGVYLFR GADDRPLYVG TSVDIATRVR SYFTAAEKRA RISEMLAAAV RVEAVECAHS LEAEVRELRL IGAHAPPYNR RSKFPERVVW LKLTDEAYPR LSVVRKLTPG DEAYLGPFTS RRAAELAAAG FHDAMPLRQC THRLSVRTVT PACALAELGR CPAPCEHRIT PDEYANRAVT PFRTACRGDP QGVVDALLGR IETLSTGQRY EEAAVVRSRL TAVLRATTRM QRLAALTGIG EVAAARPAVG GGWELALVRY GRLAGAGVSP PGVHPRPTIA TIRATAETVT PGLGPTPAAS AEETERILSW LERPETRLVE MSSGWASPAT GAGRFQTLLM KATNAASHQL SSESP
|
| |