Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3286 |
Symbol | |
ID | 5706903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3795646 |
End bp | 3798168 |
Gene Length | 2523 bp |
Protein Length | 840 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641272713 |
Product | hypothetical protein |
Protein accession | YP_001538080 |
Protein GI | 159038827 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.577779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00794489 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCAACT ACCAAGCTCA GAACAACGCC ATCGACCCGG AGGCCGCGTC GAAACGACGA CGCAAGCTCT GGCTCGCCAG CGGCGTCGCC GGCCTCACCG GGGTGGTCAG CATCGCCGCA GTTGGCTTCA CCGGGGCCGG AGCCGGTGCC CGGGACCTGG CCTGGAACAC AGCCCAGATC ACCAAGAACG GCGAGCAGGT CGCCGACGAC CCTGGCGCGG GCCGCGTGGG CGGCTGGGAA GCCAACCACA GGGGCGACTC GGACGGCAAT CGAGACGACA GAAACCGCGG GAAGAAGAAG CGCGGGAAGG AGGTGCCCTG CAATACCGAC CAGCTGATCC AAGCGATCGT CTACGCCAAC AGCAACCAGG GCGGCGTACT CAAGCTGGCC AAGGACTGCA CCTACACGCT GACCCGTTCC GACAAACACG GCAACGGCCT GCCGATCATC AAGGAACCGA TCACGCTTGC CGGTTACAAC ACCAAGATCG TCCGGGACAG TCAGGCCGAC TTCTTCCGAA TTCTCAACGT CGGCCGGGGC GGGCACCTGA AGGTCGAGGG CCTGACCATC AAGGGTGGGA AGACCCTGGA GGGCGAGCGG GGCGCGAACA CGCCGGAGGC GGTGTGGTCG CGCTTCTCGA ACTCGGTCGC GGCGGGCGAG GCGGCCAAGG CCGGCAAGCC GTTCCTGCCG CTGTTGCAGG CCGAGCCGGG TGCCGCAGCT CGGGGCACCG ACGCGGCACG CGCCGCAGAA GGGGCACCAG CCGGGCGTGT GGCAGCAGCC ACCGACGCGG CACCGGCCCC CGTCTGGAAC GACCGGGCAC CAGCCGGCCG ATCGGCCACC GAAGGCGCAC CGGCTATCGA CGCGGCACCG GCTGCCGGAT GGACCGACCG GGCACCAGCC GCCGACAAGG CACCAGCCGC GGAAGGCGCA CCGGCCGACC GCCTGGCACC AGCCACCGAC GCGGCACCGG CCCCCGTCTG GACCGACCGG GCACCAGCCG CCGACAAGGC ACCAGCCGGC GGATGGGGCG ACAAGGCACC AGCCGCGGAA GGCGCACCGG CCGACCGACG GGCACCAGCC GCCGACAAGG CACCAGCCAC CGACGCGGCA CCGGCCCCCG TCTGGACCGA CCGGGCACCA GCCGCCGACA AGGCACCAGC CGGCGGATGG GGCGACAAGG CACCAGCCGC GGAAGGCGCA CCGGCCGACC GACGGGCACC AGCCGCCGAC AAGGCACCAG CCACCGACGC GGCACCGGCC CCCGTCTGGA CCGACCGGGC ACCAGCCGCC GACAAGGCAC CAGCCGGCGG ATGGGGCGAC AAGGCACCAG CCGCGGAAGG CGCACCGGCC GACCGACGGG CACCAGCCGC AGAACAGGCA CCAGCCACCG ACGCGGCACC GGCCCCCGTC TGGACCGACC GGGCACCAGC CGCCGACAAG GCACCAGCCG GCGGATGGGG CGACAAGGCA CCAGCCGCGG AAGGCGCACC GGCCGACCGA CGGGCACCAG CCGCAGAACA GGCACCAGCC ACCGACGCGG CACCGGCCCC CGTCTGGACC GACCGGGCAC CAGCCGCCGA CAAGGCACCA GCCGGCGGAT GGGGCGACAA GGCACCTGCC GCGAAGAAGG AGAAGCCCAA GGTCCTCCGG CACGACAGGT CAAGCGACGG CGCCGGTGTC CTCGTGCAGG CCGGCGGCAC CGCCGAGTTC GAGAAGTCCT ACCTGGAGCA CAACCTCGCC GGCGGTGTCG GAGGCGGGCT CGCCAACTTC GGCAAGACCA GCCTCTACCA CACCACCGTG GCCAACAATG ACGCCTTCCA CTACGGCGGC GGCATCTTCA ACGCCGGCGT GCTGCGGGTC TCCTCGTCCA GGGTGATGGA CAACGGCGCC GCTATCGGCG GTGGCGGCAT CGCCAACGGG GCGGCGTTCG TCGACCGCAA GGACATCGAG CCCGGTTATG TCACGGTCGA AAAGGCCGAG GTTACCGGTA ACGAGGTTCT CGGCTTCGGC GGTGGCCTGT TCGACATCGG TGGCGAAACG ACCGTCAAGC AGTCGATAGT CGCCAGGAAC CTGGCGCTCC TCGCCGGTGG CGGGATCGCG GCGGTCGAAG GGAGCAACCT CTACCTGAAG GAAATGGAGG TTGCCGACAA CACCACCATC GGTGACGGTG GAGGTCTGGC TCTGGCACTC GGTGCGATCG CCAACGTGGA CAAGAGCAAG GTCCGGGACA ACAAAGCCGG CGTCTTCGGG GGTGGCGTGT TCAACCTCCT CAGCGCGGTC ACGTTCCGCA ACAGCTCGGT CTCCGGCAAC CTCGCTGGTA TCTCCGGCGG CGGGATCTTC AACGTCGCGG GCAGTGTCGA CCTGACCGCC ACCAAGGTGA CGAAGAACCG CTCCACCCTC GAACCGGGTG GTGTGTTCAG CGCACTGGGC AAGGTGACGG TGGACAACAA GTCCGCGGTC AAGGGGAACG ACCCGACCAA CTGCAAGGGC AGCGTTGACC GGATCGAGAA CTGCTTCGGC TGA
|
Protein sequence | MSNYQAQNNA IDPEAASKRR RKLWLASGVA GLTGVVSIAA VGFTGAGAGA RDLAWNTAQI TKNGEQVADD PGAGRVGGWE ANHRGDSDGN RDDRNRGKKK RGKEVPCNTD QLIQAIVYAN SNQGGVLKLA KDCTYTLTRS DKHGNGLPII KEPITLAGYN TKIVRDSQAD FFRILNVGRG GHLKVEGLTI KGGKTLEGER GANTPEAVWS RFSNSVAAGE AAKAGKPFLP LLQAEPGAAA RGTDAARAAE GAPAGRVAAA TDAAPAPVWN DRAPAGRSAT EGAPAIDAAP AAGWTDRAPA ADKAPAAEGA PADRLAPATD AAPAPVWTDR APAADKAPAG GWGDKAPAAE GAPADRRAPA ADKAPATDAA PAPVWTDRAP AADKAPAGGW GDKAPAAEGA PADRRAPAAD KAPATDAAPA PVWTDRAPAA DKAPAGGWGD KAPAAEGAPA DRRAPAAEQA PATDAAPAPV WTDRAPAADK APAGGWGDKA PAAEGAPADR RAPAAEQAPA TDAAPAPVWT DRAPAADKAP AGGWGDKAPA AKKEKPKVLR HDRSSDGAGV LVQAGGTAEF EKSYLEHNLA GGVGGGLANF GKTSLYHTTV ANNDAFHYGG GIFNAGVLRV SSSRVMDNGA AIGGGGIANG AAFVDRKDIE PGYVTVEKAE VTGNEVLGFG GGLFDIGGET TVKQSIVARN LALLAGGGIA AVEGSNLYLK EMEVADNTTI GDGGGLALAL GAIANVDKSK VRDNKAGVFG GGVFNLLSAV TFRNSSVSGN LAGISGGGIF NVAGSVDLTA TKVTKNRSTL EPGGVFSALG KVTVDNKSAV KGNDPTNCKG SVDRIENCFG
|
| |