Gene Information |
Locus tag | Sare_4110 |
Symbol | |
ID | 5707661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4668182 |
End bp | 4671163 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641273538 |
Product | hypothetical protein |
Protein accession | YP_001538891 |
Protein GI | 159039638 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
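The coordinates and lengths in the table above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; both assumptions are inferred from the numbers and are not stated by the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4668182
end_bp = 4671163

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 2982 993
```

Under these assumptions the computed values match the listed gene length of 2982 bp and protein length of 993 aa.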
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0418433 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAGCA GCGCCCCCAT GCCGAGGATG AGCCGACGCG GACGCGTCAC GATTGGTGTC CTGGTCGGGG TGTTCGTGCT CTTCACCCTG CTCGGCTGGG GTGTGCAGGC CTGGACCGAC TGGCTCTGGT TCGGCAAGGT CGACTACACC GAGGTCTTCT CCGGGGTGCT CGTCACCCGG CTGCTGCTCT TCGTCACGGT CGGCCTCGCC ATGGCGGTGG TCGTGGGCGG CAATCTCTGG CTGGCGCACC GACTGCGGCC CCGGCTGCGA CCGCAGTCGC CGGAGCAGGC CACCCTGGAG CGCTACCGGA TGCTGCTGAG CCCTCGGCTC GGCACCTGGT TCGCGGTGGT CTCCGTGGTG GTCGGCCTCT TCGCGGGGCT GTCGGCGCAG AGCAGGTGGA GTCAGTGGCT GTTGTTCCGC AACGGCGGGG ACTTCGGGGT CAAGGATCCG GAGTTCGGGA TAGACATCGG CTTCTACGTG TTCGACCTGC CCTTCTGGCG CTACCTGCTG GGGGTGGCCT TCACCGCCGT GGTGCTGGCC CTGATCGGGG CACTCGCGGT GCACTACGTC TTCGGCGGGG TCCGGCTCCA GGGCGTGGGC GATCGGATGA GCAACGCGGC GCGGGCTCAC CTGAGCGCGC TGGTCGCGGT CTTCGTGCTA CTCAAGGCCG TCGCGTATGT GCTCGACCGG CGGACGATGC TGCTGGAGTA CAACGACGGT GCCAACGTGT ACGGCGCCGG CTACGCCGAC ATCAACGCGC TGCTGCCGGC GAAGGAGATC CTCGCCTACA TCTCGGTCGT CGTGGCGATC GCGGTCCTCG TCTTCTCCAA CGCCTGGATG CGGAACCTGG TCTGGCCGGG CATCTCGCTG GCCCTGCTCG GAGTCTCCGC GGTCGCCATC GGCGGCATCT ACCCGTGGGC TGTGCAGACC TTCGAGGTGA AGCCGAGTGC CCGCGACAAG GAAGCGCGGT ACATCGAGCG CAGCATCGAG GCGACCCGTG CGGCCTTCAA CCTGGGCGGG GTCGAGACCA GGCGGTATGC GGCGAGTAAC CTTCAGCCAC CAGCGAGCCT GGCCACCGAC ACGGCGGTGG TGCCGAACGC CCGGCTGCTG GATCCACAGC TGGTCAGCGA GACGTACACG CAGCTCCAGC AGGTCCGCGG CTTCTACGAC TTCGGCCCCA AGCTCGACAT CGACCGCTAT GCCGTCGAGG GCAAGACCCA GGATTACGTG GTCGGCGTCC GCGAGATCAA CTACGGCGAG CTGACCGCCC AGCAGAGCAA CTGGATCAAC CGGCACACCG TCTATACCCA TGGTTACGGC CTGGTCGCGG CCCCGGCGAA CCGGGTGGTC TGCGGCGGCC AGCCCTACTT CGTCTCCGGC TTTCTCGGTG ATCGATCGCA GGAGGGGTGT GCCGCGCCGA CCGATCAGAT CCCGGCCAGC CAGCCGCGGA TCTACTACGG CGAGCGGATG GAGGCCGGCG ACTACGCCAT CGTCGGTAAG TCGAACCCGG ACGCCAACCC CGCCGAGTTC GATCGGCCGG TCGGCGAGGG CGACGACGGG GCCGAGTCCT ACTACACCTA CACCGGCTCC GGCGGCGTCG AGATCGGGTC GTTCAGCCGT CGTCTGCTCT ACGCCATCAA GGAGCAGGAA TCGAACTTCC TGCTCTCTGA GGCGGTCAAC GAGAATTCGA AGTTGCTCTA CGTCCGTAAT CCGCGCGAGC GGGTGGAGAA GGTCGCTCCG TTCCTCACCG TGGACGGCGA CCCGTATCCG GCGGTGATCG ACGGCCGGGT GACCTGGATC 
ATCGATGGCT ACACGACGGC TGCGACCTAT CCCTACGCAG AGCGGATCAA CCTACAGACC GAGACCACCG ACGAGCTGAC CAACCGGGGC ACGTTCCAGC AGGCCCGGGA AAATATCAAC TACATTCGTA ACTCGGTCAA GGCGACGGTC GACGCATACG ACGGCACGGT CACCCTCTAC GAGTTCGATG ACGGCGACCC GGTACTCAGG GCGTGGAACA AGGCGTTCGG CGGCGATCTG ATCAAGTCGA AGACGGAGAT CCCGGCCGAG TTGAGCGCCC ACTTCCGTTA CCCGGCGGAC CTGTTCAAGG TGCAGCGGAA CGTGCTCACC CGATTCCACG TGACCAGCCC CGGCGACTTC TACTCCGGGC AGGACTTCTG GCAGGTGCCG AACGTACCGG ACGCGCCGGA CAGCGGTCAG AAGCAGCCAC CGTACTACCT CTTCACCCAG TTCCCCGGGC AGGAGGAAGC CCGCTTCCAG CTCACCGCAG CGGTTACGCC GAACCGACGA CAGAACCTGG CAGCGCTGAT GTCCGGTTCG TACGTGGATG GAAAGCCCCA GCTCGAGGTG CTGGAGCTGC CGGAAGACAC CCGGATCTCC GGGCCGGTGC AGGTGCACCA GCAGATGACC AACAACGCGC AGATCCGGCA GCAGCTGAAC CTGCTCTCGT CGAACCAGGC TCAGGTCCAG TACGGCAATC TGCTTTCGCT GCCGTTCGGC GACGGCATGC TCTACGTCGA GCCGGTCTAT GTGAAGAGCA ACCAGCAGCA GGCGTATCCG CTGTTGCAGA AAGTGCTCCT ATCCTACGGT GACGGCGGCT CGTTCGTCGT CCTGGCGGAC AACCTCGCCG ACGGCATCAA ACAGCTGGTC GAACAGGGTG AGAAGGCCGG CGCACCGTCA ACGCCCCCGC CGTCCGGTGA GACGCCCGCG CCGACCCCGA CCCCGACCCC AACCCCGTCG AGTCCGAGCG TGACGCCGCC CCCGGTCACG GGCGAACTGG CGGATGCGGC GCAGCGGGTT CAGGCGGCGA TCGTGGAACT GCGGGCCGCA CAGGAATCCG GTGACTTCGA ACGCTACGGC CGGGCACTGA AGGCATTGGA TGAGGCCACC GCTGCCTTCG AGCAGGCCGC GGGGCCGGGT TCCGCTGCTA CGCCCACCGG TTCACCGTCG CCTGGTGGCT GA
|
Protein sequence | MRSSAPMPRM SRRGRVTIGV LVGVFVLFTL LGWGVQAWTD WLWFGKVDYT EVFSGVLVTR LLLFVTVGLA MAVVVGGNLW LAHRLRPRLR PQSPEQATLE RYRMLLSPRL GTWFAVVSVV VGLFAGLSAQ SRWSQWLLFR NGGDFGVKDP EFGIDIGFYV FDLPFWRYLL GVAFTAVVLA LIGALAVHYV FGGVRLQGVG DRMSNAARAH LSALVAVFVL LKAVAYVLDR RTMLLEYNDG ANVYGAGYAD INALLPAKEI LAYISVVVAI AVLVFSNAWM RNLVWPGISL ALLGVSAVAI GGIYPWAVQT FEVKPSARDK EARYIERSIE ATRAAFNLGG VETRRYAASN LQPPASLATD TAVVPNARLL DPQLVSETYT QLQQVRGFYD FGPKLDIDRY AVEGKTQDYV VGVREINYGE LTAQQSNWIN RHTVYTHGYG LVAAPANRVV CGGQPYFVSG FLGDRSQEGC AAPTDQIPAS QPRIYYGERM EAGDYAIVGK SNPDANPAEF DRPVGEGDDG AESYYTYTGS GGVEIGSFSR RLLYAIKEQE SNFLLSEAVN ENSKLLYVRN PRERVEKVAP FLTVDGDPYP AVIDGRVTWI IDGYTTAATY PYAERINLQT ETTDELTNRG TFQQARENIN YIRNSVKATV DAYDGTVTLY EFDDGDPVLR AWNKAFGGDL IKSKTEIPAE LSAHFRYPAD LFKVQRNVLT RFHVTSPGDF YSGQDFWQVP NVPDAPDSGQ KQPPYYLFTQ FPGQEEARFQ LTAAVTPNRR QNLAALMSGS YVDGKPQLEV LELPEDTRIS GPVQVHQQMT NNAQIRQQLN LLSSNQAQVQ YGNLLSLPFG DGMLYVEPVY VKSNQQQAYP LLQKVLLSYG DGGSFVVLAD NLADGIKQLV EQGEKAGAPS TPPPSGETPA PTPTPTPTPS SPSVTPPPVT GELADAAQRV QAAIVELRAA QESGDFERYG RALKALDEAT AAFEQAAGPG SAATPTGSPS PGG
|
| |
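As a rough check on the Sequence fields, the sketch below strips the 10-base spacing used in this record, computes the GC fraction, and translates the opening codons. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on translation table 11 assigning the same amino acids to codons as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene are used.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Translation table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the whitespace that separates 10-base groups and upper-case."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence, copied with the record's spacing.
prefix = "ATGCGTAGCA GCGCCCCCAT GCCGAGGATG"
print(translate(prefix))              # MRSSAPMPRM
print(round(gc_fraction(prefix), 2))  # 0.67
```

The translated prefix matches the first ten residues of the listed protein sequence. Applying the same functions to the full gene sequence would be expected to give the full protein and a GC fraction close to the reported 66%, though that run is not shown here.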