Gene Sare_4110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4110 
Symbol 
ID5707661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4668182 
End bp4671163 
Gene Length2982 bp 
Protein Length993 aa 
Translation table11 
GC content66% 
IMG OID641273538 
Producthypothetical protein 
Protein accessionYP_001538891 
Protein GI159039638 
COG category[S] Function unknown 
COG ID[COG1615] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0418433 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAGCA GCGCCCCCAT GCCGAGGATG AGCCGACGCG GACGCGTCAC GATTGGTGTC 
CTGGTCGGGG TGTTCGTGCT CTTCACCCTG CTCGGCTGGG GTGTGCAGGC CTGGACCGAC
TGGCTCTGGT TCGGCAAGGT CGACTACACC GAGGTCTTCT CCGGGGTGCT CGTCACCCGG
CTGCTGCTCT TCGTCACGGT CGGCCTCGCC ATGGCGGTGG TCGTGGGCGG CAATCTCTGG
CTGGCGCACC GACTGCGGCC CCGGCTGCGA CCGCAGTCGC CGGAGCAGGC CACCCTGGAG
CGCTACCGGA TGCTGCTGAG CCCTCGGCTC GGCACCTGGT TCGCGGTGGT CTCCGTGGTG
GTCGGCCTCT TCGCGGGGCT GTCGGCGCAG AGCAGGTGGA GTCAGTGGCT GTTGTTCCGC
AACGGCGGGG ACTTCGGGGT CAAGGATCCG GAGTTCGGGA TAGACATCGG CTTCTACGTG
TTCGACCTGC CCTTCTGGCG CTACCTGCTG GGGGTGGCCT TCACCGCCGT GGTGCTGGCC
CTGATCGGGG CACTCGCGGT GCACTACGTC TTCGGCGGGG TCCGGCTCCA GGGCGTGGGC
GATCGGATGA GCAACGCGGC GCGGGCTCAC CTGAGCGCGC TGGTCGCGGT CTTCGTGCTA
CTCAAGGCCG TCGCGTATGT GCTCGACCGG CGGACGATGC TGCTGGAGTA CAACGACGGT
GCCAACGTGT ACGGCGCCGG CTACGCCGAC ATCAACGCGC TGCTGCCGGC GAAGGAGATC
CTCGCCTACA TCTCGGTCGT CGTGGCGATC GCGGTCCTCG TCTTCTCCAA CGCCTGGATG
CGGAACCTGG TCTGGCCGGG CATCTCGCTG GCCCTGCTCG GAGTCTCCGC GGTCGCCATC
GGCGGCATCT ACCCGTGGGC TGTGCAGACC TTCGAGGTGA AGCCGAGTGC CCGCGACAAG
GAAGCGCGGT ACATCGAGCG CAGCATCGAG GCGACCCGTG CGGCCTTCAA CCTGGGCGGG
GTCGAGACCA GGCGGTATGC GGCGAGTAAC CTTCAGCCAC CAGCGAGCCT GGCCACCGAC
ACGGCGGTGG TGCCGAACGC CCGGCTGCTG GATCCACAGC TGGTCAGCGA GACGTACACG
CAGCTCCAGC AGGTCCGCGG CTTCTACGAC TTCGGCCCCA AGCTCGACAT CGACCGCTAT
GCCGTCGAGG GCAAGACCCA GGATTACGTG GTCGGCGTCC GCGAGATCAA CTACGGCGAG
CTGACCGCCC AGCAGAGCAA CTGGATCAAC CGGCACACCG TCTATACCCA TGGTTACGGC
CTGGTCGCGG CCCCGGCGAA CCGGGTGGTC TGCGGCGGCC AGCCCTACTT CGTCTCCGGC
TTTCTCGGTG ATCGATCGCA GGAGGGGTGT GCCGCGCCGA CCGATCAGAT CCCGGCCAGC
CAGCCGCGGA TCTACTACGG CGAGCGGATG GAGGCCGGCG ACTACGCCAT CGTCGGTAAG
TCGAACCCGG ACGCCAACCC CGCCGAGTTC GATCGGCCGG TCGGCGAGGG CGACGACGGG
GCCGAGTCCT ACTACACCTA CACCGGCTCC GGCGGCGTCG AGATCGGGTC GTTCAGCCGT
CGTCTGCTCT ACGCCATCAA GGAGCAGGAA TCGAACTTCC TGCTCTCTGA GGCGGTCAAC
GAGAATTCGA AGTTGCTCTA CGTCCGTAAT CCGCGCGAGC GGGTGGAGAA GGTCGCTCCG
TTCCTCACCG TGGACGGCGA CCCGTATCCG GCGGTGATCG ACGGCCGGGT GACCTGGATC
ATCGATGGCT ACACGACGGC TGCGACCTAT CCCTACGCAG AGCGGATCAA CCTACAGACC
GAGACCACCG ACGAGCTGAC CAACCGGGGC ACGTTCCAGC AGGCCCGGGA AAATATCAAC
TACATTCGTA ACTCGGTCAA GGCGACGGTC GACGCATACG ACGGCACGGT CACCCTCTAC
GAGTTCGATG ACGGCGACCC GGTACTCAGG GCGTGGAACA AGGCGTTCGG CGGCGATCTG
ATCAAGTCGA AGACGGAGAT CCCGGCCGAG TTGAGCGCCC ACTTCCGTTA CCCGGCGGAC
CTGTTCAAGG TGCAGCGGAA CGTGCTCACC CGATTCCACG TGACCAGCCC CGGCGACTTC
TACTCCGGGC AGGACTTCTG GCAGGTGCCG AACGTACCGG ACGCGCCGGA CAGCGGTCAG
AAGCAGCCAC CGTACTACCT CTTCACCCAG TTCCCCGGGC AGGAGGAAGC CCGCTTCCAG
CTCACCGCAG CGGTTACGCC GAACCGACGA CAGAACCTGG CAGCGCTGAT GTCCGGTTCG
TACGTGGATG GAAAGCCCCA GCTCGAGGTG CTGGAGCTGC CGGAAGACAC CCGGATCTCC
GGGCCGGTGC AGGTGCACCA GCAGATGACC AACAACGCGC AGATCCGGCA GCAGCTGAAC
CTGCTCTCGT CGAACCAGGC TCAGGTCCAG TACGGCAATC TGCTTTCGCT GCCGTTCGGC
GACGGCATGC TCTACGTCGA GCCGGTCTAT GTGAAGAGCA ACCAGCAGCA GGCGTATCCG
CTGTTGCAGA AAGTGCTCCT ATCCTACGGT GACGGCGGCT CGTTCGTCGT CCTGGCGGAC
AACCTCGCCG ACGGCATCAA ACAGCTGGTC GAACAGGGTG AGAAGGCCGG CGCACCGTCA
ACGCCCCCGC CGTCCGGTGA GACGCCCGCG CCGACCCCGA CCCCGACCCC AACCCCGTCG
AGTCCGAGCG TGACGCCGCC CCCGGTCACG GGCGAACTGG CGGATGCGGC GCAGCGGGTT
CAGGCGGCGA TCGTGGAACT GCGGGCCGCA CAGGAATCCG GTGACTTCGA ACGCTACGGC
CGGGCACTGA AGGCATTGGA TGAGGCCACC GCTGCCTTCG AGCAGGCCGC GGGGCCGGGT
TCCGCTGCTA CGCCCACCGG TTCACCGTCG CCTGGTGGCT GA
 
Protein sequence
MRSSAPMPRM SRRGRVTIGV LVGVFVLFTL LGWGVQAWTD WLWFGKVDYT EVFSGVLVTR 
LLLFVTVGLA MAVVVGGNLW LAHRLRPRLR PQSPEQATLE RYRMLLSPRL GTWFAVVSVV
VGLFAGLSAQ SRWSQWLLFR NGGDFGVKDP EFGIDIGFYV FDLPFWRYLL GVAFTAVVLA
LIGALAVHYV FGGVRLQGVG DRMSNAARAH LSALVAVFVL LKAVAYVLDR RTMLLEYNDG
ANVYGAGYAD INALLPAKEI LAYISVVVAI AVLVFSNAWM RNLVWPGISL ALLGVSAVAI
GGIYPWAVQT FEVKPSARDK EARYIERSIE ATRAAFNLGG VETRRYAASN LQPPASLATD
TAVVPNARLL DPQLVSETYT QLQQVRGFYD FGPKLDIDRY AVEGKTQDYV VGVREINYGE
LTAQQSNWIN RHTVYTHGYG LVAAPANRVV CGGQPYFVSG FLGDRSQEGC AAPTDQIPAS
QPRIYYGERM EAGDYAIVGK SNPDANPAEF DRPVGEGDDG AESYYTYTGS GGVEIGSFSR
RLLYAIKEQE SNFLLSEAVN ENSKLLYVRN PRERVEKVAP FLTVDGDPYP AVIDGRVTWI
IDGYTTAATY PYAERINLQT ETTDELTNRG TFQQARENIN YIRNSVKATV DAYDGTVTLY
EFDDGDPVLR AWNKAFGGDL IKSKTEIPAE LSAHFRYPAD LFKVQRNVLT RFHVTSPGDF
YSGQDFWQVP NVPDAPDSGQ KQPPYYLFTQ FPGQEEARFQ LTAAVTPNRR QNLAALMSGS
YVDGKPQLEV LELPEDTRIS GPVQVHQQMT NNAQIRQQLN LLSSNQAQVQ YGNLLSLPFG
DGMLYVEPVY VKSNQQQAYP LLQKVLLSYG DGGSFVVLAD NLADGIKQLV EQGEKAGAPS
TPPPSGETPA PTPTPTPTPS SPSVTPPPVT GELADAAQRV QAAIVELRAA QESGDFERYG
RALKALDEAT AAFEQAAGPG SAATPTGSPS PGG