Gene Sare_4783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4783 
Symbol 
ID5704450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5415023 
End bp5416696 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content71% 
IMG OID641274181 
Producthypothetical protein 
Protein accessionYP_001539527 
Protein GI159040274 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000350269 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAGACACA GGGCCGCTCT GCTGGACCTC CATCCCGCCA AACCCGCCCC CGCCATATCG 
GCCGCCACGC CGTCGGCTGC ACGGCGACTG CCCACCCGCG CCGTCGTCGC CACCGGTTGT
ATCGGTAGCG CCACGACGCT GGGCGCCACC GCTGCCGTCG GCCTCCCGCT GTTCGCACGG
GTGACCCTCG GGCTGGTCGG CATGATGCTG GTAGTGGTCG CCTGGCTTCT GCTGGGGCGC
GAGCACCGGC AGCTGACGGT GGTTTCCCTG CACACGATCG CGGTGGTGTG GACCCTGCCC
TTCCTGGTGG CCGCGCCGCT GTTCAGTGGC GATGTATGGA GTTACCTCGC CCAGGGCGCG
ATCGCGGCGA GTGGGCAGGA CCCGTACACA ATCGGACCGT TGCCGGGGCT CGGTGCCGCC
CATCCCGCGA CCCAGCAGGT CAGCCACTAC TGGATCGAGA CACCGGCGCC GTACGGGCCG
GCCTGGCTGG CCCTGTCCCG GTCCGTCGCC GCCCTGACCG GAGAGCACCT GATCGCTGGG
GTGCTGCTCT ACCGGCTCAT CGCGATCGTC GGCGTCGTCC TTGTCGCCTG GGCGCTTCCC
CACCTCGCCC GGCGGGCCGG CGCCGATCCG TCCACCGCCC TCTGGCTGGG CCTGCTCAAC
CCGCTGGTGA TCTGGCATTT CGTGGCTGGT GTCCACAATG ACGCCCTCAT GATGGGACTA
CTGCTCTGCG GCACGGAACT GGTGCTGCGC GGCATGGAGC GCACGGGCGC CGCCGCGGTC
CTGCAGTTCG GCGGAGGCCT CACGGCGCTC ACGGTCGCGG CGAACATCAA GTTCGTCGGC
GTCGCCGCGG TGGGTTGCCT GGCCGTCCAT CTTGCCCGTC GCCGCATCGG CGGGATGCCG
GCCGTGCTGG TCATGGCACT GGGCAGCACA GCGATCACCC TGGCGGTCTC GCACGGCACC
GGCCTCGGCT TCGGCTGGAT CGGCGCCATC CGACAGAGCA CCGCCGTGCA CAGCTGGCTG
GCACCGACGA ACATTGCCGG CTTCCTGGCC GGTGGGCTCG GCAGGCTCGC CGGCCGGGAC
ATCACCACCG TGGCCATCCA GGTCGCGGTA GTCATCGGCG TGGTGCTCGC TGTCATCATC
GTGGCGGCCC TGCTGAGGGC GATATCCCGA GGCGGAGTGG CCCCGGTACG CGGTCTCGGC
CTGATCTTTG CCGCCGTGGT CGCGTGCGGG CCGGTCGTCC ATCCCTGGTA TCTGCTGTGG
GCGGTGCTGC CGCTCGCCGC GACGGCTCGG TCCCACCGCA GCCGCTCGAT CCTCACCGCC
ATCAGCGCGA TCACCGCGAT GGCCCTGCCA CCGGTCGGGA CCGGCGCCCT ACCGCTGACC
GTCGGCTACC TCGGCGCCGT CCTTCTCCTC GGGAGCGCGG CCCTGCTGCT GCATCGTCGG
GGCCTTGACC TGCAGCTGAC ATCACCGATG ATCTTCCTGC GCCGTGCCCT GGTCACCGCT
ACGGTCCCGG GCGCACCGGC GCGCAGGCTC CTCGGCACCC TGGCGTCAGG CACCGAACGG
TCTGACTGCC CAGAGATCGG CATTCGCACG AGAGGTGCGG GCGACGGCCC ACCGGGCTCA
GCCGCCACGT CGATGTCCTC GAAGCAGCGG GGCAGGTTCG GGCCCGAGCG CTGA
 
Protein sequence
MRHRAALLDL HPAKPAPAIS AATPSAARRL PTRAVVATGC IGSATTLGAT AAVGLPLFAR 
VTLGLVGMML VVVAWLLLGR EHRQLTVVSL HTIAVVWTLP FLVAAPLFSG DVWSYLAQGA
IAASGQDPYT IGPLPGLGAA HPATQQVSHY WIETPAPYGP AWLALSRSVA ALTGEHLIAG
VLLYRLIAIV GVVLVAWALP HLARRAGADP STALWLGLLN PLVIWHFVAG VHNDALMMGL
LLCGTELVLR GMERTGAAAV LQFGGGLTAL TVAANIKFVG VAAVGCLAVH LARRRIGGMP
AVLVMALGST AITLAVSHGT GLGFGWIGAI RQSTAVHSWL APTNIAGFLA GGLGRLAGRD
ITTVAIQVAV VIGVVLAVII VAALLRAISR GGVAPVRGLG LIFAAVVACG PVVHPWYLLW
AVLPLAATAR SHRSRSILTA ISAITAMALP PVGTGALPLT VGYLGAVLLL GSAALLLHRR
GLDLQLTSPM IFLRRALVTA TVPGAPARRL LGTLASGTER SDCPEIGIRT RGAGDGPPGS
AATSMSSKQR GRFGPER