Gene Sare_2501 details

Gene Information

Locus tag: Sare_2501
Symbol:
ID: 5703951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2858664
End bp: 2859779
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 69%
IMG OID: 641271965
Product: hypothetical protein
Protein accession: YP_001537335
Protein GI: 159038082
COG category:
COG ID:
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
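
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 2858664
end_bp = 2859779

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.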


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.942803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0300138
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATCA ATCGCCGTAC CCTGCTCGGT CGGGTCGCCG TGGTCGGCAC GGGTATTGCG 
GCCGGTGGGC TGCTTGCTCC CGACGCGGCC CGAGCCGCGT TCTGGAAGAA GCGCCTCACC
GGCGCCGACC TGGACACCAA CCGCCGGTGG CAGATCGCCG GGACCGACCT CGGCATCCCC
TACGTACTGG AGAACGGCTC CATCGGGTAC CTCCTCGGCG ACACCTTCAA CACCCCGTGG
CCTGAGGGCC CGCCGCTGCC CAACGACTGG CGCTCACCGG TGATGCTGCG CTCCCACGCC
CACCCTGGCG CCGCCGACGG TGTGGTCTTC GACAACGCCG CCGGGGTGCT CGGCGACGGG
CGGGCGCCGG AGCTGATGCA CAACGGCCAC CGGGGCATCG GCATCGACGG CCTCTGGGAG
GTGACCGTCA TCCCCAACGA CGGCATCAGC TTCCCGGAGA CCGGCCGGCA GGTGATCTCG
TACATGAGCA TCGAGTACTG GACACCGCCC GGGCAACCCG GAGCCCGCTG GCGGTCGCGC
TACGCCGGGC TGGCCTTCAG CGACAACGGT AACGACTTCA CTCGCACGTC GCTGACGTGG
TGGAACGACA GCACCAACAC CGACCCGTTC CAGATGTGGA CGATGCAGCG TGACGGCGAC
TGGGTGTACG TCTTCTCGGT GCGCCCGGGG CGCCAGGACG GTCCGATGAT GCTGCGTCGG
GTCTTCTGGG ATCGGATGTT CTATCCCGAG TCGTACGAGG GCTGGGGCTG GAACGGCAGC
ACCTGGGGCT GGGGCCGGCC GTGCACGCCG ATCCTGACCG GCTCGTTCGG GGAGCCCTCG
GTCCGGCGGC TCGCGGACGG CACCTGGGTG ATGTCCTACC TCAACTGCGT CACCGGGTGC
GTCGTCACCC GCACCGCCGG CGGGCCGGAC CAGGCCTGGA CGGCGGAAAA GGTGCAGATC
ACGCCGTGGC AGGAGCCGGG GCTCTACGGC GGGTTCATCC ACCCGTGGTC CAGCCGGCAG
GTCAACGACC TGCATCTGAT GGTCTCGACG TGGACCACGA CACCCGATAA CCGAAGCACC
GCCTACCACG TCAGCCAGTT CGTCGGCACT GCCTGA
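
The GC content field in the record is a percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as plain text in 10-base blocks as shown above. The function name `gc_content` is hypothetical, and the database may compute or round the value differently.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks from the block layout) is ignored,
    and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example: the first two 10-base blocks of the gene sequence.
fragment = "ATGGCCATCA ATCGCCGTAC"
print(f"GC fraction of fragment: {gc_content(fragment):.2f}")
```

Passing the full gene sequence to the function gives a fraction that can be compared with the GC content field.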
 
Protein sequence
MAINRRTLLG RVAVVGTGIA AGGLLAPDAA RAAFWKKRLT GADLDTNRRW QIAGTDLGIP 
YVLENGSIGY LLGDTFNTPW PEGPPLPNDW RSPVMLRSHA HPGAADGVVF DNAAGVLGDG
RAPELMHNGH RGIGIDGLWE VTVIPNDGIS FPETGRQVIS YMSIEYWTPP GQPGARWRSR
YAGLAFSDNG NDFTRTSLTW WNDSTNTDPF QMWTMQRDGD WVYVFSVRPG RQDGPMMLRR
VFWDRMFYPE SYEGWGWNGS TWGWGRPCTP ILTGSFGEPS VRRLADGTWV MSYLNCVTGC
VVTRTAGGPD QAWTAEKVQI TPWQEPGLYG GFIHPWSSRQ VNDLHLMVST WTTTPDNRST
AYHVSQFVGT A