Gene Sare_2583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2583 
Symbol 
ID5707168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2941478 
End bp2943301 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content72% 
IMG OID641272045 
Producthypothetical protein 
Protein accessionYP_001537415 
Protein GI159038162 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.11419 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGGCCC TGGGCGACAC CGACGCGCTC GCGGCGCCCG CCGCGGCCGA CCCCGACCGC 
CGCGACGCCA CCGTGCACCT CACCGCCGCC ACCGTGCTGA TCGGACCGTC CAGCAGCACG
GCCGGCCCCG CCTGCGGACA CTGCCTGGCC ATCCGATGGC AGCGGCTGCG CACCCGCAGC
CAACGCGACG CCCTGGAAGT CGGCGACCAG ACCATTGCCG TCGGCCCGTG GCCACAGCTG
ACCGCCTACC AGCTCGACGC CGTCTGGGAG CTCTACCGCG CCAGCCACAC CGGCCCACCA
CTGCCGCCGC CACCAAGCTG GGACCGGCAC AGCACCCCGC TGCCCCGGGT CAGCCAGCTC
GACCTGGCCA GCCTGCGGAT CCGCACCTAC CCGGTGCTGG CCGACCCGCG CTGCCCCAGT
TGCGCCCGGC AACGACCTGA CACCCCCGAC CCTCTGGTGC TCGCGCACCA ACCCCGGCCC
AAGCCGCGCC CCGACACGTA CCGGCTGAGC ACTCCCGACG GCTATCCGCT GCCAGCGACC
GCGCTGGTCA ACCCGGTCTG CGGCGCGCTG GGTGCGGGTA CCGCGCTCAC CATCACGTCA
CCGACCACAG CACCCGTGAC CGGCAGCGTC TTCATCCGCG GCTACGGCGG GCTGCTCGAC
GTCTCCTGGA GCGGCCAGTC CAGCGGCTAC GACGCGAGCC GCTCGCTGGC CTACCTCGAA
GGGCTGGAGC GCTACGCCGG AACCCACCGG CGGCGCAACA CCATCCCGGT CGTCGCCGCA
TACGCCGATC TCGACACCGA CGCACTGCAC CCGGACCGCT GCGGCAGCCA CCCCGACGAG
GTGTACGACA CCGACCCGAT CCTGCGCCGG TTCGACCCAC AACGACCGAT CCCCTGGGTG
TGGGGCCAGA ACCTGCACAC CGGCAAGCCC GTACTGGTGC CTCGCCGACT GTGCTTCTAC
AGCTCGCCCG CCGCCGGCGA CACCTTCGTC CTCTCCTCCT CCAGCGGCTG CGCCACCGGC
AGCTGCCTGG AGGAAGCCGC CCTGTTCGGC ATGCTGGAGC TGATCGAACG CGACGCGTTC
CTGCTCGCCT GGTACGGCAA CCTGACCCTG CCCCGGATAG ACCTCGACAC CTGTCCGCCG
GTCGTGCGCG CGCTGGTCGA TCGCGCCGAA CTGCAGGGCT ACCGTCTCTA CGCCTTCGAC
AACCGGATCG ACCTCGACGT ACCCGTGGTC ACCAGTCTCG CCGTCCGCCA CGACGGCGGT
CCCGGCCTGC TGTCGTTCGC CGCCGCGGCC CACCTCGACC CGCGGCAGGC GGTCACCGGC
GCGCTCGCCG AGGCGCTCAC CTACATCCCA CACCAGCCCG CCACGGTGCG CCGACGTCGG
GCCGAACTGG AGCGGATGGC CGACGACTAC ACCCTCGTCC GCCGGCTGCC GGACCACTCG
GCACTGTTCG GCCTACCCCG AATGGCGGTG CACGCCGAAA GCTACCTCGA CGACCGAGGC
ACGCTCCCCA TCGAGCACGC GTTCACCGGC TACCGGCCCC CCGGCACGCC GGATCTCCGC
GATGACCTGC GCCGGGTGCT CGACCTGCTG GATGCGCGCG GGCTCGAGGC GATCATGGTG
GACCAGACCA CACCAGAACA GGAGGCGGTC GGACTCCGCT CGGTCTGCAC GATCGTGCCC
GGCCTGCTAC CGATCGACTT CGGCTGGATC CGACAACGGG CTCCGCACCT GCCGCGGCTG
CGGACCGCGC CCGTGGTGGC CGGCCTCGCC GACACCGAAC TTACCGACGC CGACTTCCGC
CTCGTTCCGC ACCCCTTCCC ATGA
 
Protein sequence
MVALGDTDAL AAPAAADPDR RDATVHLTAA TVLIGPSSST AGPACGHCLA IRWQRLRTRS 
QRDALEVGDQ TIAVGPWPQL TAYQLDAVWE LYRASHTGPP LPPPPSWDRH STPLPRVSQL
DLASLRIRTY PVLADPRCPS CARQRPDTPD PLVLAHQPRP KPRPDTYRLS TPDGYPLPAT
ALVNPVCGAL GAGTALTITS PTTAPVTGSV FIRGYGGLLD VSWSGQSSGY DASRSLAYLE
GLERYAGTHR RRNTIPVVAA YADLDTDALH PDRCGSHPDE VYDTDPILRR FDPQRPIPWV
WGQNLHTGKP VLVPRRLCFY SSPAAGDTFV LSSSSGCATG SCLEEAALFG MLELIERDAF
LLAWYGNLTL PRIDLDTCPP VVRALVDRAE LQGYRLYAFD NRIDLDVPVV TSLAVRHDGG
PGLLSFAAAA HLDPRQAVTG ALAEALTYIP HQPATVRRRR AELERMADDY TLVRRLPDHS
ALFGLPRMAV HAESYLDDRG TLPIEHAFTG YRPPGTPDLR DDLRRVLDLL DARGLEAIMV
DQTTPEQEAV GLRSVCTIVP GLLPIDFGWI RQRAPHLPRL RTAPVVAGLA DTELTDADFR
LVPHPFP