Gene Sare_2590 details

Gene Information

Locus tag: Sare_2590
Symbol:
ID: 5707175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2951729
End bp: 2953621
Gene Length: 1893 bp
Protein Length: 630 aa
Translation table: 11
GC content: 72%
IMG OID: 641272052
Product: hypothetical protein
Protein accession: YP_001537422
Protein GI: 159038169
COG category: [S] Function unknown
COG ID: [COG1944] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00702] uncharacterized domain
            [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family
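
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(2951729, 2953621)
print(length)                     # 1893
print(protein_length_aa(length))  # 630
```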


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.188065
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.130609
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACG GTACGACCGC GGCCGACGAC CTGGTCCGTG CCGGGCTGGC CCGGCTGCTC 
GCCGAACGGG GCTGCGACGC GGACGTGGTC GCGGTGGGGG TGCGGGACGA ACTCAGCGCG
ACCCCCCGCC CGGAGCGCGG ACCGCGGGTG CTGCTCTACG GGCAGCTGGC GCTGATCGGA
CCGGTGCCGC CGGCCGCGGC CTGTTCGATC TGCCTCGCCC GTCGCTGGCA GGCGGTGCGC
CACCGCGACC TGCGCGACGC GCTGGAACTG GGCGGTGACA CGGTCGCGGC GGGCCCCTGG
CCGTACGCGA CCCCGCTCGT CGCGGACTTC CTGCATGCCC TGATCGCAGC CCGCCGTACC
GCCGCCACCG GGCCGGCAGG CACCGGGACG GTGACGCCCA GCACGGTGTT TCAGGTCGAC
CTGTCGACCC TGCGGGTGCA CCAGGTGCCG CTGCTGCCCG ACCCCGAGTG CCCGGCCTGC
GGCGACATGG TCACCGATGC ACCGGAACTG GCGGCGATCG AGCTGCCCGC CACACCGAAA
TCGGAGCCGG GCACGTTTCG CGGCCGGGAC CTGGACGACT ACCCGCTGAA CGTGGCGGCG
TTCGCCAACC CCGTGTGCGG GGCGCTCGGC GCCAGTCTGT GGCAGGACGT CACCTCGCTG
TCCACCTCCC CGGCGGTCGG CTCGTTCACC CTGCGATGTG GGCGCTTCCT GCGGGAGACG
CTCTACGGTG GGCACACCGA CGGCTACCGG AGCAGCGCCC GGATCGCGGT GCTGGAAGGA
CTGGAACGCG CCGCCGGCCT GCGCCCGCGG GGAAAACGCA CCGCGGTCAC CGCGACCCTG
CGGGAACTGG GCGACGAGGC CCTGGACCCA CGCGAGTGTG GGCTCTACAC CGACGCCTTC
TATCAGGCCG CTCCGTACCT GCACCGGTTC GACGTGGACC GGCCGATCAC GTGGGTGTGG
GGCTGGTCGC TGCGCGACCA GCGCCCGCTG CTGGTGCCGG AGGTCCTCGC CTACTACCAC
GCGGCGAGTG TCGAGGAGCG GTTCGTCCAG GAGACCTCGA ACGGCTGCGC CTCCGGCGGA
TCGATGGTGG AGGCGATCTA CCACGGCCTG ATGGAGGCGA TCGAGCGCGA CGCGTTCCTG
CTGGCCTGGT ACGGCGGCCG GTCCCTACCG GAGATCGACC CGGCCACCAT CGACCGACCA
CGGACCCGGA TGATGGTCGA CCGGCTGGCG ATGTACGGCT ACCGGGCCCG ATTCTTCGAC
ACCCGGATGA CCTTCGACAT CCCGGTGGTG ACCGCGGTGG CCGTCCGTGC GGACGGCGGT
CTCGGTACCC TCGCCTTCGG CGGTGGGGCG AGCCTCGACC CGCAGGCCGC GATCACCGCG
GCGCTCTGCG AAATCGCCAC CGACTCGGTG ATGGTCCGGG TCCGCGCCCG CGCCGACGAA
ACCCGGTTAC GTCAGATGAC GACCGACTTC TCCCGGGTGC AGAGCCTGCA CGACCACCCG
TTGCTCTACG GCCTGCCGGA GATGGCGCGG CACGCCGCGT TCCTGCTGGA ACACGGCAGG
GCGCCGGTCC CGATGGCGCA CCTGTACGAG CGGGACCGTC CCGCCCCACC GGTCACCACC
GACCTGCGCG ACGACCTCGA ACGCTGCCTG AAGCAGGTGA CCGCGCAGGG CTTCGACGTG
ATCGCCGTCG ACCAGACCAC CCCCGAGCAG CGCGAGCTGG GCCTGACGAC GGTGAGCGTG
GTGGTCCCGG GACTGCTGCC GATCGACTTC GGCTGGCTAC GCCAGCGCGC CCCGCACGCG
CGGCGGCTGC GGACCGCGTT CCGCACCGCC GGGCTCCTGC ACCGCGACCT GCGCGACGAC
GAAATCCACT CCGTTCCCCA CCCGTTCCCG TGA
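
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence above. This is a minimal sketch that assumes the sequence is pasted in the same 10-base block layout, and that GC content is simply the fraction of G and C bases. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the 10-base block layout."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste the full sequence to
# compare against the GC content value given in the record.
first_line = "ATGAACGACG GTACGACCGC GGCCGACGAC CTGGTCCGTG CCGGGCTGGC CCGGCTGCTC"
print(f"{gc_fraction(first_line):.0%}")
```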
 
Protein sequence
MNDGTTAADD LVRAGLARLL AERGCDADVV AVGVRDELSA TPRPERGPRV LLYGQLALIG 
PVPPAAACSI CLARRWQAVR HRDLRDALEL GGDTVAAGPW PYATPLVADF LHALIAARRT
AATGPAGTGT VTPSTVFQVD LSTLRVHQVP LLPDPECPAC GDMVTDAPEL AAIELPATPK
SEPGTFRGRD LDDYPLNVAA FANPVCGALG ASLWQDVTSL STSPAVGSFT LRCGRFLRET
LYGGHTDGYR SSARIAVLEG LERAAGLRPR GKRTAVTATL RELGDEALDP RECGLYTDAF
YQAAPYLHRF DVDRPITWVW GWSLRDQRPL LVPEVLAYYH AASVEERFVQ ETSNGCASGG
SMVEAIYHGL MEAIERDAFL LAWYGGRSLP EIDPATIDRP RTRMMVDRLA MYGYRARFFD
TRMTFDIPVV TAVAVRADGG LGTLAFGGGA SLDPQAAITA ALCEIATDSV MVRVRARADE
TRLRQMTTDF SRVQSLHDHP LLYGLPEMAR HAAFLLEHGR APVPMAHLYE RDRPAPPVTT
DLRDDLERCL KQVTAQGFDV IAVDQTTPEQ RELGLTTVSV VVPGLLPIDF GWLRQRAPHA
RRLRTAFRTA GLLHRDLRDD EIHSVPHPFP
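
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to reproduce it, assuming the table 11 amino acid assignments match the standard genetic code (the two differ in their permitted start codons, which does not matter here because the gene starts with ATG). `translate` and `CODON_TABLE` are illustrative names, not a library API.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks unknown codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAACGACG GTACGACCGC GGCCGACGAC"))  # MNDGTTAADD
print(translate("CCGTTCCCG TGA"))                      # PFP
```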