Gene Sare_3033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3033 
Symbol 
ID5707350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3440520 
End bp3442790 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content64% 
IMG OID641272478 
Producthypothetical protein 
Protein accessionYP_001537846 
Protein GI159038593 
COG category[S] Function unknown 
COG ID[COG3472] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGCCCA CACTCTACCG CGACACTGGG TACTCACTGA ACCACCTGAT CGAGGACATC 
AAGGTCGGAC GCATTGGCCT GCCCGACATT CAGCGTCCCT TCGTCTGGTC GGCCACCAAG
ACCCGAGACC TGTTCGACTC GATGTACCGC GGCTACCCCA TCGGCACCCT GATGTTCTGG
GAAACCGGTG CGGAGGCCGG TACCCGCCAA ATCGGCGTGG AGTCCTCCGG CCGGGCACCA
CAACTCCTCA TCGTTGACGG GCAGCAGCGA CTGACCTCGC TGTTTGCCGT TCTCACCGGC
CGGCCGGTGC TGACCAAGTC GTTCGAACAG CGCCGCATCC GGATCGCCTT CCATCCGCAA
GACCAGACCT TCGAGGTGAC GGACGCCGCG ATCGAGCGCG ACGCACACTT CATCCCCGAC
ATCACCACGC TTTGGTCGAG CGGCTACAAG ACCGCCGTGC GCCAGTTCTT CGATCGCCTG
GACCAGGCCA ACGAGAAGCT GTCGGACGCC CACAAGGACG ACCTTGAGGA ACGCATTGAC
CGTGTGCGCG ACCTGCGCGA GTTCCGGTTC CAGGTCGTTG AGCTGGGCGC CTCGGCGAAC
GAGGAGCAGG TCGCCGAGGT TTTCGTCAGG ATCAACTCCG AGGGCGTCAA GCTCAACCAA
TCGGACTTCA TCCTCACCCT CATGTCTGTG CACTGGGAGC AGGGACGCCG GCAGCTCGAA
GACTTCAGCC GCTCCGCCGT CGACCCGGCA GTCACCGGGC CCAGCCCTCG CAATCCGTTC
CTCGACCCGA GTCCAGACCA ACTGTTGCGG GTAGCCGTCG CGGTTGCCTT CCGCCGCGCC
CGCCTGCAGC ACGTCTACAG CATCCTGCGT GGCAAAGACC TTGAGACCGG ACGGGTCAGC
GCAGAGCGCC GCCAGCAGCA GTTCGAACGG CTCGCCGCAG CGCAGGAGAG AGCGCTCGAC
CTGACCAACT GGCACGAGTT TCTCAAGTGC CTGACCACCG CTGGATTCCG CAACCGCAAG
ATGATCACGT CGGACAACGC CCTGCTGTTC AGCTACTCAC TCTGGCTGAT CGGCCGACAC
GACTTCGGCC TCGACGTGTC AACCCTGCGC CCGATGATCG CACGATGGTT CTTCATGGCG
CACACCACCG GCCGCTACAC CAGTTCGCCG GAGTCCCAGC TCGAATCCGA CCTGGGACGG
ATCACCGGCC TCGCGACCGG CGACGGAACC GCCTTTGTCA CGGAACTGGA CCGCATCGTT
GCCGCGAACT TCACCGGAGA CTACTGGGAC ATCTCACTGC CGAACCGGCT TGACACCTCG
TCGTCGCGAT CGCCGGTACT CTTCGCCTAC CTCGCCGCGC TGAACATCCT CGACGCCGAA
GTGCTCTTCA GTAACCTGCG TGTCAAAGAT CTACTAGATC CCTCAGGTGC AGGGCCGAAT
TCCGTCGGAC GTGACCGCCT GTTCCATCGC AAGCACCTCG AGTCTATCGG TTTCTCCGGG
ACGCGACAGC TCAACGCCAT CGCGAACATG GCGTACGTCG AATGGCCGCC AGCTGAGCGG
AACAACGCTG ACGCTCCGCG TGACTACCTA CCCCGCATCG CCGAGGCGAT TGATCCGGAG
ACGCTGACCC GCCAGTCCCG CTGGCACGCG CTTCCGGTGG GTTGGGAGCA ACTGGACTAT
CCAACCTTCC TCGAGCGTCG CCGCCAGCTC ATCGCCCGAG TCGTCCGGGA TGCCTTCAAC
ACGCTCGCCG GCGAGCGCAA GAAGTACATA GCGACGACCG CCGAGGATCT CATCGCGGCG
GGCGAAACCC AGACGACGGA GTTCAAGTCC AGCGGCCGAT GGAACCCCCA CACCGGCCAG
TACGACCCGA AGCTCGGGCA GATCCTCATC AAGACGGTGT GCGGATTCCT CAACGCCGAG
GGCGGTGTCC TACTCATTGG CGTCGATGAC GACGGCCAGG TTCTCGGCAT CAAGGGCGAC
CTCACCACTC TCGGCACGAA GCCGAATGCC GACGGTTACG AACTGTTTCT CCGACAACTG
CTCGACGACA GCCTCTCCGC TTCGACCGCC CCGACCGTCC GCATCCGCTT CCCCCAGATC
GCGGGACAGA CGATCTGCCA GGTCACTGTC GCCGCCTCCG GCCGACCCAT CTTCGCCAAA
CCGGCCAAGG GCGGCCCCGG CGCGTCGGAC TTCTGGGTAC GTGTCGGCAA CGCGACCAAA
CAACTCCACG GCGATGACCT CATCAAGTAC CAAGAGGAAC ACTGGGGATG A
 
Protein sequence
MPPTLYRDTG YSLNHLIEDI KVGRIGLPDI QRPFVWSATK TRDLFDSMYR GYPIGTLMFW 
ETGAEAGTRQ IGVESSGRAP QLLIVDGQQR LTSLFAVLTG RPVLTKSFEQ RRIRIAFHPQ
DQTFEVTDAA IERDAHFIPD ITTLWSSGYK TAVRQFFDRL DQANEKLSDA HKDDLEERID
RVRDLREFRF QVVELGASAN EEQVAEVFVR INSEGVKLNQ SDFILTLMSV HWEQGRRQLE
DFSRSAVDPA VTGPSPRNPF LDPSPDQLLR VAVAVAFRRA RLQHVYSILR GKDLETGRVS
AERRQQQFER LAAAQERALD LTNWHEFLKC LTTAGFRNRK MITSDNALLF SYSLWLIGRH
DFGLDVSTLR PMIARWFFMA HTTGRYTSSP ESQLESDLGR ITGLATGDGT AFVTELDRIV
AANFTGDYWD ISLPNRLDTS SSRSPVLFAY LAALNILDAE VLFSNLRVKD LLDPSGAGPN
SVGRDRLFHR KHLESIGFSG TRQLNAIANM AYVEWPPAER NNADAPRDYL PRIAEAIDPE
TLTRQSRWHA LPVGWEQLDY PTFLERRRQL IARVVRDAFN TLAGERKKYI ATTAEDLIAA
GETQTTEFKS SGRWNPHTGQ YDPKLGQILI KTVCGFLNAE GGVLLIGVDD DGQVLGIKGD
LTTLGTKPNA DGYELFLRQL LDDSLSASTA PTVRIRFPQI AGQTICQVTV AASGRPIFAK
PAKGGPGASD FWVRVGNATK QLHGDDLIKY QEEHWG