Gene Sare_4735 details

Gene Information

Locus tag: Sare_4735
Symbol:
ID: 5704560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5356011
End bp: 5359355
Gene Length: 3345 bp
Protein Length: 1114 aa
Translation table: 11
GC content: 72%
IMG OID: 641274133
Product: hypothetical protein
Protein accession: YP_001539479
Protein GI: 159040226
COG category:
COG ID:
TIGRFAM ID:
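
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5356011
END_BP = 5359355
GENE_LENGTH_BP = 3345
PROTEIN_LENGTH_AA = 1114


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codons in the CDS minus one stop codon (assumed to be present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: 5359355 - 5356011 + 1 = 3345 bp, and 3345 / 3 - 1 = 1114 aa.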


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000107106
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGTCTCC CCCACCCGCC GGGAACCCCG GCCGGGCAGC CGGCCGCATC GGAAGGCAAC 
GGAACGCAGC CTGGCAGCTA CGCCAACTCC ACGGGCCTCG ACGAGTCCCT GCTCGATCCG
GCCCTGGCCA CCTCCCCGGG CCACGGCGGC GTCGGCGTCT TCCAGGCCCC CCGCCCACTG
CAACAGCGAG CACCCGCAGA GCCGACACCG GGCGCCGGCA CCGGCACCGG CGCCGACATC
GACTCCCCCT TCCTCGATCT GTTCGGCGCG ACCCGCCCAG CACGCCCCAC ACCGACCCGG
CCACCAGCGC GCGAGCAGCA ACCGACCATT CCGCAGCAGC CAGGTGCCGG ACACCAGCCG
GCCACGGCGA CCGAGGTGGA CCGGTCCCCC GCCGAACCGC CCCCGTCGGG TGGCCGCTGG
TTGACCGGTG AACAGCCGGT GGTCGACCCG GCCCGATCCC GGTCCGGATG GGTGGGCGAG
CGCACCGCCG TCCCCCTACA GACACGCGAG ACCCCGCGGA CCGGCCCACC ACCCCGGACC
CGTACCACCG CTGATGAGCA GGCGAACGGG ACAGTCGGAC AGCCCACCCG GGTACCGCAT
CAGGCCGGGA ACAGGGCTGG CCGCCCCGAA GCCGAGGCAC CGGCCCCACC CCCGGCCGAC
ACCGAGCCGC GCCACCACAC CGGCGGGCGA CCCGAGCCGG GCACCCGAGA CCTGCCCCCT
TCCGGTCGAC GCTCCGCCGG CAGCCGGGAC CGCGCCACGA GACCGGCACA AGTGAAGCCT
CCCAAGATCA GGTTTGGTGA CCGGGATCCT TCGGTGGAAC TGGCCATCAC CGAAATCGCC
GGCCACCTGA CCTTCACCCC CAACACCGTG ACCGCCTGGT ACTGGCTACC CGAGGTGCGC
TGGGCCTTCC GACCCGACGC CGAGCGGGAG GCACTACTCG CGGCCATCTC CGAGCAGTAC
GCCGGGCTCG CCGGATTCCG CCTGCACCTG CGGCGCACCA CCCGCCCCTT CCCGGCCGAC
GAGTGGGCTC GCACCATCGA CGCCCACACC CGGGCGCCGC TACCCGACGT CCCGGACACG
CCGGGTTGGG CAGATCACCT GGTCGCCGCG CAACGGCACC TGCTGGCGGT CAATCACGCC
GAGGGGCAGA CCTACCTGGG GGTGACCTTC GCCCGCCGTT CGTTGGGTGA CTCGCTGGTG
GAACGGATGC TGCGCACCTT CGGTCGCGGC GTCGCCGAAG GCGAACGGCG GAAACTCGGC
CGCACCGTCG AGCAGTTCGA CGAGGTGCTG GGTGCCTTCG GCATGCGCGG GCGCCGGGTA
ACGACGCCGG AGCTGGAGTG GCTCCTCTAC CGTTCGGTGG CGCTCTGCAT GGCCCCACCC
GGCCGACTCT CCCCGGTCAC CGACGGGCAC TGGGAACGTG GTGACCTGCT CGCCCTGACC
GAACAGGTGG AACGCTACCG CACCCCGTAC GGATCCACGG TCAAGCTGGT CAACCGGATG
ACCGGCGAGG AAGGGCATGT CGCCGTGCTC GCCGTGGGCC GGATGGAGCC ACTGGAGATC
CCCGAACGAC ACGAGCCCTG GCTGCACTTC CACGAGCGGC TGCCCTGGCC GATGGAGTTG
TCGAGCCGGG TCGACATCCT CGGCTCCGGC GATTCCTTCC GCAACCTGGA GCACCGGCTA
CGGATGATCC GATCCCAACA GCTCGACTAC GCCGAGCACG GGATCGACGC CCCACCCGAG
TTGGAGCGCC TCGCGGAGCG GGCACTGGTG ATCGGCGACG AGATGACCAC CGGACTGCCG
GTCGAATCCG CCCGCGCCCA CGGCTGGCAT CGACTCGCGG TGGGCGGCCG GACCCGGGAG
GAGTGCCTGG AACGCGCCCG CCGGCTGATC CAGCTCTACT CCCGGGAGCT GCGCATCTCG
CTGCAGCACC CGAAGAACCA GGACTGGCTG GCCCGCGAGT TCATCCCCGG TGAGCCGATC
GCCAACACCG GCTACGTCCG CCGGATGCCG GTCAACCTCT TCGCCGCCGC GCTGCCACAG
GCCGCCTCGA CCGTGGGCGA CCGTCGGGGT GACCTGATCG GACGAACCGC CGGCACCTGT
CGCCGCCCGG TCTTCCTCGA CCTTCACTTC CCGATGGAGG TGCGGGAACG CTCGGGTCTC
GCGGTCTTCG TGGCGGAGCC CGGCGGCGGT AAGTCCACCC TGCTCGGTGC CCTCGGTTAC
CTCGCCGCCC GCCGGGGGGT ACAGGTAACG CTGCTCGACC CGTCCGGCCC CCTCGCCCGA
CTCTGCGCGA TGCCAGAGCT TGCCCCGTAC GCGCGGGTGC TGAACCTGAC CGGCTCCGAA
CCCGGCACCC TCGCCCCGTA CTCGCTGATC CCCACCCCGC TCCGCAGCGA GTTCAGCACC
GGCGAGGCCG GCGACCGGGA ATTCGAGATC GCCACCTCCA ACGCCCGAGC CGAGCGCCGG
ATGTTGGTCC AGGACATCTG CATGATGCTG GTGCCGCCGC AGGTGGCGCG GGAGGCGTCC
ACGGCCACGC TGTTCCGGCA CGCCGTACGC CAGGTGCCCG CGGAGGAGAC GGCCACCCTG
GACGACGTGG TCACCACCCT CGGCAAGCTC GACGACGACG CGGGCCGGGA ACTGGCCAAC
CTCCTGCTGG ACACCGCCGA GATGCCACTG GCCATGCTCT TCTTCGGTCG CCCCCCGGAA
GGCCTGCTCG GGCCGGACGC CACCCTCACC GTGATCACGA TGGCCGGGCT CCGCCTGCCC
GACCTCAAGA TCGAACGCGA GTACTGGTCG GCTGAGGAGG CCCTGGCCCT ACCCATGCTG
CACACCGCCC ACCGGCTCGC CGTGCGCCGC TGCTACGGCG GGTCGATGTC GTCCCGCAAG
CTGGTCGGGC TGGACGAGGC GCACTTCATG GAGGGCTGGC GTTCCGGGAG GTCGTTCCTG
GTCCGGCTGG CCCGGGACTC CCGCAAGTGG AACCTGGCCG CACTGGTCGC CTCGCAGAAC
CCAAGGGACA TCCTCGGCCT CGACGTGCAG AACCTGGTCT CCACCGTCTT CGTCGGGCGA
ATCGCCGAGG ACGCGGAGAT CGCCTCCGAG GCACTGCGCC TGCTGCGCGT CCCGGTCGAC
GACGGGTACG AGGCCACCCT CGCCTCCCTC TCCGCCGCCG ACAGCACCTC GGCCGCCCGC
CTCGGCTACC GCGAGTTCGT GATGCGCGAC GTCGACGGAC GGGTGCAGAA GGTCCGGGTC
GACGTCTCGT ACGTCGACGG GCTGCTCGAT CACCTCGACA CCACGCCGGC GGCGATGGCT
GCCGCAGCCG GGGTGCTGCC CGTCGTACCT GATCTGGAGG CGTGA
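
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be flattened before any analysis. The following is a minimal sketch of that step, together with one way a GC-content figure such as the one listed in this record could be computed. The function names are hypothetical, and the source does not say how the database derives its own value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence in this record, used only
    # as a short illustration.
    sample = "ATGAGTCTCC CCCACCCGCC"
    print(len(clean_sequence(sample)))
    print(gc_fraction(sample))
```

To check the whole gene, paste the full sequence block into `gc_fraction` as one multi-line string. The result can then be compared with the GC content field of the record.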
 
Protein sequence
MSLPHPPGTP AGQPAASEGN GTQPGSYANS TGLDESLLDP ALATSPGHGG VGVFQAPRPL 
QQRAPAEPTP GAGTGTGADI DSPFLDLFGA TRPARPTPTR PPAREQQPTI PQQPGAGHQP
ATATEVDRSP AEPPPSGGRW LTGEQPVVDP ARSRSGWVGE RTAVPLQTRE TPRTGPPPRT
RTTADEQANG TVGQPTRVPH QAGNRAGRPE AEAPAPPPAD TEPRHHTGGR PEPGTRDLPP
SGRRSAGSRD RATRPAQVKP PKIRFGDRDP SVELAITEIA GHLTFTPNTV TAWYWLPEVR
WAFRPDAERE ALLAAISEQY AGLAGFRLHL RRTTRPFPAD EWARTIDAHT RAPLPDVPDT
PGWADHLVAA QRHLLAVNHA EGQTYLGVTF ARRSLGDSLV ERMLRTFGRG VAEGERRKLG
RTVEQFDEVL GAFGMRGRRV TTPELEWLLY RSVALCMAPP GRLSPVTDGH WERGDLLALT
EQVERYRTPY GSTVKLVNRM TGEEGHVAVL AVGRMEPLEI PERHEPWLHF HERLPWPMEL
SSRVDILGSG DSFRNLEHRL RMIRSQQLDY AEHGIDAPPE LERLAERALV IGDEMTTGLP
VESARAHGWH RLAVGGRTRE ECLERARRLI QLYSRELRIS LQHPKNQDWL AREFIPGEPI
ANTGYVRRMP VNLFAAALPQ AASTVGDRRG DLIGRTAGTC RRPVFLDLHF PMEVRERSGL
AVFVAEPGGG KSTLLGALGY LAARRGVQVT LLDPSGPLAR LCAMPELAPY ARVLNLTGSE
PGTLAPYSLI PTPLRSEFST GEAGDREFEI ATSNARAERR MLVQDICMML VPPQVAREAS
TATLFRHAVR QVPAEETATL DDVVTTLGKL DDDAGRELAN LLLDTAEMPL AMLFFGRPPE
GLLGPDATLT VITMAGLRLP DLKIEREYWS AEEALALPML HTAHRLAVRR CYGGSMSSRK
LVGLDEAHFM EGWRSGRSFL VRLARDSRKW NLAALVASQN PRDILGLDVQ NLVSTVFVGR
IAEDAEIASE ALRLLRVPVD DGYEATLASL SAADSTSAAR LGYREFVMRD VDGRVQKVRV
DVSYVDGLLD HLDTTPAAMA AAAGVLPVVP DLEA