Gene Sare_0602 details


Gene Information

Locus tag: Sare_0602
Symbol:
ID: 5704372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 681963
End bp: 684992
Gene Length: 3030 bp
Protein Length: 1009 aa
Translation table: 11
GC content: 70%
IMG OID: 641270128
Product: lantibiotic dehydratase domain-containing protein
Protein accession: YP_001535521
Protein GI: 159036268
COG category:
COG ID:
TIGRFAM ID:
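
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The Python sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 681963
end_bp = 684992
gene_length_bp = 3030
protein_length_aa = 1009

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop agree with protein length
```

Under these assumptions all three checks agree with the listed values: 3030 bp is 1010 codons, which is 1009 amino acids plus one stop codon.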


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCGGT CGGTCGATGC CGCGACCATC CGCATTGCCG CTCACTCATG GCCAGACAGC 
CCAGCGCCTT GGCCGGGCCG GAGCGGTGCC TCCGGTATCG CGTACTGGTT GCAAGCGACG
TGGCGGAAGG TCGGCCTGGC CGAAGCTGTC TGGGCCGCCA GCCCCGAGTT CGCTGCTCGC
GTAGAAGCAG TCCTCGCCAG CGGAAGCGCT GCTCCGGACC GAGGGTGGCG GATGGCGCTG
GCGTTGGCGC GGTACTTGGT CCGTGCGCAG CGGCGTGCGA CGCCCTTCGG GTTGTTCTCG
GGGGTGGCGA CCCTGCGATT CGACACGGTG GCGGCGGTGG TGCCGGCAGG ACAGCCGGCG
ATCCGGGTAC GGCCGGACGC GAGCTGGCTC GCGTCCCTGA TCGCGCGCCT GGAAGCTGAT
CCCGACGTAC ACCGCCGTCT GCCCGTGCAG GCGAACAACC TGGCGCTGGT GAACGGGTCT
CGGATCGTCG GGGCGCGCCG GCCGCACGCC TCCGCACGTA CGGAGTCGAC CTCGGTCAGG
AGCACCGCAG CGGTACGGCT GGCGATGTCA CTCACCCCTA CCCCGTTGCC GTGGGCTGAG
CTGGTCGACG CGGTCGCTGC TGCGTTTCCC GGCTTCCCAC GCGCGTCCGC TGACGCGATG
GTTGCCGGCC TGGTGGACCA CGGCGTGTTG ATCTCCGCTC TCCGACCACC ATCGACGTGC
ACAGACCCGA TCAGGCATGT CCTGGACACC CTCGATGCGA TGTTCGACGG TGCTCCTGGG
GGACAGCGAG CCCTTCGGGC CGAACTGCGC TCGCTGCATG ATCGTCTCGG TGGTACCAGC
GCAGGAAGCA CGACGTACCT TTGTGGGCTC GCCGACGATA TGCGCAGAGT GGCACCGGCC
ACCCAGCCGC TCGCGGTGGA CCTTCAGCTC GCGGATCAGG TCGCCCTGCC GGAGCAGCTC
GCCGCCGAGA TCGCCGCTTC GGTGGAGGTG CTACGACGGC TGACGCCGGA TCCGGCGGGG
CGGCCCGAGT GGTGTGCGTA CCGTGCCCGG TTCGTCGACC GTTACGGGAT GACGGCTGTG
GTGCCGCTCG CTCGGCTGGT AGATCCGCTC GTCGGGTTGG GCTTCCCAGC ACACTTCGCC
ACTGTGGCCG ATGCGCCGGT GGCTCTGCTG TCCGACCGTG ACGAGCGTCT GCTCGCGATG
GCTCAGCAGG TGGTGATCGA TGGTGCTCGT GAGGTGGTTC TCGATGACGC GGCCGTCGAA
TGGCTTGCCG GCACTGGCCG GGTCACCGGG TCTGTCAGCC CTCACGTCGA CGTTTCGGCA
GAGGTCCGCG CGGTCTCGCT GGAGGCGCTG ACACAGGGAC GGTTCACTGT CGCGCTGACC
GGGATGGGCC GCTCGGCGAT GGCGACGAGT GGGCGCTTCC TCGACCGGCT GCCCTACGCC
GATCGGGAGT TGATGTGCCG GGAGTTCGCC CGCCTTCCGG TCGCGGTGGC CGGGGCCATG
CCGGCGCAGC TGAGCTTCCC GCCGCGCAAG CTACACAGCC AGAACGTGCT CAACTCCCGG
CAGGTACTGC CGTGGCTGGT CAGCCTGGCC GAGCACCGGC ACGTGGCCGA GAACGTGATT
GGCCTCGACG ACCTCGGTGT GACCGCCGGC TGCGATCGGC TGGTGCTGGT GTCGATGTCG
CGGCGGCGGG TGGTGGAGCC GACAGTGGGG CACGCGGCTG CCGTACACAC GATGCCGTTG
CTCGGCCGGT TTCTCCTGGA GTTGCCGCGA GCCACGGATG CCCGGCTGAA ACCGTTCGAC
TGGGGTGCGG CGTCCTGCTT GCCGTTTCGA CCCGCGCTGA GGTACGGCCG TGTCCTCCTG
GCGGCGGCCC GGTGGCGGGT CGATCCGGCG GCCCTGCCCG GTGCGGAAGC AGGCGATGAC
GAATGGTTGA CGGCCTGGGA AGCACTGCGG ACGAGGCTGC GGCTGCCGGC ATGGGTGCAG
GTCGGCAACG GCGACCAGCG GCTTCGGCTC CACCTCGATC AGTCCATGGA TCGTGCCCTG
CTGCGTGCCC ACCTGGACGC CAGCGCCGGA CCGGTGACCG TGGTGGACGC GGCGAGCCCG
GAGGACTTCG GTTGGCTCTC AGGCCGCGCC CACGAGATCG TCGTGCCCGT TGCCTCGACG
GTTGCGCCCG CTGCCGCGCC GGCTGCGGTT ACCGCGCGAG GCGCATGGCC GCCACCCGCT
CCCCCCGAGC CGATTATCCC GGGTGCCAGT GGCCTGCTGT CCGCTTCGGT GGCCGTCGAG
CCTTCGACGA TGGAGCTGGT CTTGATGCGG GGGCTGCCCG CGCTGTTCGC CGACTGGCCC
GAGCCGCCGC TGTGGTGGTT CGTGCGCCTG CGCCGCTCCA CTCCGCATCT TCGACTACGA
CTGCACACCG ATGACTACGG CGACGCGGCG GTACGGGTCG GGCGGTGGGT CGCCGGGCTC
CGACAGCAAC GCCTTGCTGG CGATTGGAGC CTGGACACCT ACCATCCCGA GTCCGGTCGC
TACGGCTGCG GCACCGCGCT GGCAGCGGCC GAGGAACTCT TCGCCGCCGA CTCCACCGCA
GCCCTGGCGC AGCTGGTAGC GCAGCCCGAG TCTGGGATCG ATCGGCAGGC GCTGACCGCC
CTGAGCATGG TCGATCTGGC CGCCTCGATG CTCGGCAGTC GCACCGACGG ATGCGAGTGG
CTGGTGGCGC GCCCCGAGCA GACCGGGCAG GCACCGATCC AGCGAGCCGT GCTGCGTCAA
GCGGTCACCC TCGACCCGGG CAAACTTCCC GAGCCCATCC AGCAGGCATG GCAGGAACGC
TCCAGGGCCG CAACCCGGTA CGCCGCCGAG CTGTCCGCGG TCGCCGGCCC GCTCACACCC
GCCTCGGTGC TCGCGTCGCT GATGCACCTG CACGTCGTTC GCGCTCTCGG CCCGGACGAA
GACGCCGAGC AGCTCACGTA TCGGTTGGCC CGTCATGTCG CGCTGGCGGC GGTGCGCCGC
CGGGTTCGCG CCGTAGGAGC ATCCCGATGA
 
Protein sequence
MFRSVDAATI RIAAHSWPDS PAPWPGRSGA SGIAYWLQAT WRKVGLAEAV WAASPEFAAR 
VEAVLASGSA APDRGWRMAL ALARYLVRAQ RRATPFGLFS GVATLRFDTV AAVVPAGQPA
IRVRPDASWL ASLIARLEAD PDVHRRLPVQ ANNLALVNGS RIVGARRPHA SARTESTSVR
STAAVRLAMS LTPTPLPWAE LVDAVAAAFP GFPRASADAM VAGLVDHGVL ISALRPPSTC
TDPIRHVLDT LDAMFDGAPG GQRALRAELR SLHDRLGGTS AGSTTYLCGL ADDMRRVAPA
TQPLAVDLQL ADQVALPEQL AAEIAASVEV LRRLTPDPAG RPEWCAYRAR FVDRYGMTAV
VPLARLVDPL VGLGFPAHFA TVADAPVALL SDRDERLLAM AQQVVIDGAR EVVLDDAAVE
WLAGTGRVTG SVSPHVDVSA EVRAVSLEAL TQGRFTVALT GMGRSAMATS GRFLDRLPYA
DRELMCREFA RLPVAVAGAM PAQLSFPPRK LHSQNVLNSR QVLPWLVSLA EHRHVAENVI
GLDDLGVTAG CDRLVLVSMS RRRVVEPTVG HAAAVHTMPL LGRFLLELPR ATDARLKPFD
WGAASCLPFR PALRYGRVLL AAARWRVDPA ALPGAEAGDD EWLTAWEALR TRLRLPAWVQ
VGNGDQRLRL HLDQSMDRAL LRAHLDASAG PVTVVDAASP EDFGWLSGRA HEIVVPVAST
VAPAAAPAAV TARGAWPPPA PPEPIIPGAS GLLSASVAVE PSTMELVLMR GLPALFADWP
EPPLWWFVRL RRSTPHLRLR LHTDDYGDAA VRVGRWVAGL RQQRLAGDWS LDTYHPESGR
YGCGTALAAA EELFAADSTA ALAQLVAQPE SGIDRQALTA LSMVDLAASM LGSRTDGCEW
LVARPEQTGQ APIQRAVLRQ AVTLDPGKLP EPIQQAWQER SRAATRYAAE LSAVAGPLTP
ASVLASLMHL HVVRALGPDE DAEQLTYRLA RHVALAAVRR RVRAVGASR
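
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal Python sketch, not the method used to produce the record: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares. The sketch does not model alternative start codons, which is sufficient here because the gene begins with ATG.

```python
# Codon table built from the standard NCBI ordering (bases T, C, A, G).
# Translation table 11 uses these same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as listed in this record.
first_line = "ATGTTTCGGT CGGTCGATGC CGCGACCATC CGCATTGCCG CTCACTCATG GCCAGACAGC"
print(translate(first_line))  # MFRSVDAATIRIAAHSWPDS
```

The printed 20 residues match the first two blocks of the protein sequence above (MFRSVDAATI RIAAHSWPDS). Passing the full gene sequence to `translate` and `gc_percent` would let a reader compare the results against the listed protein sequence and GC content.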