Gene Sare_2589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2589 
Symbol 
ID5707174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2950106 
End bp2951719 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content72% 
IMG OID641272051 
Producthypothetical protein 
Protein accessionYP_001537421 
Protein GI159038168 
COG category 
COG ID 
TIGRFAM ID[TIGR03605] SagB-type dehydrogenase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0423242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.130609 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCCG AGAACCCCGG CCTCGCCCAC GCGTACGCGA CCGCGATCCT GCGGCGCGGC 
CGTGAACCGA TGCCCCCGGC AGACTTCACG CCCAACTGGG CCGACGCGCC CCGCCGCGGC
AAGTACTACC CGCGGGCGAG CGCCTTCGGG CTGCCCGCCC CGCCGAGCGC CGGCGTCGAC
CTGGACGCCG CGCTGCACGG CCCCGGCGAC GCCGACGAGC CGTTCACGCT GCCGCTGCTG
GCCGGCCTGC TCCATCACTC GTACGGGCTG CTCGGACGGC GGCTCGGCAT CCAGGCCAAC
TCCGATCTGG GAGTGCTGCC GTCGTACGCG TACGCCAACT GGCACCGCGG CGCCGCCTCC
GGTGGAGGTC TCTACCCGTG CAGCGTCTAC TGGGTCGCCG GGCCCGGCGC CGGGGTCACC
CCTGGGGTCT ACTACTACGC CCACGCCCGG CACGCGATGC AGCGGCTGCT CGGCGGCGAC
GTCAGCGCTC GGGTGAACGC GGCGGTCGCC CCACCCCACC CCGCCAGCCA GTTTCTGATC
GTCGGGGTGA AGTACTGGCA GAACGCCTTC AAGTACAACA ACTTCTCGTA CCACGTGGTG
TCGATGGACC TGGGTACGCT GCTGCAGTCA TGGCGGCTCT GGGCCGGCGC CCAGGGCCGA
CAGATCCGGC CCGTGCTCTG GTTCGATCAG GCGGCCGTCG CCGACCTGCT CGGGCTCGCC
CCCGACGATG AGACACTCTT CGCCGCCGTA CCCCTGACCT GGGCCGCGCC CGCCGCCCCC
GCGGCCCTGG CGCCCACGTC CGCCCCCACC CGTCGAAGGG AGCCGGCGAC GGTCCGGGTA
CGACACCGCG ACCAGGAGCG GTCCCAGACG CTGCTCACCT TCGACACGCT GCGGCAGGTC
AGCCGCAGCA CCGCCGCGTC GGTCGACCGG CCCGCTACCG GGGCGCTGGC ACCCGCCGCC
GCCCACCCGA CGCCGCCGGG CGGAACCCGC CTACCGCTAC CCGCCGCCGC GCCGCTGACG
CTGTCCGTCG AGGCCGCCCT CACCGCCCGC CGCAGCAGCT TCGGCCGGTT CCTGCACAAT
CGTCCGATGG CGGCGGAGCA GCTCGCCGCG CTGCTGCGGG CCACCACCGC CAGCACCGTG
CCATCCGAGA TCGACGGACC CGCAGACCGT CCGCTGACCC GGATCTACGC CTTCGTCAAC
GCCGTGGCCA ACGTGCCCGC CGGGGGATAC GTGTACGACC CGCAGGAGCA CAGCCTGGTC
GCCGTCACCA GTGGCCCACC AGGGGCATTC CTGCAACGCA ACTACACCCT GGCCAACTAC
AACCTGGAGC AGGCAGCGGT CGTGCTCGTC CTGACGGTGC GCACACACGC GGTGCTGGAC
GCCACCGGCG ACCGAGGCTA CAACCTCGTC AACGCCACCA TCGGCGCGAT GGCCCAGACC
TTCTACACTG TCGCCGCCGC ACTGCACCTC GGCGCGGGGG TCGCACTGGG CTTCGACGGC
ATCTCCTACG TCGAGGAGCT GGGACTCGCC GACAGCGACG AGTTCCCGCT GCTCATCATG
CTCGCCGGCG AGGAACGCGG ACAGCTCGGC GACTACCGAT ACGAGCTGCG GTGA
 
Protein sequence
MTSENPGLAH AYATAILRRG REPMPPADFT PNWADAPRRG KYYPRASAFG LPAPPSAGVD 
LDAALHGPGD ADEPFTLPLL AGLLHHSYGL LGRRLGIQAN SDLGVLPSYA YANWHRGAAS
GGGLYPCSVY WVAGPGAGVT PGVYYYAHAR HAMQRLLGGD VSARVNAAVA PPHPASQFLI
VGVKYWQNAF KYNNFSYHVV SMDLGTLLQS WRLWAGAQGR QIRPVLWFDQ AAVADLLGLA
PDDETLFAAV PLTWAAPAAP AALAPTSAPT RRREPATVRV RHRDQERSQT LLTFDTLRQV
SRSTAASVDR PATGALAPAA AHPTPPGGTR LPLPAAAPLT LSVEAALTAR RSSFGRFLHN
RPMAAEQLAA LLRATTASTV PSEIDGPADR PLTRIYAFVN AVANVPAGGY VYDPQEHSLV
AVTSGPPGAF LQRNYTLANY NLEQAAVVLV LTVRTHAVLD ATGDRGYNLV NATIGAMAQT
FYTVAAALHL GAGVALGFDG ISYVEELGLA DSDEFPLLIM LAGEERGQLG DYRYELR