Gene Sare_3147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3147 
Symbol 
ID5706205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3581382 
End bp3582950 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content70% 
IMG OID641272579 
ProductRicin B lectin 
Protein accessionYP_001537946 
Protein GI159038693 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.189477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCA CGCTGAACGC GCTGGTGGGC TTCGTCCTCG GCATATGTCT CGTCACCGCG 
CCCGCCCCCG CCGTCGGCGC CGCTACCGCT GGTGTATCCG GCGCCTCGGA CACGCTCGCC
ACACCGGTCG TCGCCACACC GGTCGCCGCT GAGCCGACAG CGCTCCCGGC CGGCCAACTC
GAAGCCCTCC GGCGAGATCT CGACCTCACC GCCGACCAGT TGGCCGCGCG TCTCACCGTT
GACGCCACCG CGCCGTCGAT CGAGCGGCGG ATGCGGGCCG AACTGGCCGA CGCGTACGCC
GGAACCTGGA TCACCGCCGA TGGACGCACC ACGGTCGTCG GGTTGACCGA TCCGGCACTC
GCCGACCAGA TCCGCGCCGT CGGGGCCGAG CCTCGAACCG TCACCCGCAG CCTCGCCGAG
CTGAGGTGGC TCACCACCAG ACTGGACCGT CGGGCAGCGC GAGCCGGTGA CGCGGTACAC
GCCTGGCATG TCGCGCCGGC CAGCAACACG GTCGCGATCC AGGCCAGCAA CCCCGCGGCC
GCCACCAGCT TCGCCCGCGC CGCAGGGCTG CCCAACGACG CCGTGTCGGT GGTGGTCAGC
GACGACGCCT ACCGCCCGGT CTACGACATC CGGGGAGGCG ACCAGTATGT GATCGACAAC
CGCCTCATCT GCTCGGTCGG CTTCGCCGTG GCCGGCGGAT TCGTCACCGC CGGACACTGC
GGCGACGTCG GCGAGCCCAC CAGTGGCTCC GGCGTCGCGC AGGGGACCGT CCGTGGCTCG
TCATTCCCGG GCGACGACTA CGGCTGGGTC CAGACCAACG CCACCTGGAC TCCCCGACCA
TGGGTGTCCA CCCACGACGG CAACGTGGTC ACGGTGACCG GGTCGCAGGA GGCGGCGGTC
GGTGCCTCGG TCTGTCGGTC CGGCCGAACA ACCGGCTGGA GGTGCGGCAC CATCACCGCC
ACGAACGTCA CCGTCAACTA CTCCGGCCAA CTCGTCCACG GGTTGGTCCG CAGCACCGCC
TGCGCACAGC CCGGGGACTC CGGCGGGCCC TTCGTCGCCG GCTCCCAGGC ACAGGGTGTC
ACCTCGGGTG CCGGCGGCGA CTGCGCCTCC GGCGGCACCA CCGTCTACCA GCCGGTCAAC
GAGATCCTGT CCCGCTACGG GCTGTCACTC ACCACTTCTG GCGGCGGATC GACGAACAGG
ATCATCGGTT TGGCCAACAA GTGCGTCGAC GTACCGGGCG CCAACGGGGC CGACGGGCAG
TACCTGCACC TGTGGCACTG TAATGGCACC AACGCACAGG ACTGGACGTT CCCGGGCGAC
GGCACCATCC GGGCCTTCGG CCTCTGCATG GACGTCGCCT GGGGTTCTCG GGAGAACGGC
GCGGTGGTCC AGCTCGCGCA CTGCAGTGGC AACCCAGCCC AGCAGTGGGT GCTCACCGGC
GCCAACGACC TCGTCAACCC ACAGGCGAAC AAGTGCCTCG ACGTCAAGGA CTGGAACAGC
GCCGACGGCG CCCGGCTGCA AACCTACGAA TGCCATGGTG GCGCCAACCA GAAGTGGCGT
CTCGGGTGA
 
Protein sequence
MSRTLNALVG FVLGICLVTA PAPAVGAATA GVSGASDTLA TPVVATPVAA EPTALPAGQL 
EALRRDLDLT ADQLAARLTV DATAPSIERR MRAELADAYA GTWITADGRT TVVGLTDPAL
ADQIRAVGAE PRTVTRSLAE LRWLTTRLDR RAARAGDAVH AWHVAPASNT VAIQASNPAA
ATSFARAAGL PNDAVSVVVS DDAYRPVYDI RGGDQYVIDN RLICSVGFAV AGGFVTAGHC
GDVGEPTSGS GVAQGTVRGS SFPGDDYGWV QTNATWTPRP WVSTHDGNVV TVTGSQEAAV
GASVCRSGRT TGWRCGTITA TNVTVNYSGQ LVHGLVRSTA CAQPGDSGGP FVAGSQAQGV
TSGAGGDCAS GGTTVYQPVN EILSRYGLSL TTSGGGSTNR IIGLANKCVD VPGANGADGQ
YLHLWHCNGT NAQDWTFPGD GTIRAFGLCM DVAWGSRENG AVVQLAHCSG NPAQQWVLTG
ANDLVNPQAN KCLDVKDWNS ADGARLQTYE CHGGANQKWR LG