Gene Sros_5406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5406 
Symbol 
ID8668700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5923668 
End bp5925206 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content71% 
IMG OID 
Productconserved hypothetical protein; K01187 alpha- glucosidase 
Protein accessionYP_003340911 
Protein GI271966715 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA CGTGGTGGCG GGGAGCCGCG ATCTACCAGG TCTACCTCCG GAGTTTCGCC 
GACGGCAACG GCGACGGCGT CGGCGACCTC ACGGGTCTCC GCGCGCGCCT GCCGTACCTC
TCCGACCTGG GAGTCGACGC GATCTGGCTC AACCCCTGGT ATCCCTCGCC GATGGCCGAC
GGCGGCTATG ACGTGGCCGA CTACCGCGAC ATCGAACCGT CCTTCGGCAC CCTCGCCGAG
GCGGAGAAGT TCATCGAGGA AGCCCACGAG CTGGGCATCC GCATCATCAT CGACGTGGTG
CCCAACCACA GCTCCGACCA GAGCGCCTGG TTCCGGGAGG CGCTGGCCGC CGGACCGGGG
TCGGCGGCCA GGGAGCGGTT CTGGTTCAAG GACGAGCCCA ACGACTGGAA GTCCATCTTC
GGAGGCCCGG CCTGGACGCA GGTCCCCGAC GGGCAGTGGT ACCTGCACCT GTTCGCGCCC
GAGCAGCCCG ACTTCAACTG GACCAGCCCG GAGGTGCACC AGGAGTTCCA CGACGTGCTG
CGGTTCTGGT TCGACCGCGG CATCGACGGC ATCCGCATCG ACTCGGCCGC CGTGCTGGTC
AAGGACTTCG ACGGCGGCGA CCCCGACGGC TACACCGATC TCGCCGAGGT GCACGACGTC
TACCGGAGCT GGCGGCGGCT GTCCGACGAG TACGACGAGC GGCTGCTGAT CGGCGAGGTC
TGGTTCCCCG ACCAGGAGCG CTTCGCCCGC TACCTGCGCC CCGACGAGCT GCACACCGCC
TTCAACTTCG ACTTCCTGGG CAGCCCCTGG AGCCCGGCGG CGCTGCGTGA GTCGATCCGG
CAGACCCTGG CCACGCACGG GCCGCTCGGC GCTCCGGCGA CCTGGGTGCT GTCCAACCAC
GACGTCGCCC GCCCGGTCAC GCGGTACGGC AGGGCCGACA CCTCCTGGGA CAACGGCGAC
CGCAGGGACG GGGCGCCCTC CGACCTCGAA CTCGGCCACC GCCGGGCCAG GGCGGCGGCG
CTGCTGGCCA TGGCGCTGCC CGGCAGCGTC TACGTCTACC AGGGTGAGGA GCTCGGCCTC
CCCGAGGTCG AGGACATCCC CGGCGAGCTC CGCCAGGACC CGATGTGGCA CCGTTCGGGA
CACACCGTCG CCGGCAGGGA CGGCTGCCGC GTGCCCCTGC CCTGGTCGGG AGAGGAGGCG
CCGTTCGGCT TCGGCACGGG GACCTCGTGG CTGCCGCAGC CGGAGGCGTG GAGGAAGGTC
ACCGTCGAGG CACAGCGAGC GGATCTGGGC TCGATGCTCA ACCTCTACCG CGCCGCCCTG
CGCATCCGCC GCGAGGAACT CGGCGACGGC GCCCTGACCT GGCTCGACGC CGGGGACGAC
GTGCTGGCCT TCACCCGCGA GACCGGCCTC ACCTGCGTGG TCAACCTCGG TCCCGAGCCG
GTGGCACTGC CGTCCCACGA CGCCGTCCTG CTGGCCAGCG GTCCCCTCGA CGCCGGTGAG
CTCCCCTCCG ACACCGCCGT GTGGCTCCGC ACGTCCTGA
 
Protein sequence
MSETWWRGAA IYQVYLRSFA DGNGDGVGDL TGLRARLPYL SDLGVDAIWL NPWYPSPMAD 
GGYDVADYRD IEPSFGTLAE AEKFIEEAHE LGIRIIIDVV PNHSSDQSAW FREALAAGPG
SAARERFWFK DEPNDWKSIF GGPAWTQVPD GQWYLHLFAP EQPDFNWTSP EVHQEFHDVL
RFWFDRGIDG IRIDSAAVLV KDFDGGDPDG YTDLAEVHDV YRSWRRLSDE YDERLLIGEV
WFPDQERFAR YLRPDELHTA FNFDFLGSPW SPAALRESIR QTLATHGPLG APATWVLSNH
DVARPVTRYG RADTSWDNGD RRDGAPSDLE LGHRRARAAA LLAMALPGSV YVYQGEELGL
PEVEDIPGEL RQDPMWHRSG HTVAGRDGCR VPLPWSGEEA PFGFGTGTSW LPQPEAWRKV
TVEAQRADLG SMLNLYRAAL RIRREELGDG ALTWLDAGDD VLAFTRETGL TCVVNLGPEP
VALPSHDAVL LASGPLDAGE LPSDTAVWLR TS