Gene Sros_3342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3342 
Symbol 
ID8666630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3658619 
End bp3660235 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content72% 
IMG OID 
Productconserved hypothetical protein; K01187 alpha- glucosidase 
Protein accessionYP_003339024 
Protein GI271964828 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.292623 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0532512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGC TCGTCCACGC GGACCATGCC AGCGAGACCG CCACCCGGTG GTGGCGCGAC 
GCGGTGATCT ACCAGGTCTA CGTGCGCAGC TTCGCCGACG GCAACGGTGA CGGGATCGGC
GACCTGCTGG GCGTGCGGAG CCGGCTGCGG TATCTGGCCG ATCTGGGGGT CGACGCCATC
TGGCTGACCC CGTTCTACAC CTCGCCGATG GCCGACTTCG GCTACGACGT GGCGGACTAC
CGGGACGTCG ACCCCATCTT CGGGTCGCTG GGCGACGCCA GGGCCCTGAT CGACGACGCG
CACCGGCACG GCCTGCGGGT GATCGTCGAC GTCGTGCCCA ACCACACCTC CGACCGGCAC
GTGTGGTTCC AGCAGGCCCT GGCCGCCGGG CCCGGCAGCC CCGAGCGGGA GCGTTACATC
TTCCGCCAGG GCAAGGGGGA GAACGGGGAG CTGCCCCCGA ACGACTGGGA GTCGGTCTTC
GGCGGCCCCG CCTGGACCAG GTTGCCCGAC GGCGAGTGGT ACCTGCGCCT GTTCGCCCCC
GAACAGCCCG ACCTGAACTG GGACAACCCC GAGGTTCACG CGGAGTTCGA GTCGGTCCTG
CGCTTCTGGC TCGACCTGGG CGTGGACGGC TTCCGCGTCG ACGTGGCGCA CGGCATGGTC
AAGGCCGACG GCCTGCCCGA CGTCGGCCAC CCCGACCAGG TCCGGATGAT CGGTTCCGAC
GTGGTCCCGT TCTTCGACCA GGACGGCGTG CACGAGATCC ACCGCGGCTG GCGCAGGCTG
CTCGACTCCT ACCCGGGCGA GAGGATCGGC GTCGCCGAGG CGTGGGCGCC GTCCCCGCAG
CGGCTGGCCA ACTACGTCCG CCCGGACGAG CTGCACCAGG CGTTCAACTT CCACTTCCTG
AACACCCCGT GGGACGCGGC CGGGTTCCGC ACGGTGATCC AGGAGTCGCT CGCCACGGCC
GGACTGGTCG GCGCGCCCAG CACCTGGGTG CTGTCCAACC ACGACGTCAA GCGGCACCTG
ACCCGCTACG GCGGCGGCGA GATCGGCCTG CGCCGCTCCC GCGCCGCGGC CCTGCTGACG
CTCTCCCTGC CCGGCTCGAC CTACGTCTAC CAGGGCGAGG AGCTCGGGCT GCCGGAGGTC
CTCGACCTGC CGGAGGAGTT CCTGCGCGAC CCGCAGCGGC TGCGCAACCC CGACGACGGC
CGCGACGGCT GCCGGGTCCC CATCCCGTGG GCCGACGTCG AGCCGCACTT CGGCTTCAGC
CTGCCAGGCA TCGAGGAGTC ATGGCTGCCC ATGCCCGCCT CCTGGGGACC GCTCAGCGTC
CAGTCCCAGC TGCGCGACCC GCTCTCCACG CTGCACCTCT ACCGGACGGC GCTGGAGATC
AGGCGGGACC GCCGCTCCTT CGGCGACGCG CCGCTGACCT GGCTGGACTC ACCCGAGGGC
ACGCTGGCCT TCACCCGGGG CGACGGCTTC GCCTGCACGC TCAACCTGAC CGGCGAGCCG
GTCGAGCTGC CCGCGCCCGG ACGGGTCCTG CTGGCCAGCG AGGAACCGGT CGTCGACGGC
GACACGGTAC GGCTCGCCCC CGACTCCGCG GTCTGGTGGG AACGCGATGC CGTATAG
 
Protein sequence
MTELVHADHA SETATRWWRD AVIYQVYVRS FADGNGDGIG DLLGVRSRLR YLADLGVDAI 
WLTPFYTSPM ADFGYDVADY RDVDPIFGSL GDARALIDDA HRHGLRVIVD VVPNHTSDRH
VWFQQALAAG PGSPERERYI FRQGKGENGE LPPNDWESVF GGPAWTRLPD GEWYLRLFAP
EQPDLNWDNP EVHAEFESVL RFWLDLGVDG FRVDVAHGMV KADGLPDVGH PDQVRMIGSD
VVPFFDQDGV HEIHRGWRRL LDSYPGERIG VAEAWAPSPQ RLANYVRPDE LHQAFNFHFL
NTPWDAAGFR TVIQESLATA GLVGAPSTWV LSNHDVKRHL TRYGGGEIGL RRSRAAALLT
LSLPGSTYVY QGEELGLPEV LDLPEEFLRD PQRLRNPDDG RDGCRVPIPW ADVEPHFGFS
LPGIEESWLP MPASWGPLSV QSQLRDPLST LHLYRTALEI RRDRRSFGDA PLTWLDSPEG
TLAFTRGDGF ACTLNLTGEP VELPAPGRVL LASEEPVVDG DTVRLAPDSA VWWERDAV