Gene Sros_3535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3535 
Symbol 
ID8666823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3921180 
End bp3923228 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content73% 
IMG OID 
ProductBeta-galactosidase-like protein 
Protein accessionYP_003339214 
Protein GI271965018 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCATCG AGGTCCGCAA CGGTGTCACC ATCGTCGATG GAGAACCCCG GGTGCTCGTC 
ACCGCCGACT ACCCCTACTA CCGGGACGAC CCCGGCGTCT GGGCCGACCG GCTACGGGCG
ATCCGCGACG AGCTGGGCAT CGAGGTGATC AGCAGCTACA TCCCGTGGCG GCACCACCAG
CCCGACGCCG CGACGGCGCC CGACTTCACC GGTGACGGCC ACCCCGGCAG GGACGTCGTC
GGTTTCCTCA ACCTCTGTCA CGATCTCGGG CTGAAGGTCA TCGCCAAGCC CGGCCCGTTC
ATCCATGCCG AGACCACCTA CGGCGGCCTG CCCGACTGGG TCTGCCCCTC GGCCGACGCG
GAGATCGAGC CGCTGCTCGA CGCGGCGGGC GCCGCCTCGT GCTGGGCGGA CTCGGCCGCG
CGCCCGCCGG GCAGGCCGCT CCCCGCCCCG CTGGGGGCGG CGTTCCTGGC CAGGGCCGGC
AGGTGGCTGG CGGCGGTCGG CAAGGAGGTG CTGGACGCCG CGACCCACCC CGAAGGCCCG
GTCATCATGA TGCAGATCGC CAACGAGGGC ATCTACACCA ACGGCGCGCG GTCGCTGTCG
GCCTACGACT ACAGCCCGTC GGGGCTGGCG TTCTTCCGCG ACCGGCTCCA GGGATGGTAC
GGCTCGATCG AGGAGTACAA CCGCACCCAC GCGACGGCGC ACCGGCGCTG GGACGAGATC
GAGCCGCCCC GCTCCTGGAC CGGGGCCGAG CGGCCGGAGG AGATGCGGGG CCACGCCGAC
TGGGGCCGCT TCCACGCCGA ATACCTCACC GAGGTCTACC GGCGCTGGGC CGCCGCGGTG
GACTGCCGGG TCCCCGTGGT GGTCAACCTG AACCCGCCGA CGGTCGAGGA GCTGGACGGC
TGGCTGGCCC GGGTGCGCCC CGAGACGTGG GGGGACATCA CCTACGGGTT CACCAACTGG
ATGGGCGTGG TCTCGGCCGA CCCCGACGCC CAGGCGCGTT ACGTGATCGC CGCCAAGCGG
GCACCCGGCC CGAACCTAGA GGAGAACTGG GGCTTCTCCC AGCTGTACGA CCCCGCCTAC
TCCGACGCCG CCACCAGCTT CCACCAGAGC CTGCTGGCCT TGGCCGCCGG ATCCACCGGG
TTCAACGTCT ACACCGGAGC CGCCACCTCC GGCTGGTCAC CCGACCTGGA CTCCACCCAC
ACCGCGCCGT ATCCCGACAG CGCCCCGATC GCCGCCGACG GCTCCGCCAC CGCGAAGGCG
CCCGTCGTCC GCGTGCTGGC GGACTTCTTC GCCCTGCACG GGGTGGAGTT CCTGGAGTGC
GCGCCGGTCA CGGAAGAGGC CTTCGGGCTG TACCTGCCCT ACGCCGGGAT CGCCGCCTGG
CCGGGCGCGG AGCGGTTCGG GGCACCCCGG TGCGGTACGG CGCTGCGCGC CTTCCACGAC
CGCATGCGGC AGGCGGGCCG CGACTACGCC GTGGTCGAGC TGGAGAGCGC CACGCCCGAC
CGGCTGGCCG CGCACGGGAG GCTGACGGTT CCCGGCGGCC CGTTCATGCA TCGCCACGTC
CAGGACCTGC TGGCCGGCTA CCTGGCGGGC GGCGGCCGGA TCCTGCTGGA CGGCCCGGCG
CCCGGCCTCG ACGAGGACCT GCGCCCGTAC GGCGTGCTCG CCGAGGCGCT CGGCCGTACC
GCGTCCACGC CGGACGCCCC GCAGGCGGAG GCGGGGGCGG TCCGGGTGAC GCGCGGCAGG
GCGGACGCGT TCCTCCGGGC GCATCCCGGA CGCGACGTCC AGTACCTGAC GGTCCTCGTG
GACAGCGAGA ACGAGGGGCC CGTCAGGGTG GAGACCGCGT ACGGCGCCTT CGAGACCTCT
TGCGCGCGGG GCGGCGGGGC CGTGGTGAGG CTGGCCGGGG GCGTGCTGGA CGACTTCGTC
GTCAAGGGGC TCAACAGCTT CCTCGACTCC GCCGTGCCGG CCCGGATCAG TGTCGGCGAC
CAGGAGGAGC GGGCGGGCTT ACCCGCCGAC CTGGCCCGGA TCGGCAGGAG GATCCGCCTG
CTCGGGTAG
 
Protein sequence
MTIEVRNGVT IVDGEPRVLV TADYPYYRDD PGVWADRLRA IRDELGIEVI SSYIPWRHHQ 
PDAATAPDFT GDGHPGRDVV GFLNLCHDLG LKVIAKPGPF IHAETTYGGL PDWVCPSADA
EIEPLLDAAG AASCWADSAA RPPGRPLPAP LGAAFLARAG RWLAAVGKEV LDAATHPEGP
VIMMQIANEG IYTNGARSLS AYDYSPSGLA FFRDRLQGWY GSIEEYNRTH ATAHRRWDEI
EPPRSWTGAE RPEEMRGHAD WGRFHAEYLT EVYRRWAAAV DCRVPVVVNL NPPTVEELDG
WLARVRPETW GDITYGFTNW MGVVSADPDA QARYVIAAKR APGPNLEENW GFSQLYDPAY
SDAATSFHQS LLALAAGSTG FNVYTGAATS GWSPDLDSTH TAPYPDSAPI AADGSATAKA
PVVRVLADFF ALHGVEFLEC APVTEEAFGL YLPYAGIAAW PGAERFGAPR CGTALRAFHD
RMRQAGRDYA VVELESATPD RLAAHGRLTV PGGPFMHRHV QDLLAGYLAG GGRILLDGPA
PGLDEDLRPY GVLAEALGRT ASTPDAPQAE AGAVRVTRGR ADAFLRAHPG RDVQYLTVLV
DSENEGPVRV ETAYGAFETS CARGGGAVVR LAGGVLDDFV VKGLNSFLDS AVPARISVGD
QEERAGLPAD LARIGRRIRL LG