Gene Sros_3323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3323 
Symbol 
ID8666611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3619231 
End bp3620985 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content73% 
IMG OID 
ProductBeta-galactosidase 
Protein accessionYP_003339005 
Protein GI271964809 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0719598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.43752 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGC TCACCCGCTG GGGCGCGGAC CTCGACCCCG ACGCGATCCT GCCCGAATAC 
CCCAGGCCCC AGCTCGCCCG CGACAGCCAC CTCAACCTCA ACGGCCGCTG GGAGTACGCC
ATCACCGCGG ACGAGGAGGA GCCCGGCGCC TACGACGGCG CCATCCTCGT GCCGTTCTCG
CCCGAGTCGC CGCTGTCGGG CGTCGGCCGG CAGCTGCTCC CCGGACAGAC GCTGTGGTAC
CGGCGCGCCC TCACCGCGCC CGACGGGTTC CTGCCGGCCG GCGACGTGCC CGTCCGGGTG
CTGCTGCACT TCGGCGCGGT GGACCAGACG TGCCGGGTGC TGCTCAACGG CGCCGAGGTC
GGCGCCCACA CCGGCGGCTA CCTGCCCTTC ACCTGCGACA TCACCGACGC CCTGCGCGAC
GGGGAGAACA CGCTGGTCGT GGCGGTGCGC GACGACTCCG ACACCGGGCA CCACGCGCGC
GGCAAGCAGA AACTGAAGCG CGGCGGCATC TGGTACACCG CCCAGTCGGG CATCTGGCAG
AGCGTGTGGG CCGAGTGCGT GCCCGCCGTC CACGTGGAGC GGCTGACCCT CACCCCGCAC
CTCGGCGAGG GATGCGTCGA GGTGACGGTG CACGCCGGGA CCGCCGGGGA CGCCCGGGTC
GAGATCCTCG CCGCCGGGGC CGCCGTCGCG CGGGCCGTCG TGCCGGTGGG GCGGCCGGTA
CGTGTCCCGA TCCCGGACGT GCGGCCGTGG AGCCCGGAGG ACCCCTTCCT GTACGACGTC
ACGGTGGAGC TCGGCGCCGA CCGCGTGCGC AGCTACGTCG GGATGCGCTC GTTCTCCGTG
GGGCCCGACG AGAACGGCGT GCCCCGGCTG CTGCTCAACG GCCGCCCCTA CTTCCACGCC
GGGATCCTGG ACCAGGGGTA CTGGTCCGAC GGCATGTACA CCGCGCCGTC GGACGAGGCG
ATGATCTACG ACATCGCCAC GATGAAGCGG CTCGGCTTCA CCATGCTGCG CAAGCACATC
AAGATCGAGC CGCTGCGCTG GTACTACCAC TGCGACCGGC TGGGCATGCT CGTCTGGCAG
GACATGGTCA ACGGCGGCGG CGACTACCAC CCGCTGGTCA TCACCGCGCC CGTGCTCACC
CCTCTGCGGC TCAGCGACCG CCGTCACCGG TGGTTCGGGC GTGCCGACGC CCAGGGCCGG
GCCCGCTTCC GCGCCGAGCT CCGCGAGACC GTGGAACACC TGCGCAATGT GGTGAGCCTG
GCCGTCTGGG TGCCGTTCAA CGAGGGCTGG GGGCAGTTCG ACGCGGTGCG GATCGCCGCC
GAGGTCGCCG AGCTCGACCC CACCCGCACG GTCGACCACG CCAGCGGCTG GCACGACCAG
GGCGGGGGCG ACCTGAAGAG CCTGCACGTC TACTTCCGCC GCTTCCGCGT GCCGCGCCGG
CGCCGCGGCG ACCGGCGGGT CCTCGTGCTG TCGGAGTACG GCGGGTACAA CCTGCGGGTG
GACGGCCACG CGTTCAACGA CAGGGACTTC GGCTACCGGC GGTACGGCTC GGCCGGGCAG
CTCGGCGAGG CGTTCAGCCG GCTGCACACC GAGCAGATCG TCCCGGCCGT CCGGCGAGGC
CTGAGCGCGA CCGTGTACAC CCAGCTCTCC GACGTCGAGG ACGAGCTGAA CGGGCTGCTC
ACCTACGACC GCGAGACGCT CAAGCTGCCC GCCGAGCTGG TGCGGTCGGT CAACGCGCTG
CTGCGGCTGG AGTAG
 
Protein sequence
MSLLTRWGAD LDPDAILPEY PRPQLARDSH LNLNGRWEYA ITADEEEPGA YDGAILVPFS 
PESPLSGVGR QLLPGQTLWY RRALTAPDGF LPAGDVPVRV LLHFGAVDQT CRVLLNGAEV
GAHTGGYLPF TCDITDALRD GENTLVVAVR DDSDTGHHAR GKQKLKRGGI WYTAQSGIWQ
SVWAECVPAV HVERLTLTPH LGEGCVEVTV HAGTAGDARV EILAAGAAVA RAVVPVGRPV
RVPIPDVRPW SPEDPFLYDV TVELGADRVR SYVGMRSFSV GPDENGVPRL LLNGRPYFHA
GILDQGYWSD GMYTAPSDEA MIYDIATMKR LGFTMLRKHI KIEPLRWYYH CDRLGMLVWQ
DMVNGGGDYH PLVITAPVLT PLRLSDRRHR WFGRADAQGR ARFRAELRET VEHLRNVVSL
AVWVPFNEGW GQFDAVRIAA EVAELDPTRT VDHASGWHDQ GGGDLKSLHV YFRRFRVPRR
RRGDRRVLVL SEYGGYNLRV DGHAFNDRDF GYRRYGSAGQ LGEAFSRLHT EQIVPAVRRG
LSATVYTQLS DVEDELNGLL TYDRETLKLP AELVRSVNAL LRLE