Gene Sros_1287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1287 
Symbol 
ID8664562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1325836 
End bp1327941 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content75% 
IMG OID 
ProductBeta-galactosidase 
Protein accessionYP_003337028 
Protein GI271962832 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.686384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACTGGC CGAGAGGGCT CGAAGGGCTG TGCTACGGCG GGGACTACAA CCCCGAGCAG 
TGGCCGGAGG AGGTCTGGAG GGAGGACGTC GGGCTGATGC GGCGGGCCGG GGTCAACCTG
GTCACCGTCG GGGTGTTCTC CTGGGCCCGG CTGGAGCCGT CACCCGGCGT CCACGACTTC
GGCTGGCTGG ACCGGGCCCT CGACCTGCTC CACAAGGGCG GGATCAGGGT CAGCCTGGCC
ACCCCCACCG CCTCCCCGCC GCCCTGGTTC GGCCTGGCCC ACCCGGACGC GCTCACCGTC
GCCGCCGACG GGACCCGGCT CACCCACGGC AGCCGCGACA CCTACTGCGT GAGCGCCCCG
GCCTACCGTG ACGCCGCCGT CCGGATCGCC ACCGGACTCG CCGAGCGCTA CCGCGAGCAC
CCGGCCCTGG CCATGTGGCA CGTCCACAAC GAGTACGGCA CCTGGTGCCA CTGCGACCAC
GTCGCCGCCG CCTTCAGGAC CTGGCTCGAA GCCCGTCACG GCACGCTGGA GGCGCTGAAC
GACGCCTGGA CCACCTCGTT CTGGGGCCAG CACTACTCGG CCTGGGAGCA GGTCCTGCCG
CCCCGCGCCA CCCAGTACCT CCCCAACCCG TCCCAGACCC TCGACTTCCG GCGGTTCCTG
TCCGACGCCA TGCTCGACTG CTTCCGCGAG CAGAAGGCGG TGCTGCGCGC GCTCACCCCG
GACGTCCCGG TCACCACGAA CTTCGTCTTC GGCGGCTGGG TGCCGGTGGA CCAGCGGCGC
TGGGCCGGGG AGGTGGACCT GGTCGCGATC GACCACTACC CGGCGGCCGA CCCCCCGGCG
GAGACCGCGT TCGGCGCGGA CCTCGCGCGG CACTGGGCCG GCGGGGCGCC GTGGCTGCTG
ACGGAGCAGG CGGTGGTGAC CTACACCGGG CCCCGGATGG TCGCCAAGCG GCCCGGGGAG
ATCGCCAGGC TGAGCCTGTC GCACATCGCG CGGGGCTCGC GGGGCGCGAT GTTCTTCCAG
TGGCGCGCCT CGCGGGGCGG GGCCGAGCTC TGGCACTCGG GCATGGTGCC GCACGCCGGC
CCGGACTCGC GGATCTTCCG CGAGGTCTGC GAGCTGGGCG CGCTCCTGCC CGCCCTCGAA
GAGGCCACCC GGGCGCCGGT CGAGGCGGAG GCGGCGGTCC TGTGGAACGT CGAGGCGGGG
TGGGCGCTGC AGTCGCCGGG CCTGCCGTCC ACGGAGCTGA GCTACCTGGA CGCGGTACGG
CAGGCGCACC GGGTGCTCTA CCGGCACGGC GTCACCGCCG ACTTCGCCCA TCCGTCCGAC
GATCTCTCGG CATACAGGTT CGTGCTCGTC CCCAGCCTCT ACCTGATCTC CGACGCCGAC
GCGGAGAACC TGCGCCGCTA CGTCGAGGGC GGCGGCACGC TCGTGGCCTC CTTCCTCAGT
GGGGTCGCCG ACGAGCACGC CCGCGTCCGG ACGGGCGGCT ACCCCGGGGC CCTGCGCGAC
CTGCTCGGCA TCCGGGTCGA GGAGTTCCAC CCGCTGCCCC CGGACGCCGC GATCCCGCTG
TCCACCCCCG GCGGCGGCCC CCTCGGCGCC GGCATCCTCC CGCTCGCCCG GGAGGGCGCC
GACCCGCCGC CGGGCGAGCG GGCTCCGCAC GACACCGGGA CCTTCTGGAG CGAGCACGTC
CACCTGGAGG GCGCCGAGGC GCTGGCCCTC TACGCCGTGC CGGAAGCCCC CGCCCTGGAC
ACCCCCGCCC TGGACTCCCC CGCCCCGGAC ACCCCCACCC CGCACGCGCC CGGCCCGCAC
GCGCCCGGCC CGCACGCGCT CAGGACGGAC GCTCCCGGGA CGGGTGCGCC GGGCGCCGCG
CTCGCCGGCC TGCCGGCGAT CACCCGCCAC CGGCACGGCC GGGGGACGGC CCTCTACCTG
TCCACCCGGC TCACCGACGG CGCCTACGCC CGCCTGCTGG GCCTGCGGCC CGCTCCCGTG
GAGCGCGTGC GGCGGGGCGG GTGGCTGTTC ACGATCAACC ACGGCGACGA GGAGCAGGAG
GGAACCGGCG GCCTGCGGTT ATCCCCCGGC GGTTACGCTG TACAAAAGGT GCAAGCAGGC
GTCTAG
 
Protein sequence
MDWPRGLEGL CYGGDYNPEQ WPEEVWREDV GLMRRAGVNL VTVGVFSWAR LEPSPGVHDF 
GWLDRALDLL HKGGIRVSLA TPTASPPPWF GLAHPDALTV AADGTRLTHG SRDTYCVSAP
AYRDAAVRIA TGLAERYREH PALAMWHVHN EYGTWCHCDH VAAAFRTWLE ARHGTLEALN
DAWTTSFWGQ HYSAWEQVLP PRATQYLPNP SQTLDFRRFL SDAMLDCFRE QKAVLRALTP
DVPVTTNFVF GGWVPVDQRR WAGEVDLVAI DHYPAADPPA ETAFGADLAR HWAGGAPWLL
TEQAVVTYTG PRMVAKRPGE IARLSLSHIA RGSRGAMFFQ WRASRGGAEL WHSGMVPHAG
PDSRIFREVC ELGALLPALE EATRAPVEAE AAVLWNVEAG WALQSPGLPS TELSYLDAVR
QAHRVLYRHG VTADFAHPSD DLSAYRFVLV PSLYLISDAD AENLRRYVEG GGTLVASFLS
GVADEHARVR TGGYPGALRD LLGIRVEEFH PLPPDAAIPL STPGGGPLGA GILPLAREGA
DPPPGERAPH DTGTFWSEHV HLEGAEALAL YAVPEAPALD TPALDSPAPD TPTPHAPGPH
APGPHALRTD APGTGAPGAA LAGLPAITRH RHGRGTALYL STRLTDGAYA RLLGLRPAPV
ERVRRGGWLF TINHGDEEQE GTGGLRLSPG GYAVQKVQAG V