Gene Sros_3495 details


Gene Information

Locus tag: Sros_3495
Symbol:
ID: 8666783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3864976
End bp: 3866439
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: extracellular solute-binding protein, family 1
Protein accession: YP_003339174
Protein GI: 271964978
COG category:
COG ID:
TIGRFAM ID:
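
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3864976, 3866439)
print(length_bp)                  # 1464
print(protein_length(length_bp))  # 487
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.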


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.413396
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACAGCC ATTCGCCCCA GCCGAGCAGG CGCGACGTCC TCCGTGGCGC CCTGGCCGCC 
GGAGCGTCAC TGACTCTGGC CGGGAGCCTG AGCGCCTGCG GATCCCGCAG CGACGCGCCC
CGGACCGCCG GCGGCGACCA GTGGCGCCAG TTCAAGGGCG CCACCCTGAA CTTCATCTCG
GAGAACACCG CTCCCACCGC GGCCATCGCC GCCGACCTCC GCCCCTTCAC CGACCTGACC
GGGATCAACG TCAACATCGT GACGCTCGAG CTGACCGCGC TGGTCCAGCG GGTCGCCCTC
GACCTGGCCT CGGGACGGGC CCAGTACCAG GTGGTCTACG CCGACCCCTA CCAGGTGCTC
GCGCCGTACC AGCGGGGGCT GGTGGACCTG CGTTCGCTGC AGGCCGATCC GGGCCTGCCG
GACCTGCCCG GCGGCGTCAC GGACTTCATC CCCACCCAGC TCGACGCCGC CGGCCGGTTC
GTCGAGCCCG GACCGATCTA CGCCCTGCCG TACGACGCGC CGACGATGAT CTGGCAGTAC
CGGAGCGACC TGTTCGGCAA GTACCACGAC CGCATGGCCG ACGACCTCGG TTTCGACCCC
GCTCCGGGCG GCGACCGGAC GTGGGAGGAG TACTTCGGGA TCGCCCGCTG GTTCAACAAG
AACGCGACGT CGGACGTCAA GTACGGCACC GGGCACCAGG CCCGCCAGCA CGACTCCCTG
ATGAACGACT TCAGCAACGT GCTGTGGTCC TACGGCGGGG ACTACTTCGC CAACGGCCGG
GAGGTGGGGC GCATGGGGTC GCGGGATCCC GGCCCGTGCC GGCTCGACTC CGAGGCCGCG
ATCGCGGGCG CGGAGTTCTA CAACCGGCTG CTCGGCATCG CCGACCCCGC CTCGAAGACG
TGGGACTGGG ACGGCGTGGG CGCCGCGTTC CGCGCCGGCC GGCTGGCGAT GTGCCCCAAC
TGGCACGAGT ACGCGGCCAG CAACGAGCTG GTGCTGCCCG GCAAGGTCGG CTACGCGCCG
CTGCCCAGGG GACCGGCCGG CACCGCCAAC ATGTACGGGG GAACCGGGGT GGCGATCAGC
GCCAACACGC TGGCCCACGA GCGCGGCGCG GCCTGGCTGT TCCTCGTGTG GGCCACCTCG
CCCCAGACGC AGCTCGCCAA CCTCAGGAGC AAGGCCGGCG GCGGCACCCC CACCCGCACC
TCCGTGTACG AGCTGCCGGA GGTGCGCGCG GCCGAGAAGC GGCCGTCGCC GATGCCCAAC
ATGCTCACGG CCGCCGCGGT GCGGCAGGCC TGGCAGGCCG ACCGGATCGG CCTCCGTCCC
AAGATCCCGA TGTGGAACGA GTGCAACACG GCGATCTTCA CGCAGCTGTC CCGGATGCTC
ACCGGGGGCG CGTCGCCGGA GGAGGCGATG CGTTCGATCA CGTCGCGGGT GGACCGGATC
GTGGCACGAG GGTGGGTGGC CTAG
 
Protein sequence
MDSHSPQPSR RDVLRGALAA GASLTLAGSL SACGSRSDAP RTAGGDQWRQ FKGATLNFIS 
ENTAPTAAIA ADLRPFTDLT GINVNIVTLE LTALVQRVAL DLASGRAQYQ VVYADPYQVL
APYQRGLVDL RSLQADPGLP DLPGGVTDFI PTQLDAAGRF VEPGPIYALP YDAPTMIWQY
RSDLFGKYHD RMADDLGFDP APGGDRTWEE YFGIARWFNK NATSDVKYGT GHQARQHDSL
MNDFSNVLWS YGGDYFANGR EVGRMGSRDP GPCRLDSEAA IAGAEFYNRL LGIADPASKT
WDWDGVGAAF RAGRLAMCPN WHEYAASNEL VLPGKVGYAP LPRGPAGTAN MYGGTGVAIS
ANTLAHERGA AWLFLVWATS PQTQLANLRS KAGGGTPTRT SVYELPEVRA AEKRPSPMPN
MLTAAAVRQA WQADRIGLRP KIPMWNECNT AIFTQLSRML TGGASPEEAM RSITSRVDRI
VARGWVA
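
Both sequences are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that step, with a simple GC-fraction helper; `join_blocks` and `gc_fraction` are hypothetical names chosen for illustration, and only the first and last lines of the gene sequence are used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a 10-character block layout."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence listed above.
first_line = "GTGGACAGCC ATTCGCCCCA GCCGAGCAGG CGCGACGTCC TCCGTGGCGC CCTGGCCGCC"
last_line = "GTGGCACGAG GGTGGGTGGC CTAG"

head = join_blocks(first_line)
tail = join_blocks(last_line)
print(len(head), head[:3])   # 60 GTG
print(len(tail), tail[-3:])  # 24 TAG
```

The gene begins with GTG, one of the alternative start codons permitted by translation table 11, which is consistent with the protein sequence starting with M. Passing the full joined gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.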