Gene Sros_5589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5589 
Symbol 
ID8668883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6119120 
End bp6120661 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content72% 
IMG OID 
Productextracellular solute-binding protein 
Protein accessionYP_003341084 
Protein GI271966888 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.333853 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCCC TGAACCGCCG TGCCTTCCTC GGCACCACCG CCGGGCTGTC CGCCGCCCTG 
CTCGCCGGAT GCGGTCCCGG TCCCGCGACC GGGACCGCCG CCACGGGCCG GCCACGCAGG
GGCGGCCGGC TGCGCGTCGC GTTCGCCGGA GGCGGGGCCC AGGAGACCCT GGACCCGCAC
CTGTCCAACC TGTTCGTCGA GGCCTCGCGG TCGAAGGCGC TGTTCGACAA GCTCGCCGAC
TACGGCCCCG ACCTGTCGGT CCAGCCCCGT CTCGCCGAAC GCTGGGAGCC CGACGCCGCC
CTCACCACGT GGCGGGTCAC CCTCCGCCAG GCCGCCTTCC ACGACGGAAG GCCGGTACGG
GCCGCCGACG TCCTGTACAG CTACGCCCGG ATCGGCGACC CCGCCCGCGC CTTCCGCGCC
AGGACCACCC TGGCGGTGAT CGACCTGGCC CGCAGCAGGG CCGTCGACGA CCGCACCGTG
GAGTTCGCGC TCAGGCAGCC GTTCGCCGAG TTCCCGAACG TGCTGGCCGC CTTCGGCGTG
TTCATCGTCC CGGAGGGCAC CGAGGACTTC ACCCGGCCCG TCGGCTCCGG CCCGTTCGTC
TTCGGCTCCT TCGAGCCGGG CCGCTCCCTG CTGCTCAGGC GCAACCCCGA CTACTGGGAG
GGCGCGCCCC ACGTCGACGA ACTGCAGTAC CTGATCGCCA ACGAGGAGTC GGCCCGCGTC
AACGCGCTGC TCGGCGGGCA GGTCGACTAC GCCCACGACA TCACCGCGAC CACCGCCCGG
ACCTATCGGG GCAACGACCG CCTGGCCGTC ACGCGGCTCA CCAACAGCGG CATGCAGGCC
TTCGCCATGA AACTGGACCG GCCGCCCTTC GACGATCGCG ACCTGCGCGA GGCCATGTTC
CTGCTGGCCG ACCGGGAGCA GCTGGTCGAC ACCGTGCTGG GCGGCGCCGG GCAGACGGGC
AACGACCTGT TCGGCAGGGG ATACCAGTAC TACGCCGAGG AGATACCGCA GCGCGCCCGG
GACCTCGACA GGGCGCAGTG GCTGGTGAGA AAGGCCGGCG CGAAGGGGCT GCGGATACGG
CTCGACACGT CCGCGGCGGC CGGCGGCTTC GTGGAGTCCG CGAGCGTCTT CGCCGACCAG
ATGCGGCAGG CCGGGCTCGA CGTCAGGGTC GCCGTGGGCG ACAAGGACAC CTACTGGAAG
GACGTCCTCG ACGGGGGCAG CCTGTGCTGC TTCCGCTCCG GTGCGATGCC CATCGAGTCC
CACTTCTCGC AGCGCCTGCT CAGCACGTCC ACCACCAACA TCACCAAGTG GAGGCGCCCG
GAGTTCGACG CCCTCTACAC CAGAGCCGTC TCGCTCGCCG ACGAGAAGGC GCGCCGCGAC
GTGTACGCCG AGATGCAGCG CATGCAGCAC GCCGAGGGCG GCTACCTCGT CTGGGGTTTC
GCCGACTGGC TCGTCGCGAC CGCGCCGGGG GTGGGCGGCG TCGTCGACGC CCCGGCCAAC
ACCCTGGACT GGGCCCGATT CGACAAGGTC TGGCTGGCGT GA
 
Protein sequence
MHSLNRRAFL GTTAGLSAAL LAGCGPGPAT GTAATGRPRR GGRLRVAFAG GGAQETLDPH 
LSNLFVEASR SKALFDKLAD YGPDLSVQPR LAERWEPDAA LTTWRVTLRQ AAFHDGRPVR
AADVLYSYAR IGDPARAFRA RTTLAVIDLA RSRAVDDRTV EFALRQPFAE FPNVLAAFGV
FIVPEGTEDF TRPVGSGPFV FGSFEPGRSL LLRRNPDYWE GAPHVDELQY LIANEESARV
NALLGGQVDY AHDITATTAR TYRGNDRLAV TRLTNSGMQA FAMKLDRPPF DDRDLREAMF
LLADREQLVD TVLGGAGQTG NDLFGRGYQY YAEEIPQRAR DLDRAQWLVR KAGAKGLRIR
LDTSAAAGGF VESASVFADQ MRQAGLDVRV AVGDKDTYWK DVLDGGSLCC FRSGAMPIES
HFSQRLLSTS TTNITKWRRP EFDALYTRAV SLADEKARRD VYAEMQRMQH AEGGYLVWGF
ADWLVATAPG VGGVVDAPAN TLDWARFDKV WLA