Gene Sros_4988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4988 
Symbol 
ID8668282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5505936 
End bp5507180 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content67% 
IMG OID 
Productextracellular solute-binding protein 
Protein accessionYP_003340531 
Protein GI271966335 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.601947 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTTTCC GCCTACCCCT TGCCTGCCTC GCCGGCCTTG CCGTCATGAC TGCCGGCTGT 
AGTTCCTCCA GCCAGCCGGA GGCCTCAGGA AAGGCCACGA TCAGCTACGC CATCTGGGAC
AAGAACGACC AGGCGAGCGC GGAGAAGATC ATCGCGGCCT TCCAGCAGGC CAATCCCAAT
GTCGCGGTGA AGCTCGAGAT CACCCCGTGG GACCAGTACT GGACCAAGCT CCAGACGGCC
GCGTCCGGCG GTGCCGCCCC CGACGTGTTC TGGATGAACA GCCTCAATGT CCGCATGTAC
GCCAAGGGGG GAATCATCAC CCCGATCGAG GAGTCGAAGG CCCAGGGCCT TCCCCCGGCG
GTCGTCGACG GGTACCGCTA CGACGGCAAG CTGTACGGCC TGCCGCACAA CGTGAGCATC
CCGGCGCTCT GGTACGACAA GAAGCTCTTC GACGCCGCCG GAGTGGCCTA CCCCACCGCC
GACTGGACCT GGGACGACGT CAAGGCTGCG GCCAAGAAGC TGACCGACCC GTCCAAGAAG
CAGTTCGGCA TCCTCGCCCA CATGTGGGAC CAGGGCGCCT TCTACCCCAC GATGCTCCAG
GCGGGCGGCC ACGTGCTGTC GCAGGACGGC AAGAAGAGCG GCTTCGACGA TCCCGCCTCG
ATCCAGGGCC TGGAGTACTG GACCGGCATG ATCAAGGACA AGGTCGGCCC CGTGGCGGAG
GTATACACCG ACACCGACCC CATCACGCTC TTCCAGTCTG GCAAGTACGG CATGCTGTAC
GGCGGCGTCT GGTTCGCCCC CACCTTCTGG GCCAACCCCG AGATCCGCGA GCGGATCGAC
GTGGCCCCGC TGCCCAAGGG CCCTGGCAAG GAAGCCGTCA TCCTGCTCGG CCTGGCCAAC
GCCGTCTCGG CCAAGAGCGA GCATCCGAAG GAGTCGGCGG CCTTCGCCGA GTTCGTCGCC
TCCGAGCAGG CCCAGAGGAT CCTCAGCGAC AGCGGCGGCG GCGCCCTCTC GCTCCGCGAC
GGCACGCAGG AGGGCTGGTT CAAGGCCTTC CCCTCCTTCC ACCTGAAGGA GACCTACGAC
GCTTCGATGC CGTACGGCGT GCCGTACCCG GTGTCACTGA ACACCGCGCA GTGGCAGGAC
GTGCAGAACA AGCTGCTCGC CGAGGCCTGG GCAGGCAAGC GGCCGGTGGC CGACGTCGCC
AAGGAGATCG CGACGCAGAT GAACGAGATC CTGGCCAAGG AGTAA
 
Protein sequence
MRFRLPLACL AGLAVMTAGC SSSSQPEASG KATISYAIWD KNDQASAEKI IAAFQQANPN 
VAVKLEITPW DQYWTKLQTA ASGGAAPDVF WMNSLNVRMY AKGGIITPIE ESKAQGLPPA
VVDGYRYDGK LYGLPHNVSI PALWYDKKLF DAAGVAYPTA DWTWDDVKAA AKKLTDPSKK
QFGILAHMWD QGAFYPTMLQ AGGHVLSQDG KKSGFDDPAS IQGLEYWTGM IKDKVGPVAE
VYTDTDPITL FQSGKYGMLY GGVWFAPTFW ANPEIRERID VAPLPKGPGK EAVILLGLAN
AVSAKSEHPK ESAAFAEFVA SEQAQRILSD SGGGALSLRD GTQEGWFKAF PSFHLKETYD
ASMPYGVPYP VSLNTAQWQD VQNKLLAEAW AGKRPVADVA KEIATQMNEI LAKE