Gene Sros_3956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3956 
Symbol 
ID8667246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4404701 
End bp4405951 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content72% 
IMG OID 
ProductABC-type sugar transport system periplasmic component-like protein 
Protein accessionYP_003339609 
Protein GI271965413 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.282171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0187852 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACATC CGGGTGCCCG TACCGCCGCA CTCGCCGCCG TCGCTGCGGC TGCCCTGCTG 
ACCTCGGCGG CCTGCGGCAG CGGCTTCGAC GGCCCCGCGG GCGGGAACAC CGAGCAGAGC
GGCGGCCCGG CGGCCCTGCG GATCCTCATC GGCTCCTCCG GCGACGCCGA GACCGCCGCG
GTACGCTCGG CGGCCGGCGC CTGGGCCAAG GCGACGGGCA ACACCGCGAC GGTCACTCCC
GCCCAGGACC TCTCCCAGCA GCTCGGCCAG GCCTTCGCCG GGAGCGACCC CCCGGACGTG
TTCTACGTGG ACGCCTCGCG CTTCGCCGAC TACGCGAGCG TCGGGGCGCT GGAGCCGTAC
GGTGACAGGA TCTCCGACTC CGGGGACTTC TACCCGAGCC TGCGCACCAC CTTCAGCCAC
GACGGCGTCT TCTACTGCGC GCCGAAGGAC TTCGCCACCC TGGCGCTGAT CGTCAACGAC
GACCTGTGGA AGAAGGCCGG GCTGACCGGC GCGGACGTGC CCACCACCTG GGAGCAGCTC
ACCTCGGCGG CGGAGAGGAT CAAGGCCGCG GGGGTCACCC CGCTGGTCGT CGGCGACACC
CATGAGCGGA TCGGGGCCTT CATGGTGCAG GCCGGGGGCT GGATCACCAG CGACGACGGC
AGGCGGGCCA CCGCCGACAG CGCCGCGAAC GTCACCGCCC TGCAGTACGT GCGGGGCCTG
CTCAAGGGCG GGCTCGCCCG GTTCCCCAAG CAGCTCGACG CCGGATGGGG CGGTGAGGCC
TTCGGCAGGG GCAGGGCCGC GATGACCGTC GAGGGCAACT GGATCAGGGG GGCGATGAGA
GCGGACCACC CCGGCGTCGC CTACACCGTC CACGAGCTGC CGGCCGGGCC GGCGGGCAAA
GGCACGCTGT CCTTCACCAC CTGCTGGGGC ATAGCCGCCA AGAGCAGGCA CAAGAAGCAG
GCGATCAGCT TCGTCGAGGA GATGACCAGG GCCGGCCGGC AGATGGAGTT CGCCAGGGCG
TTCGGCGTGA TGCCCTCCCG CCGGTCCGCC AGGGCCGCCT TCACCGGGGA GTTCCCGGAC
GACACCCCGT TCGTGAACGG CGCCGACCAC GCCCACGGCC CGGTGAACAC CCCGAAGATG
GCCAATGTGC TGGCCGACTT CGACGACGGC CTCCAGCAGC TCGCCTCCAC CGACCCGAAG
ACGCTCCTGG CCCGCCTGCA GAAGAACACC CGGGCCGCGC TCGGCGACTG A
 
Protein sequence
MRHPGARTAA LAAVAAAALL TSAACGSGFD GPAGGNTEQS GGPAALRILI GSSGDAETAA 
VRSAAGAWAK ATGNTATVTP AQDLSQQLGQ AFAGSDPPDV FYVDASRFAD YASVGALEPY
GDRISDSGDF YPSLRTTFSH DGVFYCAPKD FATLALIVND DLWKKAGLTG ADVPTTWEQL
TSAAERIKAA GVTPLVVGDT HERIGAFMVQ AGGWITSDDG RRATADSAAN VTALQYVRGL
LKGGLARFPK QLDAGWGGEA FGRGRAAMTV EGNWIRGAMR ADHPGVAYTV HELPAGPAGK
GTLSFTTCWG IAAKSRHKKQ AISFVEEMTR AGRQMEFARA FGVMPSRRSA RAAFTGEFPD
DTPFVNGADH AHGPVNTPKM ANVLADFDDG LQQLASTDPK TLLARLQKNT RAALGD