Gene Sros_5000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5000 
Symbol 
ID8668294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5525208 
End bp5526428 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003340542 
Protein GI271966346 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.728211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.505446 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCCTGT CCGACCCCAT CCGCTACGGC GCCAACTACG TGCCCTCGGC CGGCTGGTTT 
CACAGTTGGC TCGACCTCTC GCTCGACGCC GCCCGCCGCG ACTTCGAGGA CCTGGCCTCG
ATCGGCCTGG ACCACGTGCG CGTGTTCCCG ATCTGGCCGT GGATCCAGCC CAACCGGGCG
CTGATCCGGC AGCGCGCCGT GGACGACCTG CTGGCGCTGA TCGACGTCGC CGCCGAGTTC
GGCCTGTCCG TCGCCGTGGA CCTGCTCCAG GGACACCTGT CCAGCTTCGA CTTCCTGCCG
TCGTGGGTGC TCACCTGGCA CCGGCGCAGC CTCTTCACCG ACCGCGGCGT GCGCGACGGC
ATCGCCGCGT ACGCCGACCG GCTCGCCCGC GCCGTCGCCA CCCGCGACAA CGTGTTCGCC
GTCACGCTCG GCAACGAGGT CAACAACCTC TACCCGAGCA ACCCCACCAC GCCCGAGGCG
TCCACGGCCT GGGCCGCCGA ACTGGTCGAC GTCGTGCGCT CCGCCGCGCC GGGCCTGCTC
GCCCTCCACT CGCTGTACGA CGCCACGTGG TACGACCCGG AGCACCCGTT CCATCCCGCC
GACAACGTGG ACCTCGGCGA CCTGACCACG GTCCACTCCT GGGTGTTCAA CGGCGTCTCC
GCGATCGACG GCCCGCTCGG CCCGGCCACC GTCGGCCACG CCGACTACCT CGTCGAACTG
GCCGCGGCCA CCTCGCTCGA CCCGGCCCGG CCCATCTGGC TGCAGGAGAT CGGCGTTCCC
CTGCCCGACG TGCCCGAGGC CCACGCCGCC GAGTTCGTCC GCCGCACGCT CGACACGGTG
ACCGCCAACC CCGCCCTGTG GGGCGTCACC TGGTGGTGCT CCCACGACCT GGAACGTTCC
CTCACCGACT TCCCGGAGCG TGAGTACGGC CTGGGCCTGT TCACCGTGGA CCACCGCCCC
AAGCCCGCGG CCAAGGAACT CGCGGCGATC ATCGGCGAGC GCCGCCGCCG TACCGGGGAA
AGGCGCCCCG CCCTGCGGTG CGACGTGGAC CTGCGGACCG AGCCCGGCCG CCGGGCCGAG
GTCGCGCCCG GCAGCGCCTT CCACACCGAA TGGGTCCGGC TGCGCCAGAC CGGACCAGTG
GCCATCGTCG CCGGCGACCG CGCCGCCGAC CCGGGCCACC TCACGACTCG GGGAATCGAC
ACCGTCCTCA CCCAGGAATG A
 
Protein sequence
MPLSDPIRYG ANYVPSAGWF HSWLDLSLDA ARRDFEDLAS IGLDHVRVFP IWPWIQPNRA 
LIRQRAVDDL LALIDVAAEF GLSVAVDLLQ GHLSSFDFLP SWVLTWHRRS LFTDRGVRDG
IAAYADRLAR AVATRDNVFA VTLGNEVNNL YPSNPTTPEA STAWAAELVD VVRSAAPGLL
ALHSLYDATW YDPEHPFHPA DNVDLGDLTT VHSWVFNGVS AIDGPLGPAT VGHADYLVEL
AAATSLDPAR PIWLQEIGVP LPDVPEAHAA EFVRRTLDTV TANPALWGVT WWCSHDLERS
LTDFPEREYG LGLFTVDHRP KPAAKELAAI IGERRRRTGE RRPALRCDVD LRTEPGRRAE
VAPGSAFHTE WVRLRQTGPV AIVAGDRAAD PGHLTTRGID TVLTQE