Gene Sros_5703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5703 
Symbol 
ID8668997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6237603 
End bp6238820 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content76% 
IMG OID 
Productpeptidase M20 
Protein accessionYP_003341194 
Protein GI271966998 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.231398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGG ACGTCATGCT CGGGGACCTG GAGGAGCTCG TCTCGTGCGA GTCGTTCTCC 
GCCGACCACG AGGCGGTGGC CCGCAGCGCC CGGGTGGTCG CCGACCAGGG GCTGCGCAGG
CTCGGCGCGC GCCCCGAGAC GATCGTGATC GACGGGGTCA CCCACCTGCG GTGGACCTTC
GGCACGCCCC GGGTCCTGCT GGTCGGCCAC CACGACACCG TCTGGCCGAT CGGGACGCTC
GCCGAACATC CCTGGTCGCT GGTGGACGGG ATCGCCCGCG GGCCCGGGGT GTTCGACATG
AAGGCCGGGC TGGTGCAGGC CTTCCACGCG CTGGCCGCGC TGCCGTCGCC GGAAGGGGTG
TGCCTGCTGG TCACCGGGGA CGAGGAGGTC GGCTCACCGT CCTCGCGCGC GCTGATCGAG
GAGTCGGCGC GCGGCTGCGC GGCCGCGTTC GTGCTGGAGG CCGGCGCCGA CGGCGGCGCG
CTCAAGACCG CGCGCAAGGG CACGTCGATC TACGAGCTCG TGGTGCACGG CAGGGCCGCG
CACGCAGGCC TGGAGCCCGA GAGGGGCGCC AACGCCGGGA TCGAGCTGGC CCACCAGATC
CTCGCCCTCG CCGGGATCGC CGACCGGGTG AACGGTGGCC GTCCAGGCGT GCCGGCGGCG
GAGACCTCAC CCGCGGCCGC GCGCGGACTG CCCCTTCCCC CGGGCCCCGC GGCACCGGGG
ACGCCGTCCG CCGCGGCGGC GCCCGGCGGA CCGGGCGGCC TCGGACCGGT CACCGTCACC
CCGACCGTGC TGTCCGGGGG CACCACGACC AACACCGTGC CCGCGCTGGC CCGCGTGGAG
GTGGACGTGC GGGTGCCCAC CCTCGCCGCG CAGGAGCGGG TGGACGAGCT GATGAGGGCC
CTGTCCCCCC GGCTCGCCGG GACCCGGCTG GAGGTCCTGG GAGGGCCGAA CCGCCCTCCC
CTGGAGGAGA CCTCGTCGGC CGGGCTGTTC GCCCTGGCCC AGCGGATCGC CGCCGGCCTC
GGGCTCGCCC CCCTCTCCGG CGTCGGAGTG GGCGGAGCCT CCGACGGCAA CTACACCGCC
GGAGCCGGCT GCCCCACCCT CGACGGGCTG GGCGCGGTGG GCGGCGGCGC CCATGCCGCA
CACGAGCACG TGGTCGTGGC CGAGATGCCC GGCCGTACGG CACTGCTGAC CGGCCTGATC
CGGGCGGTGC TCGGATGA
 
Protein sequence
MNLDVMLGDL EELVSCESFS ADHEAVARSA RVVADQGLRR LGARPETIVI DGVTHLRWTF 
GTPRVLLVGH HDTVWPIGTL AEHPWSLVDG IARGPGVFDM KAGLVQAFHA LAALPSPEGV
CLLVTGDEEV GSPSSRALIE ESARGCAAAF VLEAGADGGA LKTARKGTSI YELVVHGRAA
HAGLEPERGA NAGIELAHQI LALAGIADRV NGGRPGVPAA ETSPAAARGL PLPPGPAAPG
TPSAAAAPGG PGGLGPVTVT PTVLSGGTTT NTVPALARVE VDVRVPTLAA QERVDELMRA
LSPRLAGTRL EVLGGPNRPP LEETSSAGLF ALAQRIAAGL GLAPLSGVGV GGASDGNYTA
GAGCPTLDGL GAVGGGAHAA HEHVVVAEMP GRTALLTGLI RAVLG