Gene Sros_0028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0028 
Symbol 
ID8663291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp31061 
End bp32506 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content70% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003335831 
Protein GI271961635 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.125752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.588325 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACT CCCGGATCAG CGCCGACGAG AACCGGCGCC AGGTGCTCTA CTGGCGGCTC 
CTGGCGCGGC TGTTCGACGG TGACGAACAG CCCTCGCTGG AGGCGGCCAG CATGGCCATC
GTCGGCGACC TCGGACTGCC GCCGGCGCTG CTGGACCCGG CGGTGTCGGT GGACAACATC
GTGCAGCGCT TCCCTGAGCT GGGCGCCGAG CTGCAGGGCC TGCTGGCGCA GGAGGACGGC
GTCGCGTCCG CTTCCACCGA GGGGCACCCG CCGGGCCGGG AAGGCGAGGA CGATGTCGCG
CCCGCTACCG GTGACGGGCA CCTGCCGGAT GAGGCGGAGG ACGGTGTCGC GTCCGCTCCG
GCCGGCGGGC ACTCGCCGGA GCGGGCCGGT GCGGCCGAGG TGCGCCGGGC GGCCCTGGTG
TCGAAGCTCC TGCTCAACGT GTTCTCGACG GGGACGGGCA GCGTCAGCGC CACCGATCTG
GCCCGCTGGC AGCAGGACGC CGGCTGGTTC GAGCAGGCGC TCGGGGCCGA GCCGGGACAG
CTGCGGTCCC GTCCCGGAGG CGGCGAACTG GCCGGCGTGC TGGCCGGCCT GGAAGGCGAC
CTGGTCCGCC GCATGCGCCT GCGCGAGGTG CTGGCCGACT CGGCCCTGGC CGCCCGGCTG
ACGCCGAGCA TGTCGCTCAT CGAGCAGCTG CTGCGGGACA AGTCCAACCT GTCGGGAGTC
GCGCTGGCCA ACGCCAAGGC GCTGATCCGC AGGTTCGTCG AGGAGGTCGC CGAGGTGCTG
CGCACGCAGG TCGAGAAGAC CAGCGTGGGT GTCATCGACA GGTCGGTGCC GCCGAAGCGG
GTGTTCCGCA ACCTCGACGT CGAACGCACG ATCTGGAAGA ACCTCACCAA CTGGAGCCCG
GAGGACCAGC GGCTGTACGT GGACCGGCTG TACTACCGGC AGACCGCCCG GCGGACCACG
CCGGCACGGT TGATCGTGGT CGTCGACCAG TCGGGCTCGA TGGTCGACGC GATGGTGAAC
TGCACGATCC TCGCCTCGAT CTTCGCCGGC CTGCCCAAGG TGGACGTGCA TCTGGTCGCC
TACGACACCC GCGCGCTCGA CCTGACCCCG TGGGTGCACG ACCCGTTCGA GGTGCTGCTG
CGGACGACGC TCGGGGGTGG CACCAACGGG ACCGTCGCGA TGGCCGTCGC CCGGCCGAAG
ATCGCCGACC CGCGTAACAC CGTGATGGTG TGGATCTCCG ACTTCTACGA CAACCGGGCG
CTGATCACCG ACTTCGAGGC GGTGCACCGT TCGGGCGTGA AGTTCATCCC GGTCGGCTCG
GTGAACAGCT CGGGACACCA GAGCGTGGAC CCGTGGTTCC GCCAGAAGCT CAAGGACCTG
GGCACCCCGG TGATCTCGGG TCACATCCGC AAACTCGTGT TCGAGCTCAA GAACTTCCTC
GCCTAG
 
Protein sequence
MSDSRISADE NRRQVLYWRL LARLFDGDEQ PSLEAASMAI VGDLGLPPAL LDPAVSVDNI 
VQRFPELGAE LQGLLAQEDG VASASTEGHP PGREGEDDVA PATGDGHLPD EAEDGVASAP
AGGHSPERAG AAEVRRAALV SKLLLNVFST GTGSVSATDL ARWQQDAGWF EQALGAEPGQ
LRSRPGGGEL AGVLAGLEGD LVRRMRLREV LADSALAARL TPSMSLIEQL LRDKSNLSGV
ALANAKALIR RFVEEVAEVL RTQVEKTSVG VIDRSVPPKR VFRNLDVERT IWKNLTNWSP
EDQRLYVDRL YYRQTARRTT PARLIVVVDQ SGSMVDAMVN CTILASIFAG LPKVDVHLVA
YDTRALDLTP WVHDPFEVLL RTTLGGGTNG TVAMAVARPK IADPRNTVMV WISDFYDNRA
LITDFEAVHR SGVKFIPVGS VNSSGHQSVD PWFRQKLKDL GTPVISGHIR KLVFELKNFL
A