Gene Sros_8141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8141 
Symbol 
ID8671469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8977185 
End bp8978888 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content65% 
IMG OID 
Producttrehalose synthase 
Protein accessionYP_003343539 
Protein GI271969343 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.351371 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCAGC TACCTGAGCC AATTCCGAAC ACCTTCGACG AGGAGAAGCC GCGCGATCCG 
TACTGGTTCA AGCGCGCTGT TTTCTACGAG GTGCTCATCC GGGGTTTCGC CGATTCCAAC
GGGGACGGCA CCGGAGACAT CCGCGGCCTC ATCGACAAAC TGGACTATCT CCAGTGGCTC
GGCGTGGACT GCCTCTGGCT GCTGCCCCTG TACGAATCGC CGCTCCGCGA TGGCGGCTAC
GACATCGCGG ACTTCATGAA GATCCTCCCC GAGTTCGGTG ATCTCGGAGA TTTCGTCAAG
CTGGTGGACG AAGCCCACAA GCGGGGCATG CGCGTCATCG CCGACCTGGT GATGAACCAC
ACCAGCGACC AGCACCCCTG GTTCCAGGCC TCCCGCCACG ACCCCGAGGG CCCGTTCGGC
GACTTCTACG TCTGGAGCGA CTCCGACGAG CTCTACAAGG ACGCCCGGGT CATCTTCATC
GACACCGAGA CGTCCAACTG GTCCCACGAC CCGGTGCGGG GCCAGTACTA CTGGCACAGG
TTCTTCTCCC ACCAGCCGGA CCTCAACTAC GAGAACCCGG ACGTGCAGGA CGCGATGCTG
GAGGTGCTGC GGTTCTGGCT GGACCTGGGC ATCGACGGGT TCCGGATGGA CGCCATCCCC
TACCTGTTCG AGCAGGACGG CACGAACTGC GAGAACCTGC CCAGGACCCA CGAATACCTG
AAGCGGGTCA GGGCCGAGGT CGACCGCCTC TACCCCGACC GGGTGCTGCT GGCCGAGGCC
AACCAGTGGC CGGCGGACGT GGTGGAGTAC TTCGGCGACC CGGCGACCGG CGGCGACGAG
TGCCACATGG CGTTCCACTT CCCGCTGATG CCGCGCATCT TCATGGCCGT CAGGCGGGAG
TCCCGCTACC CGATCTCGGA GATCATGGCC CAGACGCCGA AGATCCCCGA GAACTGCCAG
TGGGGCATCT TCCTGCGCAA CCACGACGAG CTCACGCTTG AGATGGTGAC CGACGACGAG
CGCGACTACA TGTACTCGGA GTACGCCAAG GACCCCCGGA TGCGGGCCAA CGTCGGCATC
CGGCGGCGGC TGGCCCCGCT GCTGGAGAAC GACCGCAACC AGATCGAGCT GTTCACCGCG
CTGCTGCTCT CGCTGCCCGG TTCCCCGGTG CTCTACTACG GCGACGAGAT CGGGATGGGC
GACAACATCT GGCTGGGCGA CCGCGACGGC GTCCGCACTC CGATGCAGTG GAGCCCCGAC
CGCAACGCCG GGTTCTCCGA CTGCGACCCC GCCCGGCTCT ACCTGCCGGT CATCATGGAC
CCGATCTACG GCTATCAGGC GATCAACGTC GAGGCGCAGC AGAAGAGCTC CGGCTCGCTG
CTGCACTGGA CCAAGCGGAT GATCGACATC CGCAAGCGCC ACCCGGTCTT CGGCCTGGGG
GCGTTCACCG AGCTGAACTC CTCCAACCCG AGCGTCCTCG CCTACGTGCG CGAGTACGGC
GACGACCGCA TCCTGTGCGT CAACAACCTG TCGCGGTTCC CGCAGCCGGT GGAGCTGGAC
CTGCGCCGGT TCGAGGGATC GGTGCCCGTC GAGACCATGG GCGGAGTACC GTTCCCACCG
ATTGGCGAAC TTCCGTATCT TTTGACGCTT CCTGGGCATG GGTTCTATTG GTTCACCCTG
CCACCCGTAA CCCAGGAGGC GTAA
 
Protein sequence
MSQLPEPIPN TFDEEKPRDP YWFKRAVFYE VLIRGFADSN GDGTGDIRGL IDKLDYLQWL 
GVDCLWLLPL YESPLRDGGY DIADFMKILP EFGDLGDFVK LVDEAHKRGM RVIADLVMNH
TSDQHPWFQA SRHDPEGPFG DFYVWSDSDE LYKDARVIFI DTETSNWSHD PVRGQYYWHR
FFSHQPDLNY ENPDVQDAML EVLRFWLDLG IDGFRMDAIP YLFEQDGTNC ENLPRTHEYL
KRVRAEVDRL YPDRVLLAEA NQWPADVVEY FGDPATGGDE CHMAFHFPLM PRIFMAVRRE
SRYPISEIMA QTPKIPENCQ WGIFLRNHDE LTLEMVTDDE RDYMYSEYAK DPRMRANVGI
RRRLAPLLEN DRNQIELFTA LLLSLPGSPV LYYGDEIGMG DNIWLGDRDG VRTPMQWSPD
RNAGFSDCDP ARLYLPVIMD PIYGYQAINV EAQQKSSGSL LHWTKRMIDI RKRHPVFGLG
AFTELNSSNP SVLAYVREYG DDRILCVNNL SRFPQPVELD LRRFEGSVPV ETMGGVPFPP
IGELPYLLTL PGHGFYWFTL PPVTQEA