Gene Sros_3497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3497 
Symbol 
ID8666785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3868619 
End bp3872131 
Gene Length3513 bp 
Protein Length1170 aa 
Translation table11 
GC content72% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003339176 
Protein GI271964980 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCATA CCCCCCGGCA CTGGCGAGCG GTCCTGATCG GGTCCGTCGC CGCCCTCGCG 
CTCATGCTCC CCCAGGGCGT GGCCCAGGCC GCGCCCGACC CCGGAAAGAT CCACGAAACC
GTCCAGGCGG AGCTGGCCGC GGACGACAAG GCCACCTTCT GGGTCCGCCT GAAGAACGAC
GCCGACCTCA GCGCGGCCCG CAGTGCGAAG ACCAAGACCG AGAAGGCCCG GCAGGTCTTC
CGGGCCAAGA CCGAGGTCGC CACGGCATCG CAGGCGGGCC TGCGCAAACT CCTCACCGCC
GAGCACGCCG ACTTCACCCC GTTCTGGATC GCCAACGCCG TCCAGGTGAC CGGCGACGCC
AAGCTGGCCG GGGAGATCGC GAAGCTGCCC GAGGTGGAGC GGATCGAGCC CGGCCGCGTC
ACCAAGCTGC CCGAGCCGCT GCCGGGCAAG GAGACGGCGA AGGTCGACGC GGTCGAGTGG
AACATCGACC GCGTCAACGC CCCGAGGGTA TGGAGCGAGC TCGGCACCCG CGGTGACGGC
ATCGTCGTGG CCAACATCGA CTCCGGCGTC CAGTTCGACC ACCCGGCGCT GGCCGCGCAG
TATCGCGGCA AGAAGGCCGA CGGCAGCGTC GAGCACGACT ACAACTGGTT CGACCCGGCG
GGTGTGTGCC CGAGCGCCGC GCCGTGTGAC AACAACGACC ACGGCACCCA CACGATGGGC
ACCATGGTGG GCGGCGAGGG CGCCAACACC ATCGGCGTCG CCCCCGGCGC CAAGTGGATC
GCGGCGAAGG GCTGCGAGAC CAACAGCTGC TCCGACGCCT CGCTGCTCGC CGCCGGGCAG
TGGGTGCTCG CGCCCACCGA CCTGAACGGC GACAACCCGC GTCCCGACCT GGCGCCGGAC
ATCGTCAACA ACTCCTGGGG CGGCGCCGGC TTCGACGCCT GGTACAAGGA GATCGTCGAG
GCCTGGGTGG CGGCGGGCAT CTTCCCGGCC TTCTCCAACG GCAACGTCAC CGGCGCGGGC
TGCAACTCCA GCGGCTCGCC CGGCCAGTAC GGCTCCAGCT ACAGCGCCGG CGCGTTCGAC
GTCAACAACG CCATCGCGAG CTTCTCCACC CGCGGTTCCG GCGAGAACGG CGAGATCAAG
CCCAACCTCG CGGCACCCGG CGTGAACGTG CGCTCGTCGG TGCCCGGCGG TTACGACTCC
TTCTCCGGCA CGTCGATGGC CAGCCCGCAC GTGGCGGCCA CCGTGGCGCT CATGTGGTCC
GCCTCGCCGG CCCTGCAGGG CGACATCGCC GCCACGCGGG CGCTGCTCGA CCGGACCGCG
GTCGACGTCG ACGACACCCG CTGCGGCGGC ACCGCCGCCG ACAACAACGT CTGGGGTGAG
GGCAGGCTCG ACACGTTCGC CGCGGTCCAG GCCGTCCCGG TCGGGGCGCT CGGCGCACTG
CAGGGCACCG TCACCTCCGG TGGCACGCCG GTGGCCGCGG CGACCCTCAC CGTCACCGGG
CCGCTGGGCC GTACGGTCAC CACCGCGCAG GACGGGACGT ACGCGCTGCC GCGCCTGCTG
GCGGGCGACT ACCAGATCAC CGTCAAGAAG TTCGGCTACG ACGACGCGAC GGCGACCGTC
ACCGTCGTGG CCGACCAGAG CGTCACCAAG GACGTGTCGC TGACGCAGCA GTCCGCGGGC
GAGGTGTCGG GGACGGTGAC CGCCGCGGGC GCACCCGAGG CGGGCGCCAC GGTGACCGCG
GTGGGCACGC CGGTCAGCGT GGTCACCGAC GCCGCGGGAC GGTACCGGCT CACGCTGCCG
AACGGCGGCT ACGAGCTGAG GATCACCCCG GTCTCCCGGT GCGCGGGCGG TCTCACCGTG
CCGATCACGG TGAACGGCGA CCTGACCAAG GACGTCGACC TGCCGCGCCG CGCCGACTCC
TTCGGTTACA CCTGCTCCGC CGCCGCCGAG GCGTACGTCG CGGGCACCGA CAAGCACCCG
CTCACCGGCG ACGACGCGGC CCAGCCGGTC ACGCTGCCGT TCACCTTCCC GTTCTACGGC
GGCGGCCACA CCAGCGGCTG GATCAGCAGC AACGGCTTCC TGAACTTCGC CGCCAGCAGC
ACCACGGCCA CCAACGGGGC CCTGCCCTCC ACCGCCGCGC CCAACACGGC GATCTACCCC
TACTGGGACG ACCTGGTGCT GGACGACCAG TCCGGGGTCT ACACCGCCAC CATCGGAACC
GCGCCCAAGC GCACCTTCGT CATCGAATGG CGCAACGCCC GGTTCTACTC CGACGCCGCT
CCGCGCATCT CGTTCTCGAC GCTGCTCGGC GAGGACGGCT CGATCGGGTT CCGCTACCGC
GGCATCACCA GCGAACGCGC CTCCGGGACC AGTGCCACGG TGGGCATCGA GAGCCCGGGC
GGCACCGACG CGCTGCAGTA CTCCCACAAC AGCGCCGCGC TCGCCGACGG CCAGAGCCTG
ACCTTCGCCG CGAGCCGGCA CGGGCTGCTG ACCGGCACCG TCACCGACGC CAACGACGGC
AAGCCGCTGG CGGGCGCGAC GGTCAAGGTG GGCGACGTGG CGACCTTCAC CACCGGGGAG
AACGGCACGT TCCTCGGGCA GGTCCTGGTG GGCGACTACC GGGTCGAGGT CTCCAAGGAC
AACTACGGGA CCTTTGCCCA GGAGATCACC GTCACCGCCG GAACCGTGAC CCGGGTCGAC
ACCGCCCTGG CCACCGGCCA GGTGACGGCC TCGGCCGGAG AGCTCACCCT GGTGATGCCG
GCCGAGTCCA CCAGGACCGG CACGGTCGAC CTGTCCAACC TGGGAGGCGC CACGACCTAC
ACCGTCGTGA CCGACCCCGC CCAGGGCTGG CTGGGCGTGA CGCCCGCCGC CGGCGAGCTC
GGCTCGGGCA AGTCGGTGAC GTTGAAGGTC ACCGCCTCCA GCGCGGGCGT CCAGCCGGGC
ACCGTCCGCA CCGGCAAGCT GCTGGTGCGC TCGGCGAGCG GCCGCAACCC GCAGATCGAG
ATCATCGTGA CGGTCGTGGT GCCCAAGCAC CAGGTCGCCA TCGACGCGGG CGGCACCAAG
GACCTCGTCG ACGCCGCCGG CGACCGCTGG ACCGCCGACC GAAAGTACAG CGCGGGCGGC
CACGGCTACG TGGGCTCCGG CACCAGGACG CACACGTCCA GCAAGGCGAT CAAGGGCACC
ACCGAGCAGG AGCTGTTCAA GCGCGCCCGC GAGTCGATGC TGGAGTACCG GTTCGACCAG
GTGCCCAACG GCACCTACAC CGTCGAGCTG GACTTCGCCG AGACCCGGGC CATGCGTGAG
GGCCGGCGCG TCTTCGACGT CCTCGTCGAG GGCCAGCTCG CGATCCCCGC GCTGGACCTG
GCTCTGGAGG CCGGCACGTA CACCGCCGTC ACCCGGCAGT ACACCGTGAA GGTCACCGAC
GGCCAGCTCA ACGTCCGGTT CGCCGAGCGG GTCGGCGATC CCATCGTCAA CGCCATCCGC
ATCTCCGAGC GTCCCGACAA GGCCACTCCG TAG
 
Protein sequence
MSHTPRHWRA VLIGSVAALA LMLPQGVAQA APDPGKIHET VQAELAADDK ATFWVRLKND 
ADLSAARSAK TKTEKARQVF RAKTEVATAS QAGLRKLLTA EHADFTPFWI ANAVQVTGDA
KLAGEIAKLP EVERIEPGRV TKLPEPLPGK ETAKVDAVEW NIDRVNAPRV WSELGTRGDG
IVVANIDSGV QFDHPALAAQ YRGKKADGSV EHDYNWFDPA GVCPSAAPCD NNDHGTHTMG
TMVGGEGANT IGVAPGAKWI AAKGCETNSC SDASLLAAGQ WVLAPTDLNG DNPRPDLAPD
IVNNSWGGAG FDAWYKEIVE AWVAAGIFPA FSNGNVTGAG CNSSGSPGQY GSSYSAGAFD
VNNAIASFST RGSGENGEIK PNLAAPGVNV RSSVPGGYDS FSGTSMASPH VAATVALMWS
ASPALQGDIA ATRALLDRTA VDVDDTRCGG TAADNNVWGE GRLDTFAAVQ AVPVGALGAL
QGTVTSGGTP VAAATLTVTG PLGRTVTTAQ DGTYALPRLL AGDYQITVKK FGYDDATATV
TVVADQSVTK DVSLTQQSAG EVSGTVTAAG APEAGATVTA VGTPVSVVTD AAGRYRLTLP
NGGYELRITP VSRCAGGLTV PITVNGDLTK DVDLPRRADS FGYTCSAAAE AYVAGTDKHP
LTGDDAAQPV TLPFTFPFYG GGHTSGWISS NGFLNFAASS TTATNGALPS TAAPNTAIYP
YWDDLVLDDQ SGVYTATIGT APKRTFVIEW RNARFYSDAA PRISFSTLLG EDGSIGFRYR
GITSERASGT SATVGIESPG GTDALQYSHN SAALADGQSL TFAASRHGLL TGTVTDANDG
KPLAGATVKV GDVATFTTGE NGTFLGQVLV GDYRVEVSKD NYGTFAQEIT VTAGTVTRVD
TALATGQVTA SAGELTLVMP AESTRTGTVD LSNLGGATTY TVVTDPAQGW LGVTPAAGEL
GSGKSVTLKV TASSAGVQPG TVRTGKLLVR SASGRNPQIE IIVTVVVPKH QVAIDAGGTK
DLVDAAGDRW TADRKYSAGG HGYVGSGTRT HTSSKAIKGT TEQELFKRAR ESMLEYRFDQ
VPNGTYTVEL DFAETRAMRE GRRVFDVLVE GQLAIPALDL ALEAGTYTAV TRQYTVKVTD
GQLNVRFAER VGDPIVNAIR ISERPDKATP