Gene Sros_3724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3724 
Symbol 
ID8667012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4129270 
End bp4132269 
Gene Length3000 bp 
Protein Length999 aa 
Translation table11 
GC content72% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003339390 
Protein GI271965194 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.606148 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.741628 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATCCAT CCCGCAGCCC CGGCGCGGGC CGCCTGAGCC TGCTCATCTC CACTCTGATC 
GCCCTCATCG CATCCGGCCT GGCCGTCACC GCCCCCGCCG CCGCTGCCGC TGCCGCCGAC
ACCCTGCTGT CCCAGGGCAG GGCCGCCACC GCCTCCTCCA CCGAAGGCGC CGCCTACGGC
GCCGCCGCCG CCTTCGACGG CGACCCCGCC ACCCGCTGGG CCAGCGCCTT CAGCGACCCG
CAATGGATCC AGGTCGACCT CGGCGCCACC GCCACCGTCA GCCAGGTCGT GCTGAACTGG
GAGGCCGCCT ACGGCAGGGC CTTCAAGATC CAGATCTCGG CCGACGGGAC CGCCTGGAAC
GACGCCTACA CCACCACCGC CGGCACCGGC GGCACCCAGA CCCTCCCCGT CTCCGGCACC
GGCCGCTACG TCCGCCTGCA CGGCACCGTC CGGGCCACCG GCTACGGCTA CTCCCTGTGG
GAGTTCCAGG TCTACGGCGC CACCGGCCCC GGCCCGGACC CGGGCGCCTG GACCGAGGTG
TGGCGGGACG ACTTCGACGG GACCTCCGGG ACCTCCCCGT CGGGGGCGAA CTGGATCCTG
CGCACCGGCA CCCGGTATCC CGGGGGTGCC GCGAAGTGGG GCACCGGCGA GGTCGAGACC
ATGAGCGCCT CGACCGCCAA CGTCTCCCTC GACGGCACCG GCAAGCTCAA CATCAGGGCC
GTCAGGGACG GGGCCGGCGA CTGGACTTCC GGGCGCGTCG AGACCCAGCG CACCGACTTC
GCCCCGCAGC CCGGTGAGCG GCTGAGGTTC ACCGCCAGGC TCAAGCAGCC CGACGTCCCC
GACGCGACGG GCTACTGGCC CGGATTCAGG GCGACCGGCG CCGCCTACCG GGGCGACTAC
AACAACTGGC CGGCTGTCGG CGAGACCGAC ATCATGACCG ACGTCAACGG CCGCAGCCAG
CTGTCGAACA CGCTGCACTG CGGCACCGCC CCCGACGGCG CGTGCAACGA GTACGGCGGC
CGCACCAGCG GCCTGGCCAC CTGCGAGGGC TGCCAGACCG GCTTCCACGA GTACACCCAG
ATCATCGACC GCACCAGGTC GGACGAGGAG ATCCGCTTCT ACCTCGACGG CGTGCAGACA
TGGGTGGTCC GCCAGAGCCA GGTCGGCGTC TCCGCCTGGC GGGCCGCCGT CCACCACGGC
TTCTACCTCC GGTTCGACCT CGCCATCGGC GGGTCGCTGC CCGACGCGAT CGCCGGCTTC
ACCACCCCCA CCGCGGCCAC CACCTCCGGC GGCGTCCTGA GCGTGGACTC GGTCTCGGTC
TCCAGGAGCG CCGGCACCGT GCCCGCCCCG ATGACCGACC CGCCCACCCC GGCGGGGCCG
AGCACCGTCA GGGTGACCGG CGGCCAGGGC AACTGGCGGC TGACCGTCAA CGGCGCGCCG
TACGAGGTCA AGGGGCTCAC CTACGGCCCG CCCCAGGCCG CGGCGGACGG CTACATGCGC
GACCTCAGGT CGATGGGGGT CAACACGATC CGCACCTGGG GCGTGGACGA CACCCACACT
CCGGCCCTGC TCGACAGGGC CGCCCAGCAG GGCATCAAGG TGATCGTCGG GCACTGGCTG
AACCAGGGCG CCGACTACGT CAACGACACC GCCTACAAGA CGGCCACCAA GAACGAGATC
GTCGCCCGGG TCAACGCCCT CAAGGGCCAC CAGGGCGTGC TCATGTGGGA CGTGGGCAAC
GAGGTCCTGC TCACCATGCA GGACCACGGC CTGCCCGCCC CGGTGGTCGA GGAGCGGCGC
GTCGCCTACG CCAGGTTCGT CAACGAGCTC GCGCAGGCCA TCCACGCCGC GGACCCGGAC
CACCCGGTCA CCTCGACCGA CGCCTGGACC GGCGCGTGGA CGTATTACAA GGACCACTCC
CCGGCCCTCG ACCTGCTCGC GGTCAACGCC TACGGCGCCA TCGGCGGGGT CAGGCAGGCC
TGGATCGACG GCGGCCACAA CAGGCCGTAC ATCATCACCG AGGCCGGCCC GGACGGCGAG
TGGGAGGTGC CCGACGACGT CAACGGGGTC CCCTCCGAGC CCACCGACCT CCAGAAGCGG
GCGCAGTACA CCGCGAGCTG GAACGCGGTC AAGGCTCACC CCGGCGTCGC GCTCGGCGCC
ACCGAGTTCC ACTACGGCCT GGAGAACGAC TTCGGCGGCG TGTGGCTCAA CACCTTCACC
GGCGGCTGGC GCCGGCTGGG ATACCACGCG CTCAGGCAGG CCTACACCGG CCTGCCCTCC
GGCAACATGC CCCCGGAGAT CACCGGCATG ACGGTCGGCT CGCAGACCGC CGTCCCGGCG
GGCGGCCGGT TCACCGTCGA GGTGACGGCC TCCGATCCCG ACAACGATCA GATCCGCTAC
AACCTGATGT ACAGCGACAA GCACGTCAAC GGCAACAGGG GGTTCGGCCA CGTCGGGTTC
ACCCAGACCG GTCCGGGCCG GTTCTCGGTG ACCGCGCCCG AGCGGCTGGG CGTCTGGAAG
GTCTACGTCT ACGCCTTCGA CGGCCACGGC AACGTCGGCA TCGAGACGAG GTCGTTCCGC
GTCGTTCCGC CGCCGGTCGG CGGCACGAAC GTGGCGCTCG GCAGACCGAC CACCGCGTCG
TCCCACCAGC CGACGGGCGA CGGCGGGCCG TTCCTGCCCG CCAAGGCCAC CGACGGCAGC
TTCACCACCC GCTGGGCGAG CGAGTGGAGC GACGCCCAGT GGCTCCAGGT CGACCTCGGC
CAGGTGACCC CGGTCAACCG CGTCCGGCTC GGCTGGGAGG GCGCCTACGG CAAGGCCTAC
CAGATCCAGA CCTCGGCCAA CGGCTCGGAC TGGAGCACCG TCCACTCCAC CACGACCGGT
GACGGCGGAT TCGACACCAT CGACGTCAGC GCGTCCGCCA GGTACGTGCG GCTGAACCTC
ACCCAGCGCG CGACCGCCTG GGGCTACTCC CTGTGGGAGT TCGGCGTCTA CCGGTCCTGA
 
Protein sequence
MHPSRSPGAG RLSLLISTLI ALIASGLAVT APAAAAAAAD TLLSQGRAAT ASSTEGAAYG 
AAAAFDGDPA TRWASAFSDP QWIQVDLGAT ATVSQVVLNW EAAYGRAFKI QISADGTAWN
DAYTTTAGTG GTQTLPVSGT GRYVRLHGTV RATGYGYSLW EFQVYGATGP GPDPGAWTEV
WRDDFDGTSG TSPSGANWIL RTGTRYPGGA AKWGTGEVET MSASTANVSL DGTGKLNIRA
VRDGAGDWTS GRVETQRTDF APQPGERLRF TARLKQPDVP DATGYWPGFR ATGAAYRGDY
NNWPAVGETD IMTDVNGRSQ LSNTLHCGTA PDGACNEYGG RTSGLATCEG CQTGFHEYTQ
IIDRTRSDEE IRFYLDGVQT WVVRQSQVGV SAWRAAVHHG FYLRFDLAIG GSLPDAIAGF
TTPTAATTSG GVLSVDSVSV SRSAGTVPAP MTDPPTPAGP STVRVTGGQG NWRLTVNGAP
YEVKGLTYGP PQAAADGYMR DLRSMGVNTI RTWGVDDTHT PALLDRAAQQ GIKVIVGHWL
NQGADYVNDT AYKTATKNEI VARVNALKGH QGVLMWDVGN EVLLTMQDHG LPAPVVEERR
VAYARFVNEL AQAIHAADPD HPVTSTDAWT GAWTYYKDHS PALDLLAVNA YGAIGGVRQA
WIDGGHNRPY IITEAGPDGE WEVPDDVNGV PSEPTDLQKR AQYTASWNAV KAHPGVALGA
TEFHYGLEND FGGVWLNTFT GGWRRLGYHA LRQAYTGLPS GNMPPEITGM TVGSQTAVPA
GGRFTVEVTA SDPDNDQIRY NLMYSDKHVN GNRGFGHVGF TQTGPGRFSV TAPERLGVWK
VYVYAFDGHG NVGIETRSFR VVPPPVGGTN VALGRPTTAS SHQPTGDGGP FLPAKATDGS
FTTRWASEWS DAQWLQVDLG QVTPVNRVRL GWEGAYGKAY QIQTSANGSD WSTVHSTTTG
DGGFDTIDVS ASARYVRLNL TQRATAWGYS LWEFGVYRS