Gene Sros_6143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6143 
Symbol 
ID8669441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6738104 
End bp6740185 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003341616 
Protein GI271967420 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.601803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.287239 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACC TCCGTCACGT TCCCCGAGCG CTCGCGGTGG CCGCGGTCCT GGCGGCCGCG 
CTGCTGCGTC CCCCCGCGAC GGCCGCCGGC GAAGGCCCCG GCCACCCCTG GCTGGACCCC
GCCAGGCCCG TCGCAGAGCG CGTGGAGCTC CTGCTCGGCG CGATGACGCA GGACGAGAAG
CTGAACATGG TCCGCGGGCA CGGCCTCGGG CTGATCCCCC GGGGACGGGT GGCGCCCGTT
CCCCGCCTGG GCATCCCCGA GCTCCGCATG CACGACGGCC CGCACGGCGT GAGCGACGTG
AGCGGTGTGA CCGCCTTCCC CGCGCCGGTC ACCCTGGCCG CCACCTGGGA CACCTCCCTC
GCCCGCCGGT TCGGGGCCGC GCTCGGCGCC GAGGCGAGGG GCAAGGGCGT CAACGTCCAC
CTCGCCCCCG CCGTGGACAT CCTGCGGGTG CCGCAGTCGG GAAGGGCCTG GGAGAGCTTC
GGGGAGGACC CCTATCTCGC CTCCGCGATG GCGGGGGAGG AGGTCGCCGG GATCCGGGGG
CAGAACGTCA TCGCCGCCGT CAAGCACTAC GTCGGCAACA ACCAGGAGAC CGACCGCATG
CGGGTCGACG CCGTGATCGA CGAGCGGACG CTGCGGGAGA TCTACTACCC GCCCTTCGAG
GCCGCCGTCA CCGCCGGGGC CGGCGCGGTC ATGTGCGCCT ACAACAGGGT CAACGGCCCG
TTCGCCTGCG AGAACCCCCA GACGCTCACC GCCGCTCTGA AGGACTCCAT GGGCTTCGGG
GGCTGGGTCA TGTCCGACTG GCTGGCCACC CGGAGCGGGC AGAAGGCCGC CCTGGCCGGC
CTGGACCAGG CGATGCCGGA CGACCCGCTG TTCGCCGCGA ACCTCAAGGT GGGGATCGCC
CTGGGGACGT TCCCCGCCGC GCGGCTCGAC GACATGGCCC GGCGCGTGCT CACGCCGATG
TTCGCCGACG GGCTCTTCGA GACCGGGTAC GGCGCGCCGG ACCGGGACGT CCGCACCCCC
GAGCACACCG CCCTGGCCCG CGACATCGCG GTCTCCGGCA CCGTCCTGCT CAAGAACCGA
GACGGGATCC TGCCGCTCAC CGGCGGCCGG ATCGCCGTCA TCGGGACCGC CGCGCACGAC
AACGTCGAGA TCACCTCCGC CGGCTCCGGC CGGGTGCTGC CGCCCTACGT GGTCACGCCG
TACCAGGGCC TCGACGCCAG GGCCGGAGGC ACCGCGACCT ACTCGCCGGG AGACGCGGAG
GGGGCGCTGC CCTGGGATGT CGACAAGCGG CTGCGCGACG CCGCCGACGC GGCCCGGCGG
GCCGACGTCG CCGTGGTCGT GGTGGGCCTG TCCTCCCAGG AGGGGGAGGA CAGGCCGGAC
CTCGGGCTGC CGGGCAACAT GGAGACGCTG ATCGAGACCG TCGCCGCGGC CAATCCGCGT
ACGGTCGTCG TGCTGACCTC CCCCGCGCAG GTCCTCATGC CCTGGGCCGG CCGGGTCAAG
GGCGTGGTCT CGGCCTTCAT CGGGGGGCAG GAGCTCGGCA ACGCGCTCGC GGCCGTGCTG
TACGGCGACG CCGACCCCGG CGGTCGCCTG CCGATGACGT TCGCCGCCCG TGCCGCCGAC
TACCCGGCGA GCACTCCCGA GCAGTTTCCC GGGGTCGGTC ACCGGCAGGT CTACAGCGAG
CGGCTGCGGG TCGGCTACCG GCACTTCGAC GCGACCGGCC TCACGCCGCT GTTCCCCTTC
GGGCACGGCC TGTCCTACAC GAGCTTCTCC TACGACTCCC TGTCGATCTC CGGGGCCACG
GTGCGGGCCA GGGTGACCAA CACGGGCTCA CGCGCGGGCG TCGCCGTACC GCAGCTGTAT
CTCGGGTTCC CGCCGGAGGC GGGAGAGCCG CCCCGGCAGC TCCGCGGGTT CGCCAGGCTC
CGGCTCGCGC CGGGGGAGTC GGCGACCGTC GCCTTCCGGC TCCCGGAGCG GGCCTTTCAG
GTCTGGACGT CCCAGGGGTG GAGTACCGTG CGCGGCGACC ACACCGTCCA GGTGGGCGCC
TCCTCGCGGG ATCTCCCCCT CAGCGGGACG TTGACCCGCT GA
 
Protein sequence
MPDLRHVPRA LAVAAVLAAA LLRPPATAAG EGPGHPWLDP ARPVAERVEL LLGAMTQDEK 
LNMVRGHGLG LIPRGRVAPV PRLGIPELRM HDGPHGVSDV SGVTAFPAPV TLAATWDTSL
ARRFGAALGA EARGKGVNVH LAPAVDILRV PQSGRAWESF GEDPYLASAM AGEEVAGIRG
QNVIAAVKHY VGNNQETDRM RVDAVIDERT LREIYYPPFE AAVTAGAGAV MCAYNRVNGP
FACENPQTLT AALKDSMGFG GWVMSDWLAT RSGQKAALAG LDQAMPDDPL FAANLKVGIA
LGTFPAARLD DMARRVLTPM FADGLFETGY GAPDRDVRTP EHTALARDIA VSGTVLLKNR
DGILPLTGGR IAVIGTAAHD NVEITSAGSG RVLPPYVVTP YQGLDARAGG TATYSPGDAE
GALPWDVDKR LRDAADAARR ADVAVVVVGL SSQEGEDRPD LGLPGNMETL IETVAAANPR
TVVVLTSPAQ VLMPWAGRVK GVVSAFIGGQ ELGNALAAVL YGDADPGGRL PMTFAARAAD
YPASTPEQFP GVGHRQVYSE RLRVGYRHFD ATGLTPLFPF GHGLSYTSFS YDSLSISGAT
VRARVTNTGS RAGVAVPQLY LGFPPEAGEP PRQLRGFARL RLAPGESATV AFRLPERAFQ
VWTSQGWSTV RGDHTVQVGA SSRDLPLSGT LTR