Gene Sros_8049 details

Gene Information

Locus tag: Sros_8049
Symbol:
ID: 8671377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 8860467
End bp: 8863142
Gene Length: 2676 bp
Protein Length: 891 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003343447
Protein GI: 271969251
COG category:
COG ID:
TIGRFAM ID:
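
The start, end, gene length, and protein length fields above agree with one another, and a short check makes the arithmetic explicit. This is an illustrative sketch only: it assumes 1-based inclusive coordinates and that the reported protein length excludes the stop codon, neither of which the record states, and the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(8860467, 8863142)
print(gene_len, protein_len)  # 2676 891
```

Under these assumptions the result matches the reported 2676 bp and 891 aa.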


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTCG ATCCGGTCGT ACGGCTCGCG GGCGCCGGCC GCGTGCACGG TGACGGGCCG 
CGCGCCGCAC GCTCCCCGCT GGCCGCGCTT CTGGCCGCCC TGCGGATCTC CCGGCGGGAC
GCGCTGCGGT CCAGGGGCCG CAGCGCCCTG ATCGTCGCGA TGGTCGGCCT GCCCGTTCTC
GTCATCACCG TGACCCTGAC CTTCGCCGAG ACGGCGGACA TCACCCCGCG CGAGGGCCTC
ACCGCGCAGA TCGGCGCCGC CGACCTGCGG ATCTGGACGC TCGGTGGGCA GCGATTCCCG
GCGCGAGACG CCGGCGGTGA GATCTCCTGG ACGGAGGTTC CCAGCTCGCC GGCGGAGATC
GTGAGTCTGC TCGACCACGG AGCCCGGGTC ATCCCGGCGA ACGACGGCAG CGTCGACTTC
AGGGGTGAGG ATGGCTACGA CCACGTGGCC GCCCATGAGC TGGACCTGCG CGACCCGCTG
ACGGCCGGGA TGTACCGCCT GTTGCGCGGC CGCCTCCCGG CCGCTCCCGG GGAGATCGCG
GTGACTCCGG GGACGGAGGA GCGGGGAGCG CCTCTCGGCA CGACACCGGC CGTCGGCCCG
AAGAAGACGC CCAAGCGGGT GGTCGGCGTG GTCGAACATC CGCACCGGAC CAGGAGTGCG
GAGATCGTGG GCTTGCCGGG CACCCTGCTT CTCGATGGGC GGGACGGTCA GGGCACCGGC
TGGCTGGCCG ACACCTCGCA ACCGATCACC CGGGAGGACA TCCGCAGGCT GAACGCCGCC
GGGCTGGGCG TCCGCGCTCC TGTCGTCCCC GAAGGCTCGG ACGGCTCAAA CGACTGCTGC
TGGATCGTCA GCCCCTATGA CGTCGAACAG CTCATGGCCA TCGGCCTCGC CGTGGTCATG
AGTGTGCTGG AGGTGGTGCT GCTCGCCGGT CCCGCCTTCG CGGTCGGGCT GCGCCGCCGC
CGCCGCGAGC TGGCGCTGAT CGCCGCGCAG GGCGGGGCAC CGGGGCACCT CCGCATGATC
GTTTTAGCCG ATGGTGTGGT TCTGGGCGGT GGCGCGGCGC TGCTCGGCCT GATCCTGGGC
GTCGGCCTGG GGGCACTGGC GGCACTGCTG GAAGCGGGTC GGCTGATCGG CGGGCTCGGT
CCACTGGACA TCCCGTGGCA CCCGATCCTG ACCGTCACCC TGCTGGGGGC GGTCAGCGGG
ATCACGGCCG CCGTGGTGCC CGCCGTACAG GCGGCCCGTC AGGACGTCGC GGCGGTGCTC
GGCGGGCGTA GGAGCCAGGC GCGCGACCGC GCGGGCCGTC CGGTCCTCGG CCTGGTCCTG
CTCGTGGTCG GCGCCGCTGC GGCCGTGTTC GCTGTCAGGT TCCACCCGGT CTGGGTCCTC
GCCGCGGCCG TCCTGGCCCA ACTGGGGTTG GTCGCGCTGG CTCCGGCGCT GGTCAGGATC
GCGGCGAGCC TGGCCACGCG GCTTCCCCTG CCGGTCCGGC TGGCCGCCCG CGACGCCTCC
CGCCACCGGG GCCGTACGGC CTCCGCTGTC GCGGCGGTCA TGACGGCGGC GGCGGCCTTC
ACCGCGATGG CTGTCATGAC CAACAGCAAC TTCGCCGTCA GCCGGGACTC TTTCCAGGCG
GCGCTCCCCG AGGGGATCAT GAGGATCAGC GGGCCCGACC GCGACGACGC CCGGTGGACC
GGGATCAAGG CGGTCGCCGA GCGGATGTTC CCCGAGACAC CGCTGATCGA GGCCGCCGAG
CCCTTGGACA TCACGGGAAC CTCCATCGAG CTCGCCCCCC TGCACTTCTC GGACGGCGGG
CCGTACGCGG GACGTTCCTA CTCCGGAGGG CTGCCAGTCG GAGGTCGGCA GCTGCTGGAG
CTCGTCCAGG GGCGGCGGGA TCCGGTCGCC GCGGCGGCGC TCGACGCGGG CAAGGCCGTG
GCGTTCGACC CGCGCCTGGT GCGTGACGGG CGGATGAGGC TGCATGTCAT GGCGTTCACC
TCCGAGGAGT ACGAGGTCCC GGACGACCTC ACGGTCCCCG CGGTCGTGGC CGAGGCCGCC
GACCCCCAGT ACGCAGTCGG GGTGCTCCCT GCCTCCGCAC TCCATGCCAT CGGGCTGAAG
ACGAAGGCCG ACACGCTGTA CATCGACCCC GCCGGCCACC TACTCGACCG GGAGCGGGAG
GGCAAGCTGG AGCGGGAGTT GGCGGCGGCG GCAGGCGGCA GCGTCGACGT GTTCGTTCAG
GGCGGGCTCG GGCAGGGCGT TCTTCCGCAG CTGGCGCTGT TCCTCGCTGC GGCGTCGGTC
CTGGTGCTGG GTGGCACGTT CGCCGCCACC GGGCTCGCGG CGGCCGACCT GCGGCCCGAC
CTCGCGACCA TGGCCGCCGT CGGGGCGCCG CCCGGCACGC GGCGGCTGGT CGTCGCCGGG
CAGGCGGGGT TCATCGCCGG GCTTGGCGCG CTGGTCGGTG CCCTCGCGGG GATCGCGACC
GGGATCGCGG CGATCTGGCC GGTGCTGGGA CAGGCAAGGA ATGTCGGCGG GCCGACGACG
CCCGGGGGGC TGCCGCCGTT CCCCGGCAGG GCGCCGACCA TCGAGATCCC CTGGCTCTTC
CTCGTGGCAC TGGTGGTCGG GCTGCCAGTG CTGGCGGCGT TGCTGGCGGG GGCGTTCACC
CGGACGAGAA CGACGCTGAC CCGCCGGACC GGATAG
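
The GC content field in this record can be recomputed from the gene sequence. The sketch below shows one way to do it, ignoring the spaces and line breaks used in the listing. The function name `gc_fraction` is hypothetical, and the example call uses only the first 10-base block of the sequence. The value for the full sequence was not computed here, so it is left for the reader to compare with the reported 74%.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T bases; other characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First 10-base block of the gene sequence, as grouped in the listing.
print(gc_fraction("ATGAGTTTCG"))  # 0.4
```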
 
Protein sequence
MSFDPVVRLA GAGRVHGDGP RAARSPLAAL LAALRISRRD ALRSRGRSAL IVAMVGLPVL 
VITVTLTFAE TADITPREGL TAQIGAADLR IWTLGGQRFP ARDAGGEISW TEVPSSPAEI
VSLLDHGARV IPANDGSVDF RGEDGYDHVA AHELDLRDPL TAGMYRLLRG RLPAAPGEIA
VTPGTEERGA PLGTTPAVGP KKTPKRVVGV VEHPHRTRSA EIVGLPGTLL LDGRDGQGTG
WLADTSQPIT REDIRRLNAA GLGVRAPVVP EGSDGSNDCC WIVSPYDVEQ LMAIGLAVVM
SVLEVVLLAG PAFAVGLRRR RRELALIAAQ GGAPGHLRMI VLADGVVLGG GAALLGLILG
VGLGALAALL EAGRLIGGLG PLDIPWHPIL TVTLLGAVSG ITAAVVPAVQ AARQDVAAVL
GGRRSQARDR AGRPVLGLVL LVVGAAAAVF AVRFHPVWVL AAAVLAQLGL VALAPALVRI
AASLATRLPL PVRLAARDAS RHRGRTASAV AAVMTAAAAF TAMAVMTNSN FAVSRDSFQA
ALPEGIMRIS GPDRDDARWT GIKAVAERMF PETPLIEAAE PLDITGTSIE LAPLHFSDGG
PYAGRSYSGG LPVGGRQLLE LVQGRRDPVA AAALDAGKAV AFDPRLVRDG RMRLHVMAFT
SEEYEVPDDL TVPAVVAEAA DPQYAVGVLP ASALHAIGLK TKADTLYIDP AGHLLDRERE
GKLERELAAA AGGSVDVFVQ GGLGQGVLPQ LALFLAAASV LVLGGTFAAT GLAAADLRPD
LATMAAVGAP PGTRRLVVAG QAGFIAGLGA LVGALAGIAT GIAAIWPVLG QARNVGGPTT
PGGLPPFPGR APTIEIPWLF LVALVVGLPV LAALLAGAFT RTRTTLTRRT G
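
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in their permitted start codons), and this gene starts with ATG, so no special start handling is needed. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the listing) is removed first.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGTTTCG ATCCGGTCGT ACGGCTCGCG"))  # MSFDPVVRLA
```

The output matches the first ten residues of the protein sequence. Translating the final line of the gene sequence in the same way gives RTRTTLTRRTG followed by the TAG stop codon, which matches the end of the protein.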