Gene Sros_4438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4438 
Symbol 
ID8667732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4951396 
End bp4954665 
Gene Length3270 bp 
Protein Length1089 aa 
Translation table11 
GC content70% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003340051 
Protein GI271965855 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0339173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00783982 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGCCT TGGCCAGGTT GAGCCTTGCC AACCGCAGCC TCGTCATAAT GATCGCGGTC 
GTGTTGAGCG CGTTCGGCGC CTTCGCGATC CCGTCGCTGA AACAACAGCT TCTGCCGTCG
CTTTCGTTCC CGGGCGCCTT CGTGGTCGCC CCCTACCTGG GTGCCTCTCC CGAGATCGTG
GAGGAGCAGG TCACCAAGCC GATCGAGGAC AGCTTCCAGG GCATCGAGGG GGTCACCGAG
GTCACCTCCA CCTCCCAGGA GGGGATGTCG CAGGTTCAGG TGGCCTTCGA GTACGGCACC
GACATCGACG CGGCGGTGGC GAAGATGCAG CAGGCGGTGG CGCGGATCGG CACCCAGCTG
CCCGACGGGG TGGACCCGCA GGTCCTGGCC GGCGGCACCG ACGACATCCC GGTGCTGGTG
TTGGCGGTGG GGACCGGCGG TGACGAGCGG GCCATGGCCG ACAAGCTCCA GCGGATCGTG
GTGCCGGAGC TCCAGGGCAT CGACGGCGTC CGTGAGGCCA CGGTGACCGG CGCCCGCGAC
GAGACCGTGG TGATCACTCC GGATGTCAAG AAGCTCGCCG CGCGCGGGCT GGCGCCGACG
GCGGTGACCG ATGCGCTGCG CGCCAACGGG CAACCGGTCC CGGCGGGCAG CCTGGTGCAG
GACAAGGCGT CGCTGACGGT TCAGGTGGGC AGCCGGATCG CCTCGATCGA GGATCTGAAG
AACCTCTATC TGATCCCCGC GGCTCCGGCG CAGGCCCAGC AGGCTCCGGC CCAGGCCCAG
GCGCAGCAGG CCGCCCAGCA GGGACGTCAG ATGCCGGGGC AGCAGAGTGC GGCCGCGGCC
CAGCAGCGGG CCCAGCAGGC GCAGCGCCCC GTGGCAGCGC CCAAGCCGGT GAAGCTGGGC
GAGGTGGCCG AGGTCGAGCG TTCGCTGGCC ACCAGCACCT CGCTGACGCG CACCAACGGC
GAGCCGAGCC TGGGGGTGTC GGTGACGATG ACGCCCGACG GCAACGCGGT GGCGATCTCC
CATGCGGTCA ACGAGAAGAA GGCCGAGCTG GTCCGGGCGA TCGGCGATTC GGCGCAGGTG
ACGGTGGTCT TCGACCAGGC GCCGTATGTG GAGCGCTCGA TCGAGGACCT GACCACCGAG
GGCCTGCTGG GTCTGGTCTT CGCGGTGCTG GTCATCTTGA TCTTCTTGCT GTCGGTGCGT
TCGACGCTGG TGACGGCGGT GTCGATCCCG CTGTCGGTGG TCATCGCGCT GATCGCCCTG
TGGCTGGGCG ACTACTCCCT CAACATGCTC ACGCTGGGGG CGCTGACGAT CGCGGTCGGG
CGGGTGGTGG ACGACTCGAT CGTGGTTTTG GAGAACATCA AGCGGCACCT GGGCTACGGC
GAGGCCAAGC GTGATGCGGT GCTGGCGGGC GTGCGTGAGG TGAGCGGGGC GGTGACGGCC
TCGACGCTGA CCACGGTGGC GGTGTTCCTG CCGATCGCGG TGGTCGGCGG CATGGTGGGG
CAGCTGTTCG CGCCGTTCGC GATCACGGTG ACGGTGGCGC TGCTGGCTTC GCTGCTGGTG
TCGCTGACGG TGATCCCGGT GCTGGCCTAC TGGTTTTTGA AGGCGCCGGT GCTCAGCGCC
GAGCAGGCGC GCGTGGTGCG TGAGCAGGCG GAGGCCAAGG AGCTGCGCAG TCCGTTGCAG
CGGGCCTACA TGCCGGTGTT GCGCTTTGCG ACCAAGCGGC GCCTGGTCAC CGTCCTGGTC
GGCGTCGTGA TCTTCGTGGG GACGATGGGC CTGGCCGGCC GGCTGGAGAC CAACTTCCTG
GACTCCTCGG GGCAGAACAC GCTCTCGGTG ACGCAGAAGA TGCCCGCCGG CACGGACCTG
GCCACCACGG ATGCGGCGGC CAAGAAGGTG GAGGAGGTGC TGGGCTCCCT GAAGGACGTG
GAGGGCTACC AGGTCAACGT GGGTGCCGGC GGCGGGTTCA TGGGCGCCAC GGGCGGTGGC
GGCGATCGCG CCTCCTACTC GGTGACGGTC GCCGAGGGCG CCGACACCGC GGCGCTGGAG
AGCGCACTGC GTGATCGGCT GAAGACGCTG GCCGATATCG GCGAGGTCAC CGTCGGTGGC
GCGCAGGGCG CCCCGGGCGG CGGTTCGACC AACCAGCTTC AGGTGATCGT CCAGGCGCCG
GACACCGCGA CGTTGCAGAG CGCGGCCGAG GCGGTGCGCG GCGCGATGGC GGGACTTGAC
GGGCTGCGCG ATGTCTCCTC CAACCTGCAG GCCAGCGGCC GGCGCGTGGA GGTCCAGGTG
GACCGGCGCA AGGCGGCGGC CAAGGGATTG ACCGAGGCGC AGATCGCGCA GGTGGTCGCC
CAGCGTTTCC GTGGCGCGCC GCTGGGCCAG ATCACCCTGG ACGGGCGGGC CAGTGATCTG
ATCCTGCGCC TCGATGAGGC GCCGGAGGAT CTGAAGGCGA TCGGTGCGCT GAAGCTGCCG
TCGGCCGCCG GCGAGGTCGA GTTGTCGGAT GTGGCGAAGG TGGCCAAGGT CGACGGGCCG
ACGCAGGTGA CGCGGATCGA CGGTGAGCGC AGCGCGACGG TGTCGGGTAC GGCCACCGGC
AGCGACCTGG GCAGTGCCAC CGCGGCGCTG ACGGCGAAGC TGGAGGGGCT GTCGTTGCCG
GCGGGGGCGA AGTTCACCAT CGGCGGCGTC AGCGCCGACC AGGAGGAGGC GTTCGGCCAG
CTGGGTCTGG CGATGCTGGC GGCCATCGCG ATCGTCTTCA TGATCATGGT GGCGACGTTC
CGTAGCTTCG TGCAGCCGCT GGTGCTGCTG GTGTCGGTGC CGTTCGCGGC GACCGGCGCG
ATCGGCCTGC TGCTGGCCAC CGGCACGCCG CTGGGCGTTC CGGCGCTGAT CGGCATGCTG
ATGCTGATCG GCATCGTGGT GACCAACGCC ATCGTGCTGA TCGACCTGAT CAACCAGTAC
CGGGAGCAGG GGATGGGCGT GGTGGAGGCG GTGATCGAGG GCGGCCGGCG GCGCCTGCGG
CCGATCCTGA TGACCGCGGT GGCGACGATC TTCGCGTTGA TCCCGATGGC GCTGGGCCTG
ACCGGGTCGG GCGGGTTCAT CTCCCAGCCG CTGGCGATCG TGGTGATCGG TGGCCTGCTG
TCGTCGACGC TGCTGACGCT GGTGCTGGTG CCGACGCTGT ATGTGATGCT GGAGCGGACC
AAGGAGCGGT TCGGCCGCGG GCCTCGGCCC GGCAGGCACG AGGCCGAGGT GAAGGTCTAC
GACGCCGACG AGCTGGCCTC GGCCCGGTAG
 
Protein sequence
MTALARLSLA NRSLVIMIAV VLSAFGAFAI PSLKQQLLPS LSFPGAFVVA PYLGASPEIV 
EEQVTKPIED SFQGIEGVTE VTSTSQEGMS QVQVAFEYGT DIDAAVAKMQ QAVARIGTQL
PDGVDPQVLA GGTDDIPVLV LAVGTGGDER AMADKLQRIV VPELQGIDGV REATVTGARD
ETVVITPDVK KLAARGLAPT AVTDALRANG QPVPAGSLVQ DKASLTVQVG SRIASIEDLK
NLYLIPAAPA QAQQAPAQAQ AQQAAQQGRQ MPGQQSAAAA QQRAQQAQRP VAAPKPVKLG
EVAEVERSLA TSTSLTRTNG EPSLGVSVTM TPDGNAVAIS HAVNEKKAEL VRAIGDSAQV
TVVFDQAPYV ERSIEDLTTE GLLGLVFAVL VILIFLLSVR STLVTAVSIP LSVVIALIAL
WLGDYSLNML TLGALTIAVG RVVDDSIVVL ENIKRHLGYG EAKRDAVLAG VREVSGAVTA
STLTTVAVFL PIAVVGGMVG QLFAPFAITV TVALLASLLV SLTVIPVLAY WFLKAPVLSA
EQARVVREQA EAKELRSPLQ RAYMPVLRFA TKRRLVTVLV GVVIFVGTMG LAGRLETNFL
DSSGQNTLSV TQKMPAGTDL ATTDAAAKKV EEVLGSLKDV EGYQVNVGAG GGFMGATGGG
GDRASYSVTV AEGADTAALE SALRDRLKTL ADIGEVTVGG AQGAPGGGST NQLQVIVQAP
DTATLQSAAE AVRGAMAGLD GLRDVSSNLQ ASGRRVEVQV DRRKAAAKGL TEAQIAQVVA
QRFRGAPLGQ ITLDGRASDL ILRLDEAPED LKAIGALKLP SAAGEVELSD VAKVAKVDGP
TQVTRIDGER SATVSGTATG SDLGSATAAL TAKLEGLSLP AGAKFTIGGV SADQEEAFGQ
LGLAMLAAIA IVFMIMVATF RSFVQPLVLL VSVPFAATGA IGLLLATGTP LGVPALIGML
MLIGIVVTNA IVLIDLINQY REQGMGVVEA VIEGGRRRLR PILMTAVATI FALIPMALGL
TGSGGFISQP LAIVVIGGLL SSTLLTLVLV PTLYVMLERT KERFGRGPRP GRHEAEVKVY
DADELASAR