Gene Sros_6547 details


Gene Information

Locus tag: Sros_6547
Symbol:
ID: 8669856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 7181539
End bp: 7184676
Gene Length: 3138 bp
Protein Length: 1045 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003342003
Protein GI: 271967807
COG category:
COG ID:
TIGRFAM ID:
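
The coordinates and lengths in the table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `coordinate_summary` is hypothetical and used only for illustration.

```python
def coordinate_summary(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, expected protein length in aa).

    Assumptions (not stated in the record itself):
    - coordinates are 1-based and inclusive
    - the final codon is a stop codon and is not translated
    """
    gene_length_bp = end_bp - start_bp + 1
    codons = gene_length_bp // 3
    protein_length_aa = codons - 1  # drop the terminal stop codon
    return gene_length_bp, protein_length_aa


# Values copied from the Gene Information table.
gene_bp, protein_aa = coordinate_summary(7181539, 7184676)
print(gene_bp, protein_aa)  # 3138 1045
```

Under these assumptions the result agrees with the listed gene length (3138 bp) and protein length (1045 aa).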


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000462957
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCGTATT CACCACGGAA CCTACGCAAC GCGCTGATCG CTCTCACCCT CGCCCTGCCG 
GCGGCGGTGA CGGCCCTGCC CGCCCCCGCG CTCGCCAGGA CCGAGGCGGG CGTCACCGGG
GTCGCCGTCG GCCAGAGCGC CGTCACCGTC ACCGGCAGGG CGAGCGGCCA GGTGGCGCTG
TACGCGCTGG ACACCTGGCA GGACCCCGCG AGCCACACGG GCCTGCAGCC GGTCGCCGTC
GTCGAGCCCC GCGCCGACGG CGCCTTCAGC GCCGAGGTAC CGCGCATGGA CGGCGCGCAG
GACCGGCTGC ACGACAAGTT CCTCGCCGTC GCCGACGGCC AGGTGCTCGG CGACGTGCAC
TACGCCGACG ACCTGCGCTT CCCCTCGGCC AACGACGCCC CCTACCCGCA GGTGCCCGGC
AAGAAGGGCC TGCAGGTCCA GATGACCGAC GACGCCGAGG AGATCGGCGT CGAGCACGCG
GGTATCAACC TGGCACTGGA CTCGGTCATG ATGGCCGGAC CCGGCGCGGC GGGCAACACC
ATCGAGCACG TCTCGCAGGG CAGGACGTAC TACTTCGACA AGGGCGCGGT CGCCGGGCTG
GACGCACAGG TCAAGCCGCT GTCCGACAAC GGCGTCCTGG TCAACCTGAT CGCCATCGTC
TACGACAACA AGGCCGCCAA CTCCGCCGCG CCCAAGCTGA TCCACCCCGA GGCCGAGCGC
GGCAAGGGCA CGGTGTACGC CTTCGACGTG AAGACCGCCG AGGGCGAGGG CTACTTCACG
GCCGCGATGG AGTTCTTCGC CCAGCGCTAC AGCCGCGCCG ACGCCAAGTA CGGCAGGGCC
TGGGGCTGGA TCGTCGGCAA CGAGATCGAC GCCCAGCAGT ACTGGTACAA CATGGGCCCC
CAGGACCGCG ACGTCTTCCT GGAGCAGTAC ACCAGGGCCA TGCGGCTGAC CTGGCAGGCC
GTGCGCAAGT CCGACGCCAA CGCCCGCGTC TACACCTCGC TGACCCACTT CTGGACCGGC
GCCGTGGGCG ACGACCCGAA GTACACCTAC AAGGGCCGTG ACGTGGTGGA GGGCCTCAAC
GCCCTGACCA AGGCCCAGGG CGACTTCGAC TGGAACATCG CCCATCACCC GTATCCGGAG
AACCTGTTCA ACCCGGCGTT CTGGAACGAC AAGACGGCGA CCGACAGCTT CGACACGCTG
CGCATCACGT TCAAGAACAT CGAGCTGCTG CCGCGCTACC TGGCCCAGCA GCACCTGCTC
CACGACGGGC GGCCGCGCCG GGTGATCCTT TCGGAGCAGG GCCTGAACTC CCAGGACTAC
ACCGACGAGC AGCTCAAGCT GCAGGCGGCG GCCTACGCCT ACGCCTACTA CAAGATCGCA
TTTGCTGAGG GGATCGACTC GTTCATCCTG CACAGGCACG TCGACCACAA GCAGGAGGGC
GGCCTCCGCC TCGGCCTGTG GACATGGGAC GACGGGCACG CCGCGCCGTC CAACGCCGGC
GACCGCAAGC CGGTGTACGA AGTGTTCAAG TACATCGACA CCGAGCGCTC GCTCGAGGTG
ACCGAGTTCG CCAAGAAGAT CATCGGGATC TCCGACTGGA AGGACGTCAT CCCCGGCTTC
GACCCGGCGA AGCTGGCGAC CGGGAAGCTC CCGGCCACCG TCGGCGTACA GCTGGACGCC
AGGCCCGCGC TGGAGCGCGT GGTCTCCGAC TTCGAGCGGG ACACCGGCGG CTGGCGGCCC
TCCGACAACG CGGCCGCCGT CGAACGGGTG GCCGTCGACG GCGGGCACGC GCTGCGCGTG
CGCTTCGACC GCGACCTGCC GGGCTGGTCG ATGTACGCCA AGTCGTACAA GGGGACCGAT
CTGGCGCTCG ACCGGCCGCT CGACGCCTCG CTCAGGTCAC AGCTGTCGGT GTCGGTACGG
GTGCCGGAGA ACGCCGGCGA CGGCTTCGAG CCGGGCAACG CCTTCTCCGC GAAGGTCCGC
GTCTACGGCC CGGACGGCGA GGTCGCCGAG GGCGTCGGCG CGATCGACCC CGCCCGCGGC
TGGAACCGGC TCACCCTCGA CCTGTCCCGC TGGGCCGGCC GCAAGGCGAT CTCCCGGGTC
AAGGTGTGGG TACGCGGCTC GGTCGGCTCC GACTGGGCGG GCTCGTACGA GATCGACAGG
CTGAGCCTGG CGGCCGCCGC GGTCCCGGCG GCCGACCGGC GCAACGTCGA GATCACCGCC
GCGACGGGCG AGCGCGGCCG GATCGGCTCC ACGGTGAGCT TCACCGTGAC CAACCACGAT
GTGCTGCCCA TGGCCGGCAA GGTCACTTTG CGGGCCTGCG ACGGGGTGAG CCTCACCCCG
GCCTCGCTCG GGGTGGGCGG CCTGCGGACC GGCGCCGGCC GTACCTTCAC CACCGAGCTG
ACCGCCTACG CGCCGGCCGA CCGGGAGCAC CCGGTGGTGT GCGCGGACTA CCTCAAGCAG
GAGCAGCGGG TCACCTTCCG GCTGCCGCCC GAGGCGGCCT ACGTGCCGCC CTCTCCCGAC
TCCTTCGCCA ACCGGGAGCT GTCCGACGGC TTCGACACCG ACAGCTCGGC CCGCTACCGG
ACGCACCGGG TCGAGCCGGA GAACAAGGAG GTCCCGCAGG TGAACGTCGG CGGCGGCACG
CTGTCGGCGA GCCACGCCTC CGCGAGCTGG TTCGGGCTGC TCTCCTCGGA CGTCTCGCCG
CGCAACCCGG CCTTCAGCAC CGCGATCACG GTCAAGGGAT TCCAGGGGAA CTCCACGGAC
ATGGACACGG TCTACACCGG CCTGGTGAAG GACGGGCGGA CGGACGTGGT GGCCTTCTAC
GTCAACACCC ACAAGTTCGC CGGGTTCGAG GTGCGCGGGC CCCAGGTCCC GGGCGGTCTC
GCGGTGTTCG GCGTCAAGCA GGGGGTGTCG ATCCCGGACG GCGGCCGGTT CGCCCTGTCG
CTGGTCGGGG ACCGGGCGGC CATGTACGCC GACTCCGGCG ACGGCTGGCG GCTGGTCACC
GCCGCCACGC TGGAGCACCT GCCCCGGCTG ACGGACCCGG ACGTGCGGGC GGAATACCGC
TACGGCTTCG GCGTCCGCGG TGACGCCGGC GCCGCCCCGC TGGTGCTCGA CGCGGTCGAG
GGCCGTAGCA TCTCCTAG
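
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any analysis. The following is a minimal sketch of how one might strip the block formatting, compute a GC fraction, and run a rough sanity check on the coding sequence. All function names are hypothetical. The check accepts only ATG as a start codon, which is a simplification: translation table 11 also allows alternative start codons.

```python
# Stop codons used by NCBI translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, recognised stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Only the first and last lines of the gene sequence are used here to keep
# the example short; paste the full block for a real check.
first_line = "ATGCCGTATT CACCACGGAA CCTACGCAAC GCGCTGATCG CTCTCACCCT CGCCCTGCCG"
last_line = "GGCCGTAGCA TCTCCTAG"

excerpt = clean_sequence(first_line + "\n" + last_line)
print(len(excerpt), looks_like_complete_cds(excerpt))
print(f"GC fraction of excerpt: {gc_fraction(excerpt):.2f}")
```

Applied to the full sequence, `gc_fraction` can be compared with the GC content reported in the Gene Information section.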
 
Protein sequence
MPYSPRNLRN ALIALTLALP AAVTALPAPA LARTEAGVTG VAVGQSAVTV TGRASGQVAL 
YALDTWQDPA SHTGLQPVAV VEPRADGAFS AEVPRMDGAQ DRLHDKFLAV ADGQVLGDVH
YADDLRFPSA NDAPYPQVPG KKGLQVQMTD DAEEIGVEHA GINLALDSVM MAGPGAAGNT
IEHVSQGRTY YFDKGAVAGL DAQVKPLSDN GVLVNLIAIV YDNKAANSAA PKLIHPEAER
GKGTVYAFDV KTAEGEGYFT AAMEFFAQRY SRADAKYGRA WGWIVGNEID AQQYWYNMGP
QDRDVFLEQY TRAMRLTWQA VRKSDANARV YTSLTHFWTG AVGDDPKYTY KGRDVVEGLN
ALTKAQGDFD WNIAHHPYPE NLFNPAFWND KTATDSFDTL RITFKNIELL PRYLAQQHLL
HDGRPRRVIL SEQGLNSQDY TDEQLKLQAA AYAYAYYKIA FAEGIDSFIL HRHVDHKQEG
GLRLGLWTWD DGHAAPSNAG DRKPVYEVFK YIDTERSLEV TEFAKKIIGI SDWKDVIPGF
DPAKLATGKL PATVGVQLDA RPALERVVSD FERDTGGWRP SDNAAAVERV AVDGGHALRV
RFDRDLPGWS MYAKSYKGTD LALDRPLDAS LRSQLSVSVR VPENAGDGFE PGNAFSAKVR
VYGPDGEVAE GVGAIDPARG WNRLTLDLSR WAGRKAISRV KVWVRGSVGS DWAGSYEIDR
LSLAAAAVPA ADRRNVEITA ATGERGRIGS TVSFTVTNHD VLPMAGKVTL RACDGVSLTP
ASLGVGGLRT GAGRTFTTEL TAYAPADREH PVVCADYLKQ EQRVTFRLPP EAAYVPPSPD
SFANRELSDG FDTDSSARYR THRVEPENKE VPQVNVGGGT LSASHASASW FGLLSSDVSP
RNPAFSTAIT VKGFQGNSTD MDTVYTGLVK DGRTDVVAFY VNTHKFAGFE VRGPQVPGGL
AVFGVKQGVS IPDGGRFALS LVGDRAAMYA DSGDGWRLVT AATLEHLPRL TDPDVRAEYR
YGFGVRGDAG AAPLVLDAVE GRSIS