Gene Sros_5165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5165 
Symbol 
ID8668459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5678375 
End bp5680087 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003340686 
Protein GI271966490 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0452988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCAAGA TGAGATCTAC TCTCGTGCTG CTGCTGTTGT CGATGGGCGG GGCGGTTCTG 
GCAGCGCCGT TGCCCGCCGT GGCAGCGGCT CCGGCGGCCC AGTCGCGGTT CGCCTGTGAC
GAGGTGGATG CCAAGGACAA GGGCTGGGTG CTGCCGATCT ATGTCCACCA GCCCGGGCAG
GACGCCTGGG AGGACGACAG CGCCGCCCTG CTGAACACCA TCTGGGAGAC CGACCAGACG
GTCGACTCGA GCGCGGAGCG GTTCGGGGGC TCCCGGCGGC TCCGGTTCGT GCAGGACGGT
GACTGCCGTC CGGTGGTGGC GAAACTGCCG TTCACCAAGG GCCGCAACCG GGCCGAGATG
GGCAAGGCGA TGGCGGAGAA CCTCGCCTCC CAGCCTGCCC TGGTGCGTAC GCTCTGGCAG
ACCAACCGGG TCAAGCCGTT GTACTTCGTC AGGGACAACG AGATCACGGA CTCCTGCACG
GGCGGCGGCG CCAACGCCGG GCTGAGCACC GGCAACGTCA TCCTCCCGCG CTGGTGCTGG
AGCGAGGCCG GGCTCACCCA CGAGCTGATC CACAGCTTCG GGCTCTCGCA CTGCGACGGG
GGCGGCGTGA ACGGCAACGA CCCGGTCTGC CGGAACATGG GCACCCGGAA GGAGTGCACG
AGCGACCTCG CGGCCAACTA CCACCTTGAC TCGTGCCGGA TCGACGAATT CCGCTACTTC
GAGCCGACGC CGGTACGGCA GCCCGAGCTG GAGAAGATCA GGAACGTCGC GTTCAGCCCG
TACCTGATCC AGAACCAGCC GAGCCCGGTG TGGCAGTTCC GCATCAAGGT GGTGGACAGC
GGCAGATGCC TCGACGCAAG CGCGGCGCAG GTCGTGCAGC GCGCGTGTAC GGACAGCTCC
GCGCAGACGT GGCAGCGCAG CATCGACGAC GACGGCTACC TCACCATCCG CAACGCGGCG
AACGGCCGCT GCCTCACCAT GGCGGACACC GTCGTGACCG GCCCGTGCGC GAAGAAGGAC
AAGTCGCAGC AGTGGCTGCC GCAGGCTGGT CAGGACAGGA CCAACTTCGC CGGCCGCGCC
GGTGGGAAGC TGTCCGTCAA GGACAACCGT GACGGCGGAG CGGTGGTGCG CGACGGCAAG
GGTGAGTTCG TGACCGAGCT GCTGGGCGGC CTGGCCTCCT CGCCCACCCA GCCGAACACC
CCGGCCCCGA CGGCCACGTC GCAGCCGACC GCGGAGCCGA CCGCAAGGCC GACCCCCAGG
CCGACCGCGG CACCGACCTC CGCTCCGGAG GCGGCCGTCA CGCCGGGCCC GGTCGAGTCG
CTCGACCCCG CCAAAACTCC TGCGCAGGGC CGGAACGTCC AGTTCAAGAG CGCGTACGGC
ACCTGCCTGA CCGCGTCCGG CACCAGGGTG CGCCTCGGCG CCTGCGACAC CAGGTGGAAC
GTCGTGACGG TCGGCAAGCA CGTGCAGGTA CGCCACCAGA ACCGATGCAT GGCGCTCGGC
AAGGTCAGCG GTGGCAAGCG CTCGGTCGTC CTGGCCAAGT GCGGCACGGC CGCCAAGGGG
CAGCGCTGGT TGCTCGAGAA GGCCGGCGGC TCGGTCACGC TGAAGAGCGC GACCACAAAG
GCCACCCGGC TCATCGCCTT CACCGCCAAG CCTGCCAGCG TCTACGCCAA AGCCGCGTAC
CAGAAGAATT CCATCAAATT TATAATCAGA TAA
 
Protein sequence
MSKMRSTLVL LLLSMGGAVL AAPLPAVAAA PAAQSRFACD EVDAKDKGWV LPIYVHQPGQ 
DAWEDDSAAL LNTIWETDQT VDSSAERFGG SRRLRFVQDG DCRPVVAKLP FTKGRNRAEM
GKAMAENLAS QPALVRTLWQ TNRVKPLYFV RDNEITDSCT GGGANAGLST GNVILPRWCW
SEAGLTHELI HSFGLSHCDG GGVNGNDPVC RNMGTRKECT SDLAANYHLD SCRIDEFRYF
EPTPVRQPEL EKIRNVAFSP YLIQNQPSPV WQFRIKVVDS GRCLDASAAQ VVQRACTDSS
AQTWQRSIDD DGYLTIRNAA NGRCLTMADT VVTGPCAKKD KSQQWLPQAG QDRTNFAGRA
GGKLSVKDNR DGGAVVRDGK GEFVTELLGG LASSPTQPNT PAPTATSQPT AEPTARPTPR
PTAAPTSAPE AAVTPGPVES LDPAKTPAQG RNVQFKSAYG TCLTASGTRV RLGACDTRWN
VVTVGKHVQV RHQNRCMALG KVSGGKRSVV LAKCGTAAKG QRWLLEKAGG SVTLKSATTK
ATRLIAFTAK PASVYAKAAY QKNSIKFIIR