Gene Sros_1788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1788 
Symbol 
ID8665066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1905552 
End bp1908476 
Gene Length2925 bp 
Protein Length974 aa 
Translation table11 
GC content72% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003337521 
Protein GI271963325 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACGG CGACATCCGA GAGCGGCGGT GCTTCGGCAC GCGACCGCGG CAAGCGGCCT 
GCCGCCCGGT TCGGCTTGCG GAACTGGCGT GTGCGGTCGC GCCTCACGGC TCTGATCCTG
GTGCCGACCG CGGTGGGCGT CGTGCTCGCG GGCACCCGCG TGGTCGCCTC CGTCGACAGC
GTGGCGGGCT ACCAGCGGAC CGCCTCGGCG GCCGACTACT CCGGAGACAT CCGCGACCTG
GCCCAGGCCC TCGGTTTGGA ACGTGACCGG GGTGCGTGGT CCTCCTTCCA GGCGTCGAAC
AAGGGGCTGA AGGACAGCGC CGAGGCGCAG AAGAAGACGG TCGACGCGCT GATCGGCAAG
GTCCGGCTCG ACCTGCAGGC GATCGACGAC TCCTACGGCG TCCGGGCGGC CGAGGCCGCC
AGGGACGTGG GCTACCAGCT CTCCGGACTG AAGGAGCTCC GCCAGATACC GGGGACGAAC
CGTACGGAAC GGTACGGCCT CATGATCGCC CCGCTGCTCC AGCTCCACGA GGAGCTCACG
CTGGTCAGCG ACGACCCGGA GATCATCGGC AACACCCGTG CGCTCAGCGC GCTGGCCTAC
GCCAAGGAGG AGGTCTCCAA GCAGCGGGCC CGCCTGCTGG CCGGCTACTA CGCGCCCTCC
CTGGTCAACG CCCAGGAGAT CGAGAAGTTC ATCGCCTCCC GGTCGCGCAT GGAGGAGTAC
AAGGCGGACT TCGGCGTCGA GGCGAGCCCG GAGAACGGCC AGCTGCTCGT CAAGGCCATG
ATCGACGAGA AGGTGCACCG CGCCGAGCTG ACCAAGTCCC GGGCGATCGT GTTGGCCAGC
GACCCGCGGA ACGAGGCGCG CCTGCTCTCC GACTTCGACG AGGTCAAGCA GTGGTTCTCG
GACAACGGCG CGATCCTCGA CCGGATGAGG AAGGTCGAGG TCAAGGTCGC CGGTGACGTG
ATCACCCGGG CCCGGGTGCT GGAGGAGACC GAGCAGCGCA ACGCGATCAT CGCGTCCGGG
CTCATCCTGC TCCTGCTCCT GCTGGTCCTC GGTCTCACCG TCGTCATCGC CCGCTCGATG
GTCCTGCCGC TGCGCCGCCT GCGCGCCGAG GCACTGGACG TCGCCGGATT CACCCTCCCC
GAGGTCGTGC GCAGGCTCCG GGTCTCCGGA GACTCCCAGA CCCCCGAGAT CGCCTCCATC
TCGGTCGACA CCAAGGACGA GATCGGGGAG GTCGCCAAGG CCTTCGACGA GGTCCACCGG
CAGGCCGTAC GGCTGGCCGC CGAGGAGTCC GAGCTCCGCT CCAACATCAG CGCGATGTTC
GTCAACCTGT CGCGCCGCAC CCAGACCCTG GTCGAGCGGC AGATCTCGCT GATCGACGGC
CTGGAGAAGG GCGAGGAGGA CGGCGGCCGC CTGGCGGACC TGTTCAAGCT CGACCACCTC
GCCACCCGCA TGCGCCGCAA CTCCGAGAAC CTGCTGGTCC TCGCCGGTCA CGAGCCCACC
CGCAGGCGCA GCCAGCCGGC CAAGCTCGTC GACGTCGTGC GCGCCTCGCT GTCGGAGGTC
GAGGACTACG AGCGGGTCCA GGTCAAGGTG CACCGCACGA TCTCGGTCGC CGGGAGCGCC
GCCAACGACA TCGTCCACCT GGTCGCCGAG CTGGTGGAGA ACGCCATCCA GTTCTCGCCC
CGGGCCAGCC AGGTCGTGGT CTCCAGCAGC ATGATCGAGG GCGGTGGCGC GCTGCTCGCG
GTCAGCGACG CCGGCATCGG CATGACCACC GAGGAGCTGA TCGAGACCAA CCGGCGCCTG
GCCGACCCGC CCGTCGTCGA CGTCTCGGTG TCACGGCGGA TGGGCCTGTT CGTGGTCGGC
CGGCTGGCCC TGCGCCACGG CATCCGCGTC CAGCTCCGCC CGCAGGAGGT CGGCGGCCTC
ATCGCGATGG TCCTCTTCCC GCCGGAGCTG ATCGTCGAGG CGATCCAGCC GCCGTCCATC
ACCCCGTCGT GGGGCGCCGA GTCGCGGGCC CAGCAGCCTT CCCCGGACCA GAACCCCTTC
GGGCAGACGC CGTTCGGCCA GACCTCCTTC GGCCAGACGT CGTTCGGGCA GGCCTCGATC
CCCCCGGCCT CCTCGTTCGG GCAGACCTCG TTCGGGCGCA CTCCCGACCC GCAGGCCTTC
CCGTCCGGCT CCACGTCGTT CCCCGGCGAC CAGCCGCTCC CGGACCGCCG GCAGCCGCTC
CCCGCGAGCC CGCCCGGCGC TCCGGCGTCG GCCGGCGACT TCCCCCTCCC CAAGCGCCCG
GTCCCCGGGG CGAGCGGTGG CGGGAGCGTT CCAAGCCCGG GTTTCTCCCC CTGGGGACAG
AACTCCCAGG ACGACCCGGC GACCGCGTCG ATGCCGGCCG TACGGGTCTC CCCGCTCGAA
TCCGAGCAGG AGGAGTTCCT GCCGATCTTC GCGTCCATCG AGTCGGCCTG GTTCCGCCGG
GCCGAGGACA CCGGCGAGCA CGCCGTCGCC GCCGGGGAGG GGGAGAGCGC GCAGCGCGCG
GGCTCCCCGC TGACCGACCC GTTGACCGAC CCACTGGCCG GCCCGCCGGG CTCGGAGGCC
GAGGAGGCCC CCACGCCCCC GGACGGCCTG CCGGTCCGTG AGCCGGTGGC GGCCCAGGAG
GCGGGCGGCT GGCAGACCCC GGCCGACGCC GGGTGGCAGG CGGCCCAGGC GGCGAGCGAT
CCCACCCTCG GCGGGATCAC CTCGGCGGGC CTGCCCAAGC GGACTCCCAA AGCCAACCTG
GTGCCCGGCG CCGCGGCCAG CGTGCCGTCC ACCCCGATGC CGTCTCTCTC GCCCGAACGG
GTACGTAGCC GCTTGTCGAG TTTCCAGCAG GGTGTACGGC GGGGCCGTGC CGAACTGAAC
GAGGACACCG CCAGGAGCCT GGCGGATAGG GAGGAGGGCT CGTGA
 
Protein sequence
MTTATSESGG ASARDRGKRP AARFGLRNWR VRSRLTALIL VPTAVGVVLA GTRVVASVDS 
VAGYQRTASA ADYSGDIRDL AQALGLERDR GAWSSFQASN KGLKDSAEAQ KKTVDALIGK
VRLDLQAIDD SYGVRAAEAA RDVGYQLSGL KELRQIPGTN RTERYGLMIA PLLQLHEELT
LVSDDPEIIG NTRALSALAY AKEEVSKQRA RLLAGYYAPS LVNAQEIEKF IASRSRMEEY
KADFGVEASP ENGQLLVKAM IDEKVHRAEL TKSRAIVLAS DPRNEARLLS DFDEVKQWFS
DNGAILDRMR KVEVKVAGDV ITRARVLEET EQRNAIIASG LILLLLLLVL GLTVVIARSM
VLPLRRLRAE ALDVAGFTLP EVVRRLRVSG DSQTPEIASI SVDTKDEIGE VAKAFDEVHR
QAVRLAAEES ELRSNISAMF VNLSRRTQTL VERQISLIDG LEKGEEDGGR LADLFKLDHL
ATRMRRNSEN LLVLAGHEPT RRRSQPAKLV DVVRASLSEV EDYERVQVKV HRTISVAGSA
ANDIVHLVAE LVENAIQFSP RASQVVVSSS MIEGGGALLA VSDAGIGMTT EELIETNRRL
ADPPVVDVSV SRRMGLFVVG RLALRHGIRV QLRPQEVGGL IAMVLFPPEL IVEAIQPPSI
TPSWGAESRA QQPSPDQNPF GQTPFGQTSF GQTSFGQASI PPASSFGQTS FGRTPDPQAF
PSGSTSFPGD QPLPDRRQPL PASPPGAPAS AGDFPLPKRP VPGASGGGSV PSPGFSPWGQ
NSQDDPATAS MPAVRVSPLE SEQEEFLPIF ASIESAWFRR AEDTGEHAVA AGEGESAQRA
GSPLTDPLTD PLAGPPGSEA EEAPTPPDGL PVREPVAAQE AGGWQTPADA GWQAAQAASD
PTLGGITSAG LPKRTPKANL VPGAAASVPS TPMPSLSPER VRSRLSSFQQ GVRRGRAELN
EDTARSLADR EEGS