Gene Sros_0159 details

Gene Information

Locus tag: Sros_0159
Symbol:
ID: 8663425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 161945
End bp: 164392
Gene Length: 2448 bp
Protein Length: 815 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003335956
Protein GI: 271961760
COG category:
COG ID:
TIGRFAM ID:
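
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record itself: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(161945, 164392)
print(length, protein_length_aa(length))  # prints "2448 815"
```

Both values match the Gene Length and Protein Length fields listed for Sros_0159.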


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGAACC GGATGGGTAG AAGACGTCGC GCTGACCGTC AAATAGTTAC CGGCGACACG 
GAAATATCGC ATGCCCTCTT CGAGAAAATA CAAACGATAG TCGGTCGCCT AAATCCAGGG
GAAGACTTTC TACAGTATGC GAGGCGGAGA GACTCTATCT TGCGCGACAA GGTGATGACT
ATCTCAAACC TGGTGGCCCC TTATGACGCC TTTGATGTTA TTGACTTAAT GCGCCAAAGG
GAGATGCCTT TAATTCTCGA CGGATACCGA GAAAGTCTCG ATGAAGGAAG GGCTGCCGCC
ATCGAGATCG TCGCTCTGAT CCTCCTTGCA AGGGGCCGAC GGGAGGGTGG CAGCCCAAAT
CTCGCAGATG CGCCGAACAG TATCATCCAG GAGTTGCATC ACCACACTTT GGAGATGCTC
GACGTTGGCA CTTTCGCTCT TCTTGCGGAG GGGACGGAGT CCAGTCACGG CCCCTTGGCT
CCACTAGCAG CCGATTATCG TAGCAGTGAG CTGAATATTC GCAATAAGCA ATACGCGCAT
ATTCATGACC GCTTCAACGA AGCACTATTC GGTTCGCCTG TTTGCGGCAG CCTTATATTG
GATGCACTTG GTTTCACATA CGAGGAATTT ATTGCTGTTC GTGAGTCGAT TCGTGATATA
TATGTGGACG GAATCACATC GAATCTGGAC GCACTTGGCG AAGTGGCAAT GAACTGGAAT
GGCGGCAAAG ATGGTTATGA GCAGAACGCC GAAGAGATTG AGACAGGGCG AGCTGCGGCG
ACAAGCATCT TCTTCTTTCC CGGAAATCGC GCCTCAACAA ATCCAGAAGC CGTATCACAA
AAGAGCGGTG TAGGAATTGA TCAAGTAAGG GCGATTCTGC GACTGTTTAG CGTCAGTTTT
GAATCTGCAG ATCCCGTTCA CAGCGTACAA GAATTCTTCG ACGGCAAAAA CGTCTTCTCT
CGCGCTGCTC TCATTTGCGA TGGAGCAGAT AATTTTCTCA CGCTCAGTTC GCCGATCGGA
AATGACTGCT TTCGACACGT TGCAGAAGAT GCGCTTAAAC CTAGCCCCGC ATGGAACAAG
TATGACCGTC TGAGAACACG AGTAAGTGAG GGGCTCGCCA CAGATTATCT CCAATCCCTT
CTGGATTGCG AGGCATCCTA CACTCAACTT AAATACTTCC GGCCTAAGAA AGGTGTCGAA
ACTGCTGCAC TAGGCGCGGC CGCCGTAGGG CTTACCGAGC TGGCTGATGA GGCAGAAGCC
GATGCGCTCT TCCTCATTGA AGATGTAGCC ATATGTGTAG AGGTCAAGGG GCGAAGCGTC
TCTGAACGTG CAAAGCTTGG CATGGTGAAG AGACTAGCGA CCGATTTAGA GGTGACTGTC
GGTGAAGCGG CCTCGCAGGC ACATCGATTG GAGGAACTTA TAACAACCAA TGGGGGCATA
TGGATTTCAA AGGATCAATG GCTTGCGCTT GATCATATAA GCGAGGTTCG GTCTATCGCA
ATTTGTCTTG ACGATATGGG GCCGCTGGCA GTTGCCCTTG ATCAGTTGGT CCGTAGTGGA
ATCCTACAAA CTAAGAAGCT CCCTTGGATA GTATCTCTGC ATGATCTAGC CGTTATAGCA
GAAACACTAG ACCGCCCAAG TGAGTTTCTC CTTTATCTGC GGCGGCGATC CGATTCGGAT
ATATCAAAGC ACTACATCGC CGTCGACGAG TTGGATATGT TCATGCTTTT TCTGGCCGGC
GGTCTATATA TCGAACCGGA CCCTGAGCTC GTCTATAAAC TGCATCCAAC GGCAGGCAAA
CCTACTGGTG CCGCGAGGGC GCGGTATCGA GCGCAAGCTA TTCCCACCCG CGTTGGCACA
CACACCGATG AACTAGACGC TTGGATATAT TATCAAGAGG GTGCAAGTAC AACCGAACAA
ACAAAGCCGA GATTCCGGTC AAATGATGAC GTTCTAAGAA TCGTTGATTT TTTGGCTGAT
GGCCACAAGC CGGGCTGGCT TCGCTTTGGT GCTGATTTGT TGAATCTTTC TTCCGAAGCG
CAGGAGAATT TGTCCACGGG AATGTTAAAG ACAATTCGGC AAACGCAAAT TGATCACCAG
CGACACTCAT TGGCTCAAGG CTACGCCGGC GCTTGGGGCT TTCCGTCGCT TTTCATTTGT
AGCCAACCTA TTGGGTCGAG CCAAAATGAT TGTCTTCAAT GGATGTCCAC CTATATGGTG
GCGAAGAAGC ATCAGCTACA GTCGGATAGA TGTCTTGGCC TTCTCATGTT GGAGAGAGGC
GACATATCTG CGGTCAGATA CGACAACTCT CCAGTTCGGC AATCGGCTGA GCTTGATCAG
CTTGTAGTGG ATATGCAGTT GCAGCCGCTC GATAGAATCG GAAGGACCAT TCCGCCATCA
GCTCGGCGAG CAAAGAAACA GTTGCGAGGA AGTAGGGGAA AGGGCTGA
 
Protein sequence
MGNRMGRRRR ADRQIVTGDT EISHALFEKI QTIVGRLNPG EDFLQYARRR DSILRDKVMT 
ISNLVAPYDA FDVIDLMRQR EMPLILDGYR ESLDEGRAAA IEIVALILLA RGRREGGSPN
LADAPNSIIQ ELHHHTLEML DVGTFALLAE GTESSHGPLA PLAADYRSSE LNIRNKQYAH
IHDRFNEALF GSPVCGSLIL DALGFTYEEF IAVRESIRDI YVDGITSNLD ALGEVAMNWN
GGKDGYEQNA EEIETGRAAA TSIFFFPGNR ASTNPEAVSQ KSGVGIDQVR AILRLFSVSF
ESADPVHSVQ EFFDGKNVFS RAALICDGAD NFLTLSSPIG NDCFRHVAED ALKPSPAWNK
YDRLRTRVSE GLATDYLQSL LDCEASYTQL KYFRPKKGVE TAALGAAAVG LTELADEAEA
DALFLIEDVA ICVEVKGRSV SERAKLGMVK RLATDLEVTV GEAASQAHRL EELITTNGGI
WISKDQWLAL DHISEVRSIA ICLDDMGPLA VALDQLVRSG ILQTKKLPWI VSLHDLAVIA
ETLDRPSEFL LYLRRRSDSD ISKHYIAVDE LDMFMLFLAG GLYIEPDPEL VYKLHPTAGK
PTGAARARYR AQAIPTRVGT HTDELDAWIY YQEGASTTEQ TKPRFRSNDD VLRIVDFLAD
GHKPGWLRFG ADLLNLSSEA QENLSTGMLK TIRQTQIDHQ RHSLAQGYAG AWGFPSLFIC
SQPIGSSQND CLQWMSTYMV AKKHQLQSDR CLGLLMLERG DISAVRYDNS PVRQSAELDQ
LVVDMQLQPL DRIGRTIPPS ARRAKKQLRG SRGKG
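
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. It is an illustration only: the helper names are hypothetical, and only the first line of the gene sequence is used here, so the result is not expected to equal the whole-gene GC content reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (six blocks of 10 bases).
first_line = "ATGGGGAACC GGATGGGTAG AAGACGTCGC GCTGACCGTC AAATAGTTAC CGGCGACACG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two helpers to all lines of the gene sequence would give a length to compare against the Gene Length field and a GC fraction to compare against the GC content field.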