Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_6547 |
Symbol | |
ID | 8669856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 7181539 |
End bp | 7184676 |
Gene Length | 3138 bp |
Protein Length | 1045 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003342003 |
Protein GI | 271967807 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000462957 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGTATT CACCACGGAA CCTACGCAAC GCGCTGATCG CTCTCACCCT CGCCCTGCCG GCGGCGGTGA CGGCCCTGCC CGCCCCCGCG CTCGCCAGGA CCGAGGCGGG CGTCACCGGG GTCGCCGTCG GCCAGAGCGC CGTCACCGTC ACCGGCAGGG CGAGCGGCCA GGTGGCGCTG TACGCGCTGG ACACCTGGCA GGACCCCGCG AGCCACACGG GCCTGCAGCC GGTCGCCGTC GTCGAGCCCC GCGCCGACGG CGCCTTCAGC GCCGAGGTAC CGCGCATGGA CGGCGCGCAG GACCGGCTGC ACGACAAGTT CCTCGCCGTC GCCGACGGCC AGGTGCTCGG CGACGTGCAC TACGCCGACG ACCTGCGCTT CCCCTCGGCC AACGACGCCC CCTACCCGCA GGTGCCCGGC AAGAAGGGCC TGCAGGTCCA GATGACCGAC GACGCCGAGG AGATCGGCGT CGAGCACGCG GGTATCAACC TGGCACTGGA CTCGGTCATG ATGGCCGGAC CCGGCGCGGC GGGCAACACC ATCGAGCACG TCTCGCAGGG CAGGACGTAC TACTTCGACA AGGGCGCGGT CGCCGGGCTG GACGCACAGG TCAAGCCGCT GTCCGACAAC GGCGTCCTGG TCAACCTGAT CGCCATCGTC TACGACAACA AGGCCGCCAA CTCCGCCGCG CCCAAGCTGA TCCACCCCGA GGCCGAGCGC GGCAAGGGCA CGGTGTACGC CTTCGACGTG AAGACCGCCG AGGGCGAGGG CTACTTCACG GCCGCGATGG AGTTCTTCGC CCAGCGCTAC AGCCGCGCCG ACGCCAAGTA CGGCAGGGCC TGGGGCTGGA TCGTCGGCAA CGAGATCGAC GCCCAGCAGT ACTGGTACAA CATGGGCCCC CAGGACCGCG ACGTCTTCCT GGAGCAGTAC ACCAGGGCCA TGCGGCTGAC CTGGCAGGCC GTGCGCAAGT CCGACGCCAA CGCCCGCGTC TACACCTCGC TGACCCACTT CTGGACCGGC GCCGTGGGCG ACGACCCGAA GTACACCTAC AAGGGCCGTG ACGTGGTGGA GGGCCTCAAC GCCCTGACCA AGGCCCAGGG CGACTTCGAC TGGAACATCG CCCATCACCC GTATCCGGAG AACCTGTTCA ACCCGGCGTT CTGGAACGAC AAGACGGCGA CCGACAGCTT CGACACGCTG CGCATCACGT TCAAGAACAT CGAGCTGCTG CCGCGCTACC TGGCCCAGCA GCACCTGCTC CACGACGGGC GGCCGCGCCG GGTGATCCTT TCGGAGCAGG GCCTGAACTC CCAGGACTAC ACCGACGAGC AGCTCAAGCT GCAGGCGGCG GCCTACGCCT ACGCCTACTA CAAGATCGCA TTTGCTGAGG GGATCGACTC GTTCATCCTG CACAGGCACG TCGACCACAA GCAGGAGGGC GGCCTCCGCC TCGGCCTGTG GACATGGGAC GACGGGCACG CCGCGCCGTC CAACGCCGGC GACCGCAAGC CGGTGTACGA AGTGTTCAAG TACATCGACA CCGAGCGCTC GCTCGAGGTG ACCGAGTTCG CCAAGAAGAT CATCGGGATC TCCGACTGGA AGGACGTCAT CCCCGGCTTC GACCCGGCGA AGCTGGCGAC CGGGAAGCTC CCGGCCACCG TCGGCGTACA GCTGGACGCC AGGCCCGCGC TGGAGCGCGT GGTCTCCGAC TTCGAGCGGG ACACCGGCGG CTGGCGGCCC TCCGACAACG CGGCCGCCGT CGAACGGGTG GCCGTCGACG GCGGGCACGC GCTGCGCGTG CGCTTCGACC GCGACCTGCC GGGCTGGTCG ATGTACGCCA AGTCGTACAA GGGGACCGAT CTGGCGCTCG ACCGGCCGCT CGACGCCTCG CTCAGGTCAC AGCTGTCGGT GTCGGTACGG GTGCCGGAGA ACGCCGGCGA CGGCTTCGAG CCGGGCAACG CCTTCTCCGC GAAGGTCCGC GTCTACGGCC CGGACGGCGA GGTCGCCGAG GGCGTCGGCG CGATCGACCC CGCCCGCGGC TGGAACCGGC TCACCCTCGA CCTGTCCCGC TGGGCCGGCC GCAAGGCGAT CTCCCGGGTC AAGGTGTGGG TACGCGGCTC GGTCGGCTCC GACTGGGCGG GCTCGTACGA GATCGACAGG CTGAGCCTGG CGGCCGCCGC GGTCCCGGCG GCCGACCGGC GCAACGTCGA GATCACCGCC GCGACGGGCG AGCGCGGCCG GATCGGCTCC ACGGTGAGCT TCACCGTGAC CAACCACGAT GTGCTGCCCA TGGCCGGCAA GGTCACTTTG CGGGCCTGCG ACGGGGTGAG CCTCACCCCG GCCTCGCTCG GGGTGGGCGG CCTGCGGACC GGCGCCGGCC GTACCTTCAC CACCGAGCTG ACCGCCTACG CGCCGGCCGA CCGGGAGCAC CCGGTGGTGT GCGCGGACTA CCTCAAGCAG GAGCAGCGGG TCACCTTCCG GCTGCCGCCC GAGGCGGCCT ACGTGCCGCC CTCTCCCGAC TCCTTCGCCA ACCGGGAGCT GTCCGACGGC TTCGACACCG ACAGCTCGGC CCGCTACCGG ACGCACCGGG TCGAGCCGGA GAACAAGGAG GTCCCGCAGG TGAACGTCGG CGGCGGCACG CTGTCGGCGA GCCACGCCTC CGCGAGCTGG TTCGGGCTGC TCTCCTCGGA CGTCTCGCCG CGCAACCCGG CCTTCAGCAC CGCGATCACG GTCAAGGGAT TCCAGGGGAA CTCCACGGAC ATGGACACGG TCTACACCGG CCTGGTGAAG GACGGGCGGA CGGACGTGGT GGCCTTCTAC GTCAACACCC ACAAGTTCGC CGGGTTCGAG GTGCGCGGGC CCCAGGTCCC GGGCGGTCTC GCGGTGTTCG GCGTCAAGCA GGGGGTGTCG ATCCCGGACG GCGGCCGGTT CGCCCTGTCG CTGGTCGGGG ACCGGGCGGC CATGTACGCC GACTCCGGCG ACGGCTGGCG GCTGGTCACC GCCGCCACGC TGGAGCACCT GCCCCGGCTG ACGGACCCGG ACGTGCGGGC GGAATACCGC TACGGCTTCG GCGTCCGCGG TGACGCCGGC GCCGCCCCGC TGGTGCTCGA CGCGGTCGAG GGCCGTAGCA TCTCCTAG
|
Protein sequence | MPYSPRNLRN ALIALTLALP AAVTALPAPA LARTEAGVTG VAVGQSAVTV TGRASGQVAL YALDTWQDPA SHTGLQPVAV VEPRADGAFS AEVPRMDGAQ DRLHDKFLAV ADGQVLGDVH YADDLRFPSA NDAPYPQVPG KKGLQVQMTD DAEEIGVEHA GINLALDSVM MAGPGAAGNT IEHVSQGRTY YFDKGAVAGL DAQVKPLSDN GVLVNLIAIV YDNKAANSAA PKLIHPEAER GKGTVYAFDV KTAEGEGYFT AAMEFFAQRY SRADAKYGRA WGWIVGNEID AQQYWYNMGP QDRDVFLEQY TRAMRLTWQA VRKSDANARV YTSLTHFWTG AVGDDPKYTY KGRDVVEGLN ALTKAQGDFD WNIAHHPYPE NLFNPAFWND KTATDSFDTL RITFKNIELL PRYLAQQHLL HDGRPRRVIL SEQGLNSQDY TDEQLKLQAA AYAYAYYKIA FAEGIDSFIL HRHVDHKQEG GLRLGLWTWD DGHAAPSNAG DRKPVYEVFK YIDTERSLEV TEFAKKIIGI SDWKDVIPGF DPAKLATGKL PATVGVQLDA RPALERVVSD FERDTGGWRP SDNAAAVERV AVDGGHALRV RFDRDLPGWS MYAKSYKGTD LALDRPLDAS LRSQLSVSVR VPENAGDGFE PGNAFSAKVR VYGPDGEVAE GVGAIDPARG WNRLTLDLSR WAGRKAISRV KVWVRGSVGS DWAGSYEIDR LSLAAAAVPA ADRRNVEITA ATGERGRIGS TVSFTVTNHD VLPMAGKVTL RACDGVSLTP ASLGVGGLRT GAGRTFTTEL TAYAPADREH PVVCADYLKQ EQRVTFRLPP EAAYVPPSPD SFANRELSDG FDTDSSARYR THRVEPENKE VPQVNVGGGT LSASHASASW FGLLSSDVSP RNPAFSTAIT VKGFQGNSTD MDTVYTGLVK DGRTDVVAFY VNTHKFAGFE VRGPQVPGGL AVFGVKQGVS IPDGGRFALS LVGDRAAMYA DSGDGWRLVT AATLEHLPRL TDPDVRAEYR YGFGVRGDAG AAPLVLDAVE GRSIS
|
| |