Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4438 |
Symbol | |
ID | 8667732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4951396 |
End bp | 4954665 |
Gene Length | 3270 bp |
Protein Length | 1089 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003340051 |
Protein GI | 271965855 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0339173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00783982 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGCCT TGGCCAGGTT GAGCCTTGCC AACCGCAGCC TCGTCATAAT GATCGCGGTC GTGTTGAGCG CGTTCGGCGC CTTCGCGATC CCGTCGCTGA AACAACAGCT TCTGCCGTCG CTTTCGTTCC CGGGCGCCTT CGTGGTCGCC CCCTACCTGG GTGCCTCTCC CGAGATCGTG GAGGAGCAGG TCACCAAGCC GATCGAGGAC AGCTTCCAGG GCATCGAGGG GGTCACCGAG GTCACCTCCA CCTCCCAGGA GGGGATGTCG CAGGTTCAGG TGGCCTTCGA GTACGGCACC GACATCGACG CGGCGGTGGC GAAGATGCAG CAGGCGGTGG CGCGGATCGG CACCCAGCTG CCCGACGGGG TGGACCCGCA GGTCCTGGCC GGCGGCACCG ACGACATCCC GGTGCTGGTG TTGGCGGTGG GGACCGGCGG TGACGAGCGG GCCATGGCCG ACAAGCTCCA GCGGATCGTG GTGCCGGAGC TCCAGGGCAT CGACGGCGTC CGTGAGGCCA CGGTGACCGG CGCCCGCGAC GAGACCGTGG TGATCACTCC GGATGTCAAG AAGCTCGCCG CGCGCGGGCT GGCGCCGACG GCGGTGACCG ATGCGCTGCG CGCCAACGGG CAACCGGTCC CGGCGGGCAG CCTGGTGCAG GACAAGGCGT CGCTGACGGT TCAGGTGGGC AGCCGGATCG CCTCGATCGA GGATCTGAAG AACCTCTATC TGATCCCCGC GGCTCCGGCG CAGGCCCAGC AGGCTCCGGC CCAGGCCCAG GCGCAGCAGG CCGCCCAGCA GGGACGTCAG ATGCCGGGGC AGCAGAGTGC GGCCGCGGCC CAGCAGCGGG CCCAGCAGGC GCAGCGCCCC GTGGCAGCGC CCAAGCCGGT GAAGCTGGGC GAGGTGGCCG AGGTCGAGCG TTCGCTGGCC ACCAGCACCT CGCTGACGCG CACCAACGGC GAGCCGAGCC TGGGGGTGTC GGTGACGATG ACGCCCGACG GCAACGCGGT GGCGATCTCC CATGCGGTCA ACGAGAAGAA GGCCGAGCTG GTCCGGGCGA TCGGCGATTC GGCGCAGGTG ACGGTGGTCT TCGACCAGGC GCCGTATGTG GAGCGCTCGA TCGAGGACCT GACCACCGAG GGCCTGCTGG GTCTGGTCTT CGCGGTGCTG GTCATCTTGA TCTTCTTGCT GTCGGTGCGT TCGACGCTGG TGACGGCGGT GTCGATCCCG CTGTCGGTGG TCATCGCGCT GATCGCCCTG TGGCTGGGCG ACTACTCCCT CAACATGCTC ACGCTGGGGG CGCTGACGAT CGCGGTCGGG CGGGTGGTGG ACGACTCGAT CGTGGTTTTG GAGAACATCA AGCGGCACCT GGGCTACGGC GAGGCCAAGC GTGATGCGGT GCTGGCGGGC GTGCGTGAGG TGAGCGGGGC GGTGACGGCC TCGACGCTGA CCACGGTGGC GGTGTTCCTG CCGATCGCGG TGGTCGGCGG CATGGTGGGG CAGCTGTTCG CGCCGTTCGC GATCACGGTG ACGGTGGCGC TGCTGGCTTC GCTGCTGGTG TCGCTGACGG TGATCCCGGT GCTGGCCTAC TGGTTTTTGA AGGCGCCGGT GCTCAGCGCC GAGCAGGCGC GCGTGGTGCG TGAGCAGGCG GAGGCCAAGG AGCTGCGCAG TCCGTTGCAG CGGGCCTACA TGCCGGTGTT GCGCTTTGCG ACCAAGCGGC GCCTGGTCAC CGTCCTGGTC GGCGTCGTGA TCTTCGTGGG GACGATGGGC CTGGCCGGCC GGCTGGAGAC CAACTTCCTG GACTCCTCGG GGCAGAACAC GCTCTCGGTG ACGCAGAAGA TGCCCGCCGG CACGGACCTG GCCACCACGG ATGCGGCGGC CAAGAAGGTG GAGGAGGTGC TGGGCTCCCT GAAGGACGTG GAGGGCTACC AGGTCAACGT GGGTGCCGGC GGCGGGTTCA TGGGCGCCAC GGGCGGTGGC GGCGATCGCG CCTCCTACTC GGTGACGGTC GCCGAGGGCG CCGACACCGC GGCGCTGGAG AGCGCACTGC GTGATCGGCT GAAGACGCTG GCCGATATCG GCGAGGTCAC CGTCGGTGGC GCGCAGGGCG CCCCGGGCGG CGGTTCGACC AACCAGCTTC AGGTGATCGT CCAGGCGCCG GACACCGCGA CGTTGCAGAG CGCGGCCGAG GCGGTGCGCG GCGCGATGGC GGGACTTGAC GGGCTGCGCG ATGTCTCCTC CAACCTGCAG GCCAGCGGCC GGCGCGTGGA GGTCCAGGTG GACCGGCGCA AGGCGGCGGC CAAGGGATTG ACCGAGGCGC AGATCGCGCA GGTGGTCGCC CAGCGTTTCC GTGGCGCGCC GCTGGGCCAG ATCACCCTGG ACGGGCGGGC CAGTGATCTG ATCCTGCGCC TCGATGAGGC GCCGGAGGAT CTGAAGGCGA TCGGTGCGCT GAAGCTGCCG TCGGCCGCCG GCGAGGTCGA GTTGTCGGAT GTGGCGAAGG TGGCCAAGGT CGACGGGCCG ACGCAGGTGA CGCGGATCGA CGGTGAGCGC AGCGCGACGG TGTCGGGTAC GGCCACCGGC AGCGACCTGG GCAGTGCCAC CGCGGCGCTG ACGGCGAAGC TGGAGGGGCT GTCGTTGCCG GCGGGGGCGA AGTTCACCAT CGGCGGCGTC AGCGCCGACC AGGAGGAGGC GTTCGGCCAG CTGGGTCTGG CGATGCTGGC GGCCATCGCG ATCGTCTTCA TGATCATGGT GGCGACGTTC CGTAGCTTCG TGCAGCCGCT GGTGCTGCTG GTGTCGGTGC CGTTCGCGGC GACCGGCGCG ATCGGCCTGC TGCTGGCCAC CGGCACGCCG CTGGGCGTTC CGGCGCTGAT CGGCATGCTG ATGCTGATCG GCATCGTGGT GACCAACGCC ATCGTGCTGA TCGACCTGAT CAACCAGTAC CGGGAGCAGG GGATGGGCGT GGTGGAGGCG GTGATCGAGG GCGGCCGGCG GCGCCTGCGG CCGATCCTGA TGACCGCGGT GGCGACGATC TTCGCGTTGA TCCCGATGGC GCTGGGCCTG ACCGGGTCGG GCGGGTTCAT CTCCCAGCCG CTGGCGATCG TGGTGATCGG TGGCCTGCTG TCGTCGACGC TGCTGACGCT GGTGCTGGTG CCGACGCTGT ATGTGATGCT GGAGCGGACC AAGGAGCGGT TCGGCCGCGG GCCTCGGCCC GGCAGGCACG AGGCCGAGGT GAAGGTCTAC GACGCCGACG AGCTGGCCTC GGCCCGGTAG
|
Protein sequence | MTALARLSLA NRSLVIMIAV VLSAFGAFAI PSLKQQLLPS LSFPGAFVVA PYLGASPEIV EEQVTKPIED SFQGIEGVTE VTSTSQEGMS QVQVAFEYGT DIDAAVAKMQ QAVARIGTQL PDGVDPQVLA GGTDDIPVLV LAVGTGGDER AMADKLQRIV VPELQGIDGV REATVTGARD ETVVITPDVK KLAARGLAPT AVTDALRANG QPVPAGSLVQ DKASLTVQVG SRIASIEDLK NLYLIPAAPA QAQQAPAQAQ AQQAAQQGRQ MPGQQSAAAA QQRAQQAQRP VAAPKPVKLG EVAEVERSLA TSTSLTRTNG EPSLGVSVTM TPDGNAVAIS HAVNEKKAEL VRAIGDSAQV TVVFDQAPYV ERSIEDLTTE GLLGLVFAVL VILIFLLSVR STLVTAVSIP LSVVIALIAL WLGDYSLNML TLGALTIAVG RVVDDSIVVL ENIKRHLGYG EAKRDAVLAG VREVSGAVTA STLTTVAVFL PIAVVGGMVG QLFAPFAITV TVALLASLLV SLTVIPVLAY WFLKAPVLSA EQARVVREQA EAKELRSPLQ RAYMPVLRFA TKRRLVTVLV GVVIFVGTMG LAGRLETNFL DSSGQNTLSV TQKMPAGTDL ATTDAAAKKV EEVLGSLKDV EGYQVNVGAG GGFMGATGGG GDRASYSVTV AEGADTAALE SALRDRLKTL ADIGEVTVGG AQGAPGGGST NQLQVIVQAP DTATLQSAAE AVRGAMAGLD GLRDVSSNLQ ASGRRVEVQV DRRKAAAKGL TEAQIAQVVA QRFRGAPLGQ ITLDGRASDL ILRLDEAPED LKAIGALKLP SAAGEVELSD VAKVAKVDGP TQVTRIDGER SATVSGTATG SDLGSATAAL TAKLEGLSLP AGAKFTIGGV SADQEEAFGQ LGLAMLAAIA IVFMIMVATF RSFVQPLVLL VSVPFAATGA IGLLLATGTP LGVPALIGML MLIGIVVTNA IVLIDLINQY REQGMGVVEA VIEGGRRRLR PILMTAVATI FALIPMALGL TGSGGFISQP LAIVVIGGLL SSTLLTLVLV PTLYVMLERT KERFGRGPRP GRHEAEVKVY DADELASAR
|
| |