Gene Information |
Locus tag | Sros_6496 |
Symbol | |
ID | 8669805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 7123980 |
End bp | 7127135 |
Gene Length | 3156 bp |
Protein Length | 1051 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003341953 |
Protein GI | 271967757 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
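The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 7123980
END_BP = 7127135
GENE_LENGTH_BP = 3156
PROTEIN_LENGTH_AA = 1051


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced CDS whose length is a multiple of three.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 3156
print(expected_protein_length(GENE_LENGTH_BP))  # 1051
```

Both results agree with the Gene Length and Protein Length fields of this record.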
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAATC ATAGTAAAAG GACCGCTTTT GGATGGAGAC GGGCGCTCCG AATTGCGCCT TTCGTCTTCG GGCTCAGCGT TTTGATCAGC CCGCTCGGGG TCGCCCAGGC GCTGACACCC GCCCCGGAGG AAAAGCCCGC GGGGGCGGTG GCCCCTCCTT CGCAGACGAC CGCCGCCGGC CACAGGAACG AGGCCCGCCC ACCGGCCGAC ACCGATGACG GCAGGCACGA AGTCCGCCCG CCGGCCGACG CCGACGGCGA GAGGGCGGGC GACAACAACA GGAGAAGGCC TCTGTCCGCT CACACGCGGG CGCTCCGCGC CGCCTGCCCG CCGACTTCCG TCATCTGCCT GGAGAACAGC CTGCCCGGCA ACCCTCCCAG TGAATGGGAC ATCCCCGGAT CCGGGTTCGG TGACATAGAT GGCTATACAA CCCAGATAAG CGTCAACAAA GGGCAGACCG TACAGTTCAA GGTCCTCACC GAGGCGACCG ACTACCGTGT GGACATCTAC CGCATCGGTT ACTACAGCGG CATGGGCGCA CGGTTGATCA CAACGGTCTC CCCCTCGGTG CCCCTGCCCC AGATTCAGCC GGACTGCTCG CAGGACGTGA CCACCGGACT GCTCGACTGC GGCAACTGGG CGGTGTCCGC CTCCTGGGCG GTTCCGGCGA GCGCCGTCTC CGGGGTCTAC ATCGCGAACC TCATCCGGGA GGACGGGACA CCCGGCGCCG GCCAGATGAT ATTCGTGGTC CGGGACGACG AGCGCGGGTC AGAGCTGCTG ATGCAGACCT CCGATGCCAC CTGGCAGGCC TACAACAGGT TCGGCGGCAG CAGCCTCTAC ACCGGCATTC CGGTCGGGCG CGCCTTCAAG GTCAGCTACA ACCGTCCGTT CACCACCCGG GCCACCAATC CGGAGAGCTT CTTCTTCAAT TCGGAGTATC CGATGATCAG GTGGCTGGAG GCCAATGCGT ACGACGTCAG CTACACCAGC AACGTCGACA CCACCGCACG GGCCACCGAA TTGCTCGAGC ACAAGATATT CATCTCCGTC GGCCATGACG AGTATTGGTC CAACGAGATG CGGAACAATG TGGAGAACGC GCGCGACAAC GGCCTCGACC TCGCGTTCTT CTCCGGGAAC GAGATTTTCT GGAAGACCCG GTGGGAGGAC AGCGCGGATG GCACCGCCAC CCCGTTCCGG ACGCTTGTCT GCTACAAGGA GACGATCGCC AACGCCAAGA TCGACCCCAG CCCCCAGTGG ACCGGTACCT GGCGCGACCC GCGTTTCTCT CCACCGTCGA ACGGCGGCAG GCCGGAGAAC AGCCTGACCG GGACCCTGTT CAAGGTGAAC GGCATCGTCA ACGACCCGAT GCGCGTGCCC GCCGAGTACG CCCCCATGCG GTTCTGGCGG AACACCTCCG TGGCCACCCT CCAACCGGGA GAGACGGCGG TCTTCCCCGA CGGGGTGCTC GGCTACGAAT GGGATGAGTC ACCCGACAAC GGGGTCGAGC CGATGGGGCC GGTGCGGCTG TCCAGGACGA CGTTCACCAA GCCGTCCAAG ATCCTGCTCA ACTTCGGCAG CGCGTACGGC GCCGGGACCG CCACCCACAG CCTGACGCTC TACCGGAATC CGGGCGGTGC ACTCGTCTTC GGCGCGGGAA CCACCCAGTG GTCGTGGGGG CTCGACGCCG TCCACGACCG GCCGGGCACT CCCACGGACA TCCGGATGCA GCAGGCGACG GTCAACCTCT TCGCCGACAT GGGAGCCCAG CCGGCAAGCC TCCAGCCGAA CCTTGTCCCG 
GCGACGCCGT CGACGGACAC CGCCCCACCG ACCTCGGCGA TCACCAGCCC TTCCAGCGGC GCGACCGTCG CCGGCGGCGT GGCCATCCCG ATCCAGGGAA CGGCGAGCGA CACCGGAGGC GGCGTCGTGG CGGGCGTCGA GATCTCCGTC GACGGCGGCA CCACGTGGTT CCAGACGACG GGCCGCAGCA ACTGGCAGTA CACCTGGACG CCCGGTGCCG CCGGCACGAC GACGATCAGG ACACGGGCGT TCGACGACAT CGGCAACATC CAGGGGACGC CGACCGTGCG GACCGTGACC GTGACCGACG GGTGCCCGTG CTCCGTCTGG GCGCCGACCG GCACGCCCGC CGTCAGTTCG CATCCCGATC CCAAAGCGGT CGAGCTCGGA GTGAAGTTCC AGGCCTCGAG TACCGGCTTC GTCTCGGGAA TCAGGTTCTA CAAGGGCGCG CAGAACACGG GAACGCACAC CGGTAATCTC TGGAGCTCCA CGGGAACCCT CCTGGCGAGG GCGACGTTCA CCAACGAGAC GGCGACCGGA TGGCAGCAGG TGAACTTCTC CAAACCCGTT CAGGTGATCC CCGGCACCAC ATATGTGGCG TCTTACCACA CGACGTCGGG AAATTATTCG ATCACACGGC CGTATTTCAC AACGCAGTAC TCGACCGGTC TGCTCACCGC CCTCGCGAAC AGCACGCCGG GAGGGAACGG CGTTTACCGA TACAGCGCGA CGAGCACTTT CCCGACCACT CCCTACCAGG CCACGAACTA CTGGGTCGAC GTGGTGTTCT CACCGCCGAT CAGCCTGTGG GACGACAGCA CGCTCCCGGC GGTCCCGACC ATGGACGACC CCAAGGCCGT GACGCTCGGT GTGAAGTTCC AGGCCACGCG CACGGGCGCG ATCGAGGGGA TCAGATTCTA CAAAGGCCCG CAGAACACCG GAACCCACGT CGGAAGCCTG TGGACGGCCA GCGGTCAGCT GATCACGAGC GCGACGTTCA CCGACGAGAC AGCGACCGGA TGGCAACAGG TGCTATTTGA TACACCTATT ACAATAACTG CCAACACTAC ATATGTAGCG TCATATCACA CGACATCGGG ATTCTATTCG ATAACACGGC CGTACTTCAC CCAGCAGTAC ACGCGCGGTC CGCTCGTCGC CCTTGCAAGC GCCGCGGCAG GCGGAAACGG GCTGTACAGG TACGGCGCCA CGAACGCCTT CCCGAATGCC AGCTACCAAT CGACCAACTA CTGGGTCGAC GTGCTGTTCC TGGCCGACGC GCCCCTGGCC GCGAACCTCC ACCGGCGTGG CGGCCGGGTT CCCGAGGGAA CCGGATCCCA CAGATCCCGG CGTTGA
|
Protein sequence | MSNHSKRTAF GWRRALRIAP FVFGLSVLIS PLGVAQALTP APEEKPAGAV APPSQTTAAG HRNEARPPAD TDDGRHEVRP PADADGERAG DNNRRRPLSA HTRALRAACP PTSVICLENS LPGNPPSEWD IPGSGFGDID GYTTQISVNK GQTVQFKVLT EATDYRVDIY RIGYYSGMGA RLITTVSPSV PLPQIQPDCS QDVTTGLLDC GNWAVSASWA VPASAVSGVY IANLIREDGT PGAGQMIFVV RDDERGSELL MQTSDATWQA YNRFGGSSLY TGIPVGRAFK VSYNRPFTTR ATNPESFFFN SEYPMIRWLE ANAYDVSYTS NVDTTARATE LLEHKIFISV GHDEYWSNEM RNNVENARDN GLDLAFFSGN EIFWKTRWED SADGTATPFR TLVCYKETIA NAKIDPSPQW TGTWRDPRFS PPSNGGRPEN SLTGTLFKVN GIVNDPMRVP AEYAPMRFWR NTSVATLQPG ETAVFPDGVL GYEWDESPDN GVEPMGPVRL SRTTFTKPSK ILLNFGSAYG AGTATHSLTL YRNPGGALVF GAGTTQWSWG LDAVHDRPGT PTDIRMQQAT VNLFADMGAQ PASLQPNLVP ATPSTDTAPP TSAITSPSSG ATVAGGVAIP IQGTASDTGG GVVAGVEISV DGGTTWFQTT GRSNWQYTWT PGAAGTTTIR TRAFDDIGNI QGTPTVRTVT VTDGCPCSVW APTGTPAVSS HPDPKAVELG VKFQASSTGF VSGIRFYKGA QNTGTHTGNL WSSTGTLLAR ATFTNETATG WQQVNFSKPV QVIPGTTYVA SYHTTSGNYS ITRPYFTTQY STGLLTALAN STPGGNGVYR YSATSTFPTT PYQATNYWVD VVFSPPISLW DDSTLPAVPT MDDPKAVTLG VKFQATRTGA IEGIRFYKGP QNTGTHVGSL WTASGQLITS ATFTDETATG WQQVLFDTPI TITANTTYVA SYHTTSGFYS ITRPYFTQQY TRGPLVALAS AAAGGNGLYR YGATNAFPNA SYQSTNYWVD VLFLADAPLA ANLHRRGGRV PEGTGSHRSR R
|
| |