Gene Information |
Locus tag | Sros_1894 |
Symbol | |
ID | 8665172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 2016746 |
End bp | 2019661 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | Phosphoenolpyruvate synthase/pyruvate phosphate dikinase-like protein |
Protein accession | YP_003337625 |
Protein GI | 271963429 |
COG category | |
COG ID | |
TIGRFAM ID | |
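
The coordinate and length fields in this table can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2016746
END_BP = 2019661
GENE_LENGTH_BP = 2916
PROTEIN_LENGTH_AA = 971


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2916
print(expected_protein_length(GENE_LENGTH_BP))  # 971
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields listed above.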
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGGCA CCCCACTGAT CCTGAAATTT GATGACATCG ATGCGGGGAT GCTGCCTCTC GTCGGCGGCA AGGCCGCGAA CCTCGGCGTG CTCACCGCTG CCGGGCTCCC CGTCCCCCCG GGACTCTGCG TCACCACCGA GGCCTACCGG AGGGTCACCG AGCAGGCAGG GCTGGAGGAC GTGCTCGACG CCCTCGCCGT CACGGCCGCC GGGGAGACCC GGGTTCTGAA CGAGCTGGCC GGCCGGGCCC GCGAGCTCGT GCTGTCCGCC CCGGTGCCCG CCGACATCGC CGACGCCGTA CGGCGGAGCG CGCACGGCCC CGTCGCGGTC CGCTCCTCGG CCACCGCCGA GGACCTGCCG CACGCCAGCT TCGCCGGCCA GCAGGACACC TACCTCAACG TGATCGGCGC CGACGCCGTG CTGGACGCGG TCCGGCGCTG CTGGGCCTCG CTCTGGACCG ACCGGGCCGT GGCCTACCGC GCCGCCAACG GCATCGACCA CCGGGCCGTG CTGCTGGCCG TCGTGATCCA GGAGATGGTC CAGTCGGAGG TCGCCGGGGT GATGTTCACC GCCAACCCGG TCACCGGACG GCGGCGCGAG GCGGTCATCG ACGCGAGTCC GGGCCTCGGC GAGGCCGTGG TGTCCGGCGC GGTCAATCCC GACCACTTCG TGGTGGACAC CGCCACCGGG CGGATCACCG CGCGGCGGCT GGGCGACAAG CGGCTGGCCG TACGGTCCCT GCCCGGTGGC GGCGTCGAAC ACGTCGAGAC CGCCGTGGAG GGCGCCTGCG TCACCGACGC CCAGGTCCGG GCGCTGGCCG AGCTGGGCGG CCGGGTCGAG GACCACTACG GCTCCCCGCA GGACACCGAG TGGGCCGTCG ACGCCGGCGG CGCCCTCTGG CTGACCCAGG CACGGCCGAT CACCACCCTC TTCCCGATCC CCCGGCACCC GCGTCCGGCC GGAGCGGGTC ACCCGGAGGA CCGGACCACC CGTCTCCGCG CCGGGCACGG AGGGTCGGGC GGCTCCGGCC CGGACGGCGG GGCGCAGGGC GGCGCGCGGA TCTACTTCTC CTTCAGCGTG GCCCAGGGCA TCTACGCGCC GATCACCCCG ATGGGCATGT CGGCCTTCCG GCTGCTGTCG TCGTCGGCCG CCGCCCTGCT GGGCACCCCG GTCACCGACC GGCTGGACGG GGCGCCCCAG TTCGCCGAGG CGGCCAGCCG GATGTTCATC GACGTCACGG GGATCATGCG CAGCCGGGTG GGCCGGATGG GCCTGCCCCG GGTGCTCGAC GTGATGGAGG CCAGGTCGGC CACCGTGCTG CGCGGCCTGT CCGACGATCC CCGCTTCAGC GTGACCCAGC GCTCGGTGCG CCCCGCCCTG CGCCGGCTCG TCAGGAACGC GATCCGCTTC CGCATCCCGG CACGCGCCGC GCAGGCCCTC GTGATGCCCG AGAAGGCGCA CCGGCGTGCG GAGCGGCTGG GAGTACGGCT CCGGACACAG CCGGCCGCTC CGGCGGGCGC CACCGCCCTC GAACGGCTCG ACCACGTCGA ACGGATCCTG GGCACCCGTG CCGTCCCCCT GCTGCCGACC GTCCTGCCGG GGCCGCTGGC CGGGTTCGCC ATGCTCGGCC TGGCCTACCG CCTCCTCGGC GACCGCGCCC GGCCCGGTGA GCTCCAGACC GTCCTGCGGG GACTGCCGCA CAACGTGACC ACCGAGATGG ACCTGGCGCT GTGGCACCTG GCCACCCGGA TCCGCGCCGA CCGGGAGGCC GCCTCCCTGC TGCTCGGCAC ACCGGCCGCC 
GAGCTGGCGG CCCGCTTCGG CGCCGGGTCG CTGCCCGGCG TGGTGGACCG GGGCCTGAAG GAGTTCCTGT CCGTCTACGG CGTCCGCGCG GTCGCCGAGA TCGACCTCGG CGTGCCCCGC TGGTCGGAGG ATCCGACGCA CGTCATCGGC GTCCTGGCCA ACTATCTCCG CCTGGAGGAC CCCGCGCTCT CCCCCGACGC CCTGTTCGCC AGGGGCGCCG CGGAGGCCGT TCTCATGATC AAAACCCTGA GCGCCCGCGT GGGGGGCGTC CGGGGCCGGG TCGTCCGCTT CGCCCTGGGC CGGGCCCGGG CCCTGGCTGG CGTGCGCGAG CTGCCCAAGT TCTACATGGT GACCATCCTG GCCGCCATGC GCGCCGAGCT GGTGACCGTC GGCGCCGACC TGACCGCCCG CGGCCTCCTC GACTCGCCCC GGGACATCTT CTTCCTGACC CTGGAAGAGG CCCGCACCGC CCTCACGGCC GCATCCGGGA AAACCACCCC CGAACAGAAC GCACCGACCA CCCCCGAGGG GAGCACACCG GCCACCCCCG GACAGAGCAC ACCGACCACC CCCGGGGGAA GAACGCCGAC CACCCCCGGA GGGAGAGCGC CGACCGCCCC CGGACAGAGC GCACCGACCA CCCCCGGCGA CGCGGCGCCG CCGTACCCCG CGGGGCTGCG GGCGCTCGTC TCCGAGCGGC GCGAGGACGC CGCGCGCGAG CGGCGCCGCA GGCACCTGCC GCGTGTCCTG CTGTCGGACG GCACCGAGCC CGAGGCGGTC GCCACGTCGG CCCCGGTGGA CGGCGCGCTC ACCGGCACAC CGGCCTCCGC GGGCAGCGTC ACGGGGATCG CCCGGGTGGT CCTCGACCCG GTCGGCGCCC ACCTGGAGCC CGGCGAGATC CTGGTCTGCC CTTCCACCGA CCCCGGCTGG ACCCCGCTGT TCCTCACCGC GGGAGGCCTG GTCATGGAGA TGGGCGGCGC CAACTCGCAC GGCGCGGTGG TCGCGCGCGA GTACGGCATC CCGGCCGTCG TGGGCGTCGC CCGCGCGACC GAGCACATCG TCACCGGCCA GCGGATCACC CTGGACGGCA CCTCCGGCGC GGTGATCACG ACCTGA
|
Protein sequence | MYGTPLILKF DDIDAGMLPL VGGKAANLGV LTAAGLPVPP GLCVTTEAYR RVTEQAGLED VLDALAVTAA GETRVLNELA GRARELVLSA PVPADIADAV RRSAHGPVAV RSSATAEDLP HASFAGQQDT YLNVIGADAV LDAVRRCWAS LWTDRAVAYR AANGIDHRAV LLAVVIQEMV QSEVAGVMFT ANPVTGRRRE AVIDASPGLG EAVVSGAVNP DHFVVDTATG RITARRLGDK RLAVRSLPGG GVEHVETAVE GACVTDAQVR ALAELGGRVE DHYGSPQDTE WAVDAGGALW LTQARPITTL FPIPRHPRPA GAGHPEDRTT RLRAGHGGSG GSGPDGGAQG GARIYFSFSV AQGIYAPITP MGMSAFRLLS SSAAALLGTP VTDRLDGAPQ FAEAASRMFI DVTGIMRSRV GRMGLPRVLD VMEARSATVL RGLSDDPRFS VTQRSVRPAL RRLVRNAIRF RIPARAAQAL VMPEKAHRRA ERLGVRLRTQ PAAPAGATAL ERLDHVERIL GTRAVPLLPT VLPGPLAGFA MLGLAYRLLG DRARPGELQT VLRGLPHNVT TEMDLALWHL ATRIRADREA ASLLLGTPAA ELAARFGAGS LPGVVDRGLK EFLSVYGVRA VAEIDLGVPR WSEDPTHVIG VLANYLRLED PALSPDALFA RGAAEAVLMI KTLSARVGGV RGRVVRFALG RARALAGVRE LPKFYMVTIL AAMRAELVTV GADLTARGLL DSPRDIFFLT LEEARTALTA ASGKTTPEQN APTTPEGSTP ATPGQSTPTT PGGRTPTTPG GRAPTAPGQS APTTPGDAAP PYPAGLRALV SERREDAARE RRRRHLPRVL LSDGTEPEAV ATSAPVDGAL TGTPASAGSV TGIARVVLDP VGAHLEPGEI LVCPSTDPGW TPLFLTAGGL VMEMGGANSH GAVVAREYGI PAVVGVARAT EHIVTGQRIT LDGTSGAVIT T
|
| |
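
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the gene sequence for comparison with the protein sequence. It is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted), and this gene begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGTACGGCA CCCCACTGAT CCTGAAATTT"
print(translate(first_blocks))    # MYGTPLILKF
print(gc_fraction("ATGTACGGCA"))  # 0.5
```

The translated fragment matches the first ten residues of the protein sequence. Passing the complete gene sequence to `gc_fraction` and `translate` is one way to check the record's GC content and protein sequence fields.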