Gene Sros_1894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1894 
Symbol 
ID8665172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2016746 
End bp2019661 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content75% 
IMG OID 
ProductPhosphoenolpyruvate synthase/pyruvate phosphate dikinase-like protein 
Protein accessionYP_003337625 
Protein GI271963429 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGGCA CCCCACTGAT CCTGAAATTT GATGACATCG ATGCGGGGAT GCTGCCTCTC 
GTCGGCGGCA AGGCCGCGAA CCTCGGCGTG CTCACCGCTG CCGGGCTCCC CGTCCCCCCG
GGACTCTGCG TCACCACCGA GGCCTACCGG AGGGTCACCG AGCAGGCAGG GCTGGAGGAC
GTGCTCGACG CCCTCGCCGT CACGGCCGCC GGGGAGACCC GGGTTCTGAA CGAGCTGGCC
GGCCGGGCCC GCGAGCTCGT GCTGTCCGCC CCGGTGCCCG CCGACATCGC CGACGCCGTA
CGGCGGAGCG CGCACGGCCC CGTCGCGGTC CGCTCCTCGG CCACCGCCGA GGACCTGCCG
CACGCCAGCT TCGCCGGCCA GCAGGACACC TACCTCAACG TGATCGGCGC CGACGCCGTG
CTGGACGCGG TCCGGCGCTG CTGGGCCTCG CTCTGGACCG ACCGGGCCGT GGCCTACCGC
GCCGCCAACG GCATCGACCA CCGGGCCGTG CTGCTGGCCG TCGTGATCCA GGAGATGGTC
CAGTCGGAGG TCGCCGGGGT GATGTTCACC GCCAACCCGG TCACCGGACG GCGGCGCGAG
GCGGTCATCG ACGCGAGTCC GGGCCTCGGC GAGGCCGTGG TGTCCGGCGC GGTCAATCCC
GACCACTTCG TGGTGGACAC CGCCACCGGG CGGATCACCG CGCGGCGGCT GGGCGACAAG
CGGCTGGCCG TACGGTCCCT GCCCGGTGGC GGCGTCGAAC ACGTCGAGAC CGCCGTGGAG
GGCGCCTGCG TCACCGACGC CCAGGTCCGG GCGCTGGCCG AGCTGGGCGG CCGGGTCGAG
GACCACTACG GCTCCCCGCA GGACACCGAG TGGGCCGTCG ACGCCGGCGG CGCCCTCTGG
CTGACCCAGG CACGGCCGAT CACCACCCTC TTCCCGATCC CCCGGCACCC GCGTCCGGCC
GGAGCGGGTC ACCCGGAGGA CCGGACCACC CGTCTCCGCG CCGGGCACGG AGGGTCGGGC
GGCTCCGGCC CGGACGGCGG GGCGCAGGGC GGCGCGCGGA TCTACTTCTC CTTCAGCGTG
GCCCAGGGCA TCTACGCGCC GATCACCCCG ATGGGCATGT CGGCCTTCCG GCTGCTGTCG
TCGTCGGCCG CCGCCCTGCT GGGCACCCCG GTCACCGACC GGCTGGACGG GGCGCCCCAG
TTCGCCGAGG CGGCCAGCCG GATGTTCATC GACGTCACGG GGATCATGCG CAGCCGGGTG
GGCCGGATGG GCCTGCCCCG GGTGCTCGAC GTGATGGAGG CCAGGTCGGC CACCGTGCTG
CGCGGCCTGT CCGACGATCC CCGCTTCAGC GTGACCCAGC GCTCGGTGCG CCCCGCCCTG
CGCCGGCTCG TCAGGAACGC GATCCGCTTC CGCATCCCGG CACGCGCCGC GCAGGCCCTC
GTGATGCCCG AGAAGGCGCA CCGGCGTGCG GAGCGGCTGG GAGTACGGCT CCGGACACAG
CCGGCCGCTC CGGCGGGCGC CACCGCCCTC GAACGGCTCG ACCACGTCGA ACGGATCCTG
GGCACCCGTG CCGTCCCCCT GCTGCCGACC GTCCTGCCGG GGCCGCTGGC CGGGTTCGCC
ATGCTCGGCC TGGCCTACCG CCTCCTCGGC GACCGCGCCC GGCCCGGTGA GCTCCAGACC
GTCCTGCGGG GACTGCCGCA CAACGTGACC ACCGAGATGG ACCTGGCGCT GTGGCACCTG
GCCACCCGGA TCCGCGCCGA CCGGGAGGCC GCCTCCCTGC TGCTCGGCAC ACCGGCCGCC
GAGCTGGCGG CCCGCTTCGG CGCCGGGTCG CTGCCCGGCG TGGTGGACCG GGGCCTGAAG
GAGTTCCTGT CCGTCTACGG CGTCCGCGCG GTCGCCGAGA TCGACCTCGG CGTGCCCCGC
TGGTCGGAGG ATCCGACGCA CGTCATCGGC GTCCTGGCCA ACTATCTCCG CCTGGAGGAC
CCCGCGCTCT CCCCCGACGC CCTGTTCGCC AGGGGCGCCG CGGAGGCCGT TCTCATGATC
AAAACCCTGA GCGCCCGCGT GGGGGGCGTC CGGGGCCGGG TCGTCCGCTT CGCCCTGGGC
CGGGCCCGGG CCCTGGCTGG CGTGCGCGAG CTGCCCAAGT TCTACATGGT GACCATCCTG
GCCGCCATGC GCGCCGAGCT GGTGACCGTC GGCGCCGACC TGACCGCCCG CGGCCTCCTC
GACTCGCCCC GGGACATCTT CTTCCTGACC CTGGAAGAGG CCCGCACCGC CCTCACGGCC
GCATCCGGGA AAACCACCCC CGAACAGAAC GCACCGACCA CCCCCGAGGG GAGCACACCG
GCCACCCCCG GACAGAGCAC ACCGACCACC CCCGGGGGAA GAACGCCGAC CACCCCCGGA
GGGAGAGCGC CGACCGCCCC CGGACAGAGC GCACCGACCA CCCCCGGCGA CGCGGCGCCG
CCGTACCCCG CGGGGCTGCG GGCGCTCGTC TCCGAGCGGC GCGAGGACGC CGCGCGCGAG
CGGCGCCGCA GGCACCTGCC GCGTGTCCTG CTGTCGGACG GCACCGAGCC CGAGGCGGTC
GCCACGTCGG CCCCGGTGGA CGGCGCGCTC ACCGGCACAC CGGCCTCCGC GGGCAGCGTC
ACGGGGATCG CCCGGGTGGT CCTCGACCCG GTCGGCGCCC ACCTGGAGCC CGGCGAGATC
CTGGTCTGCC CTTCCACCGA CCCCGGCTGG ACCCCGCTGT TCCTCACCGC GGGAGGCCTG
GTCATGGAGA TGGGCGGCGC CAACTCGCAC GGCGCGGTGG TCGCGCGCGA GTACGGCATC
CCGGCCGTCG TGGGCGTCGC CCGCGCGACC GAGCACATCG TCACCGGCCA GCGGATCACC
CTGGACGGCA CCTCCGGCGC GGTGATCACG ACCTGA
 
Protein sequence
MYGTPLILKF DDIDAGMLPL VGGKAANLGV LTAAGLPVPP GLCVTTEAYR RVTEQAGLED 
VLDALAVTAA GETRVLNELA GRARELVLSA PVPADIADAV RRSAHGPVAV RSSATAEDLP
HASFAGQQDT YLNVIGADAV LDAVRRCWAS LWTDRAVAYR AANGIDHRAV LLAVVIQEMV
QSEVAGVMFT ANPVTGRRRE AVIDASPGLG EAVVSGAVNP DHFVVDTATG RITARRLGDK
RLAVRSLPGG GVEHVETAVE GACVTDAQVR ALAELGGRVE DHYGSPQDTE WAVDAGGALW
LTQARPITTL FPIPRHPRPA GAGHPEDRTT RLRAGHGGSG GSGPDGGAQG GARIYFSFSV
AQGIYAPITP MGMSAFRLLS SSAAALLGTP VTDRLDGAPQ FAEAASRMFI DVTGIMRSRV
GRMGLPRVLD VMEARSATVL RGLSDDPRFS VTQRSVRPAL RRLVRNAIRF RIPARAAQAL
VMPEKAHRRA ERLGVRLRTQ PAAPAGATAL ERLDHVERIL GTRAVPLLPT VLPGPLAGFA
MLGLAYRLLG DRARPGELQT VLRGLPHNVT TEMDLALWHL ATRIRADREA ASLLLGTPAA
ELAARFGAGS LPGVVDRGLK EFLSVYGVRA VAEIDLGVPR WSEDPTHVIG VLANYLRLED
PALSPDALFA RGAAEAVLMI KTLSARVGGV RGRVVRFALG RARALAGVRE LPKFYMVTIL
AAMRAELVTV GADLTARGLL DSPRDIFFLT LEEARTALTA ASGKTTPEQN APTTPEGSTP
ATPGQSTPTT PGGRTPTTPG GRAPTAPGQS APTTPGDAAP PYPAGLRALV SERREDAARE
RRRRHLPRVL LSDGTEPEAV ATSAPVDGAL TGTPASAGSV TGIARVVLDP VGAHLEPGEI
LVCPSTDPGW TPLFLTAGGL VMEMGGANSH GAVVAREYGI PAVVGVARAT EHIVTGQRIT
LDGTSGAVIT T