Gene Sros_4046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4046 
Symbol 
ID8667340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4503684 
End bp4505363 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content73% 
IMG OID 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_003339697 
Protein GI271965501 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0146598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00136288 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGATCGACC CGGTGACCAC GGAGATCATC CGATGCGCGC TGCTGCAGGC GGCCGAGGAC 
ATGAACACCA CGCTGATCAG GTCCTCCTAC ACTCCGGTGA TCTACGAGGC GAAGGACTGC
GCGGTCGCCC TGCTCGACCG CGACCACAAC GTGCTCGGCC AGTCCTCCGG CCTGCCGATC
TTCCTGGGCA ACCTGGAGGT CTGCACCCGC GAGACCGAGC GCCTGCACGG CCGCCAGGTG
TGGCGGGAGG GCGACGTGTG GGTGCTCAAC GACTCCTACA TCGGCGGCAC CCACCTCAAC
GACGTCACCG TCTACGGGCC GATCTTCGTC GACGGCGAGC TGGCCGGCTT CGCCGCGTCC
CGCGCGCACT GGCTGGACAT GGGGTCCAAG GACCCGGGCG GGTCGATGGA CTCCACCTCG
ATCTTCCAGG AGGGGCTGCG CCTGGGCCCG GTCAGGATCT ACGAGGGCGG CGAGCCCCGC
GCGGACCTGC ACGAGGTGAT CGCCCGCAAC GTGCGGTTCC CCCACGCCAC CCTCGGCGAC
ATGCGGGCCC AGGTGGCCTG CGTGACGACC GGGCGGCGCC GCCTGGCCGA GATCGTCTCC
CGCTGGGGCC TCGACACGGT GACGGCCGCC AGGGACGAGA TCTTCGCGCA GACCGAGCAG
CTGGAGCGGG CCGCGATCGC CGCCGTCCCC GACGGGGTCT ACCGGGCCGG GGGGTTCCTG
GACAACGACG GCATCGACCT GGCCACCCCG CTGCCGGTGA ACGTGACCGT GACCGTGGCC
GGCGAGAGCG TCACGGTGGA CCTCACCGAC TGCGCCGACC AGGCCACCGG GCCGGTCAAC
TGCGGCGCCG CCCAGGCCGT GGCCGCCGTC CGGGTCGCCT ACAAGCTCCT GGTCAGCGGT
GAGGTGCCGC TCAACGGAGG GTCCTTCCGG CCGCTCTCGG TGGCGACCCG GCCCGGTTCG
ATGCTCGCCG CCGAGGCGCC CGCCGCCTGC CAGTACTACT ACTCCCATCT GGGACTGGTC
ATCGACCTGG TGGTCAGGGC CCTCGCCCCG GCGCTGCCCG GCAAGGCCGC GGCGGCCAGC
TACGGCGACT CCCTCATCGT CCAGATGTCC GGGGTGGACC CGCGCACCGG CAAGCTGTAC
GTCTCCCAGG AGGCGACCGT CGGCGGCTGG GGGGCCTGGG ACGGCGGCGA CGGCGAGACC
TGCCTGATCA ACAGCGTCAA CGGCTCCCTG CGCGACATGC CGATCGAGGT GCTGGAGACC
CTCTTCCCGG TCCGCGTGAC CCGCTACGAG ATCCGGCCCG GCACCGGCGG CGCCGGGCGG
TGGCGTGGCG GCAACGGTGT CGTGCGGGAG TACGAGTTCA GCGGGGACAC CTCGCTGTCG
CTGTGGTTCG AGCGCTCGGT CACGCCCGCC TGGGGCCTGG CCGGGGGCGG CGCCGGGGCC
CCGCCCCGGG TGACCCTCAA CCCGGGCAGG CCGGGCGAGC GCGAGATGCT CAAGGTCAAC
GCCCTCGCGG TCCGCAAGGG GGACGTGCTG CGCTGCGAGT CCGGCGGCGG CGGCGGGTAC
GGTCCCGCGG AGCTCCGGGA CGCTCCGGCC CTCGCCCGCG ACCTCGCCGA GGGCATGGTC
ACGCCGCCGG GTCGCGCTCA GGTCGGCCCG GCCGCCGCGC CCGTCCCGCC GGTGAGGTAG
 
Protein sequence
MIDPVTTEII RCALLQAAED MNTTLIRSSY TPVIYEAKDC AVALLDRDHN VLGQSSGLPI 
FLGNLEVCTR ETERLHGRQV WREGDVWVLN DSYIGGTHLN DVTVYGPIFV DGELAGFAAS
RAHWLDMGSK DPGGSMDSTS IFQEGLRLGP VRIYEGGEPR ADLHEVIARN VRFPHATLGD
MRAQVACVTT GRRRLAEIVS RWGLDTVTAA RDEIFAQTEQ LERAAIAAVP DGVYRAGGFL
DNDGIDLATP LPVNVTVTVA GESVTVDLTD CADQATGPVN CGAAQAVAAV RVAYKLLVSG
EVPLNGGSFR PLSVATRPGS MLAAEAPAAC QYYYSHLGLV IDLVVRALAP ALPGKAAAAS
YGDSLIVQMS GVDPRTGKLY VSQEATVGGW GAWDGGDGET CLINSVNGSL RDMPIEVLET
LFPVRVTRYE IRPGTGGAGR WRGGNGVVRE YEFSGDTSLS LWFERSVTPA WGLAGGGAGA
PPRVTLNPGR PGEREMLKVN ALAVRKGDVL RCESGGGGGY GPAELRDAPA LARDLAEGMV
TPPGRAQVGP AAAPVPPVR