Gene Sros_2021 details


Gene Information

Locus tag: Sros_2021
Symbol:
ID: 8665303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2173050
End bp: 2174309
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: putative proline iminopeptidase
Protein accession: YP_003337752
Protein GI: 271963556
COG category:
COG ID:
TIGRFAM ID:
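
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are inclusive and that the CDS ends with a stop codon (the gene sequence in this record ends in TGA). The function name `expected_lengths` is hypothetical and not part of any database API.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a terminal stop codon inside the range.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = expected_lengths(2173050, 2174309)
print(gene_len, protein_len)  # prints "1260 419"
```

Under those assumptions the result agrees with the Gene Length (1260 bp) and Protein Length (419 aa) fields.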


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.198655
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTGTCC TTCTGCCCGG TGTCGCGCTC ACCGACCACG TCTTCAACGT CCCCCTCGAC 
CACGCCGACC CCGGCGGCCC GGCGATCGAG GTCTTCGCCC GCGAGGCGGT GGACCCGGCC
AGGCAGGACC AGGACCTGCC GTGGCTGCTC TTCCTGCAGG GCGGTCCCGG CGGCAAGGCG
CCGAGACCGG TCGCGGCGGA CGGCTGGCTG GGGCACGCGC TGAAGACCCA CCGGGTGCTC
CTGCTCGACC AGCGGGGGAC CGGCCGCAGC ACCCCGCTCA CCGCCAGGAC GGTGACCGGG
ACCGACACCG AGCTCGCCGC CCGTCTCCGG CACTTCCGCG CCGACGCGAT CGTGGCCGAC
GCCGAGCTGA TCCGGCGCGA GCTCTGCGGT GACCGGCCGT GGGAGACCCT CGGCCAGAGC
TACGGCGGCT TCGTCACGCT GACCTACCTC TCCCAGGCAC CCGAGGGCCT CAAGGCCTGT
TACGTGACCG GAGGCCTGGC CGGGCTCGAC GCCACCGCCG ACGACGTCTA CTCCCGCACC
TACCCCAGGG TCCGCGAGAA GACGGACCGC TACTTCGCCC GCTACCCCGA CGACTCCGCC
CGCCTGGACG CGATCGCCGC CCACCTGCGC CGCGAGAAGG TCGAGCTGCC GGACGGCGAC
GTGCTGACCG TGCGCCGCCT GCAGAGCATG GGCCTGTGCC TGGGGATGAG CGACGGCGCG
GAATACCTGC ACTGGGTGCT GGAGGAGGCC TGGAACGGCG AGCGGCTCTC CGACCTGTTC
CTGTACGAGG TCATGATGGC CACCGGCTTC GTCGGCAACC CCCTCTACGC GGTCCTGCAC
GAGTCGATCT ACGCCCAGGG GGGCGCCACC GCCTGGTCGG CGCACCGGCT GCTGCCCGAG
GAGTTCGCCG AGGAGGCCGA GCCGCTGCTG CCCACCGGAG AGATGATCTA CCCCTGGATG
TTCGACGAGA TCGCCGCCCT GCGCCCGTTC AGGGGCGCCG CCGAGATCCT CGCCGCCGCC
TCCGACTGGC CCGCTCTCTA CGACCCGGTA CGGCTGGCGG CCAACCGGGT CCCGGTGGCC
GCCGCCGTCT ACTACGACGA CATGTACGTC GACGAGGACC TGTCCATGCG GACCGCCCGC
ACGGTCGGCA ACGTTCGGAC CTGGGTGACC AACGAGTGGG AGCACGACGG CGTCCGCGTC
TCCGGCGGGC GGGTGCTGGC CCGCCTGATG GACACGGTCA ACGGCGTCCA CGGCTCCTGA
 
Protein sequence
MTVLLPGVAL TDHVFNVPLD HADPGGPAIE VFAREAVDPA RQDQDLPWLL FLQGGPGGKA 
PRPVAADGWL GHALKTHRVL LLDQRGTGRS TPLTARTVTG TDTELAARLR HFRADAIVAD
AELIRRELCG DRPWETLGQS YGGFVTLTYL SQAPEGLKAC YVTGGLAGLD ATADDVYSRT
YPRVREKTDR YFARYPDDSA RLDAIAAHLR REKVELPDGD VLTVRRLQSM GLCLGMSDGA
EYLHWVLEEA WNGERLSDLF LYEVMMATGF VGNPLYAVLH ESIYAQGGAT AWSAHRLLPE
EFAEEAEPLL PTGEMIYPWM FDEIAALRPF RGAAEILAAA SDWPALYDPV RLAANRVPVA
AAVYYDDMYV DEDLSMRTAR TVGNVRTWVT NEWEHDGVRV SGGRVLARLM DTVNGVHGS