Gene Sros_3160 details


Gene Information

Locus tag: Sros_3160
Symbol:
ID: 8666448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3440697
End bp: 3441962
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: peptidase M20
Protein accession: YP_003338848
Protein GI: 271964652
COG category:
COG ID:
TIGRFAM ID:
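
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical; the values are copied from the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3440697
end_bp = 3441962
gene_length_bp = 1266
protein_length_aa = 421

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last codon is assumed to be the stop.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span_bp == gene_length_bp)                               # coordinates match gene length
print(remainder == 0 and coding_codons == protein_length_aa)   # length matches protein
```

Both checks hold for the values listed here: the span is 1266 bp, which corresponds to 422 codons, or 421 amino acids plus one stop codon.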


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.119571
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAGG TCGCGCAGAT CTGCTCGGAC CTGATCAGGT TCGACACCAC CAACCCCGGC 
TCCGGCGAGC GCCCGGCAGC CGAGCACGTC GCCGGACTGC TGTCCGACGC GGGAATCGAG
CCGACGGTCT TCGAGTCGGC CAGGAACCGC ACCAGCGTGG TCGCCAGGAT CCCGGGCGAC
TCCCCCGAGG CCCTGCTCAT CCACGGCCAC CTGGACGTGG TCCCGGCCGA GCCCGCCGAC
TGGCAGGTGC ACCCGTTCTC CGGCGAGGTC GCCGACGGCT GCGTCTGGGG CCGGGGCGCG
GTGGACATGA AGGGCACGCT GTCGATGACG CTCGCCCTGG TCCGCGACTG GGCCCGGCGC
GGCGTGCGGC CCAAGCGCGA CATCGTGCTG GCCTTCCTCG CCGACGAGGA GGCCACCGGC
GAGTACGGCT CGCGGTACGC GGCGACGCGG CACCGGGAGC TGTTCGACGG CTGCACCGAG
GCGATCAGCG AGTCCGGCGG CTACAGCGTC CAGGCCCCGG ACGCGCGCAT CTACCCCGTC
GCGGTGGGCG AGCGCGGCAC CGCCTGGATG AAGCTCACCG CCCACGGCGT CGCGGGCCAC
GGCTCCCGGC CGCCGAAGGA CAACGCGGTG GCCGAGCTCT GCCACGCCCT GTCGAGGATC
GCCTCCTACC AGTGGCCGGT ACGGCTGACG CCCGGGGTGG CGGCGCTGAT CGCCGGCCTG
GCGGACATCC TCGGCGAGAA AATCGACTAC GACCGCCTGG AGGAGGAGGC CGAGCGGCTC
GGCCAGGCGG GCGCCCTGTT CAAGGCGCAG ATCCGCAACT CGGCCAACCC GACGATGCTG
GAGGCGGGCT ACAAGGTCAA CGTGGTCCCC GGCACCGCGA CCGCGCACGT GGACGGCCGC
TTCCTGCCCG GTTACCGGGA GGAGTTCCTG GAGACGATCG ACCGCCTGCT CGGCCCCAAG
GTCACCCGCG AGTTCGTCAA CATCGAGGAC GCCCCCTCGG CGCCGCTGGA CGCGCCGTTC
TTCGGCCAGC TCTGCGACGC GCTCGTCGCC GAGGACCCGG CCGCGCGGCC GGTGCCGTAC
GTGATGTCGG GCGGCACGGA CGCGAAGTCC TTCGCCGACA TCGGCATCAA GGGCTACGGC
TTCGCACCGC TGATGCTCAG CCCGGAGCTG GACTACTACG GCATGTTCCA CGGCGTGGAC
GAGCGGGTCC CCGTCGAGGG GCTGGAGTTC GGCATGCGCG TCCTGGACCG TCTCCTCGCC
TCCTGA
 
Protein sequence
MNEVAQICSD LIRFDTTNPG SGERPAAEHV AGLLSDAGIE PTVFESARNR TSVVARIPGD 
SPEALLIHGH LDVVPAEPAD WQVHPFSGEV ADGCVWGRGA VDMKGTLSMT LALVRDWARR
GVRPKRDIVL AFLADEEATG EYGSRYAATR HRELFDGCTE AISESGGYSV QAPDARIYPV
AVGERGTAWM KLTAHGVAGH GSRPPKDNAV AELCHALSRI ASYQWPVRLT PGVAALIAGL
ADILGEKIDY DRLEEEAERL GQAGALFKAQ IRNSANPTML EAGYKVNVVP GTATAHVDGR
FLPGYREEFL ETIDRLLGPK VTREFVNIED APSAPLDAPF FGQLCDALVA EDPAARPVPY
VMSGGTDAKS FADIGIKGYG FAPLMLSPEL DYYGMFHGVD ERVPVEGLEF GMRVLDRLLA
S
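
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: the `translate` helper and the `CODON_TABLE` name are hypothetical, and the table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons). Only the first 10-base blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same amino acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAACGAGG TCGCGCAGAT CTGCTCGGAC"
print(translate(first_blocks))  # MNEVAQICSD
```

The result matches the first ten residues of the protein sequence. Passing the whole gene sequence to the same helper would translate the full CDS under the same assumptions.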