Gene Sros_3166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3166 
Symbol 
ID8666454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3447357 
End bp3448559 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content65% 
IMG OID 
ProductZn-dependent dipeptidase microsomal dipeptidase 
Protein accessionYP_003338854 
Protein GI271964658 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCCC CCAATCCCCG CTACCAGGGA TACCGGTCGT TCGACTATCT GGAGCCACAC 
GCCGACTTCA AGGTCTTCGA CCTCGCCCCC GAAATCGACC GTGTTCCGGC GTACGACCTG
GGCCTGTCGG CCGAGCAGTC CGCGCGGGTC AGCCGCCTCC TGACCGAGCA CATGGCCATC
TCGCTGCACG AGCACCCCAA GGTCCTGACC GCGGACGTCA CGCTGCTGCG CGACTACAAC
CGGACCGGAC GCAACGTGCT CGGGTACGAG GGCCTGGCGC GTTCGGGCAT GACCGCGCTC
TTCGACAACT TCATGAACGG CACCAACTGC GTCACCAGCG AGCACGGCTG GAAGTGGGAC
GACGTCATCT ACGACCTCGG CCTGCGCTTC GCCGACATCG CCAAGCAGGA CTTCGTCGTC
CTGGCCCGCA CGGTCGAGGA GATCGAGAAG GCCAAGGCGG GCGGGCAGCT CGCGCTGGTG
GCCGGGCTGG AGGCGGCGAC GATGATCGAG AATGAGCTCG ATCGTCTGGA CATCCTGTAC
GGCTTCGGGG TCCGTCAGAT CGGTGTCGCG TATTCGCAGG CCAACCAGTT GGGTTCGGGG
TTGGCCGAGC GGGCCGATGC CGGTCTGACC AATTTCGGCC GTCGTGCGGT GGAGCGGATG
AACCGGCTCG GTATGGCGAT CGACATCTCG CACTCGGGTG ACCGTACGTG TCTGGAGGTC
ATCGAGCATT CGGCGGTGCC GGTCTTCATC ACGCATGCCG GTGCTCGTGC GGTGTGGCCG
ACCAACCGGA TGAAGCCCGA TGAGGTGATC AGGGCGTGTG CCGAGCGTGG TGGTGTGATC
GGTCTGGAGG CGGCTCCGCA CACCACGCTG TCGGAGGAGC ATCGCGAGCA CTCGCTGGAG
TCGGTGATGG ATCACTTCAC CTACTGCGTG GACCTGGTGG GCATCGACCA CGTCGCCTTC
GGCCCCGACA CCAACTTCGG TGACCACGTG GGGCTGCACG ACTCCTTCAC CGGTCACCTC
TCGATCGGCC AGGCCCACGG ACACGTCGAG CACCCGCGCG TGCCGTATGT GGCCGGTATG
GAGAACCCGG CGGAGAACTT CACCAACATC GTCGGCTGGC TCGTCAAGCA CGGCTACGGC
GACGACGACA TCAGCAAGGT CATCGGCGGG AACATCCTGC GCGTACTCAA GGAAGTCTGG
TGA
 
Protein sequence
MQPPNPRYQG YRSFDYLEPH ADFKVFDLAP EIDRVPAYDL GLSAEQSARV SRLLTEHMAI 
SLHEHPKVLT ADVTLLRDYN RTGRNVLGYE GLARSGMTAL FDNFMNGTNC VTSEHGWKWD
DVIYDLGLRF ADIAKQDFVV LARTVEEIEK AKAGGQLALV AGLEAATMIE NELDRLDILY
GFGVRQIGVA YSQANQLGSG LAERADAGLT NFGRRAVERM NRLGMAIDIS HSGDRTCLEV
IEHSAVPVFI THAGARAVWP TNRMKPDEVI RACAERGGVI GLEAAPHTTL SEEHREHSLE
SVMDHFTYCV DLVGIDHVAF GPDTNFGDHV GLHDSFTGHL SIGQAHGHVE HPRVPYVAGM
ENPAENFTNI VGWLVKHGYG DDDISKVIGG NILRVLKEVW