Gene Sros_3174 details

Gene Information

Locus tag: Sros_3174
Symbol:
ID: 8666462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3456202
End bp: 3457401
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: Zn-dependent dipeptidase microsomal dipeptidase
Protein accession: YP_003338862
Protein GI: 271964666
COG category:
COG ID:
TIGRFAM ID:
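
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; neither convention is stated in the record, and the function names are hypothetical helpers introduced for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the CDS includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values copied from the record above.
start_bp, end_bp = 3456202, 3457401
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1200
print(protein_length_aa(length))  # 399
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.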


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.845004
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.812253
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTCAC CATCCGCCTC TTACCGCGGT CACCAGGCCT ACGGCTATCT CGAGCCGGGC 
GTCGACTACG CCGACTTCGA GCTGGCCGAG CAGATCGGCC GAGTGCCCGC CTACGACGGT
GGGGTGTCGG CCGAGCAGTC CGAACGGGTC AGCCGGATCA TGGCCGAGCA CATCGTGATC
TCACTGCACG AGCACGCGGT GGTCCTGCCC AAGGACGTCG GTGAACTGCG CCGGTACAAC
CGCACCGGCC GGCAGCGTAC GGGCTACGAA GGTCTGTCGC GTTCGGGCAT GACGGCCGTG
TTCGACAACT TCATGGCCGG GGCGTCGTGC GTCACCAGCG AGAACGGCTG GAAGTGGAAC
GACATGATCT ACGCCCTCGG CCTGCGGCTC GCGGACATCG CCAAGCAGGA CTACGTGGTG
CACGCGCTGA CGGTGGACGA CATCAGGGCA GCCAAGCGCG ACGGCCGGAT GGCGCTGGTG
GCCGGGCTGG AGTCGGCGAC GATGATCGAG AATGAGCTCG ATCGTCTGGA CATCCTGTAC
GGCTTCGGGG TCCGTCAGAT CGGTGTCGCG TATTCGCAGG CCAACCAGTT GGGTTCGGGG
TTGGCCGAGC GGGCCGATGC CGGTCTGACC AATTTCGGCC GTCGTGCGGT GGAGCGGATG
AACCGGCTCG GTATGGCGAT CGACATCTCG CACTCGGGTG ACCGTACGTG TCTGGAGGTC
ATCGAGCATT CGGCGGTGCC GGTCTTCATC ACGCATGCCG GTGCTCGTGC GGTGTGGCCG
ACCAACCGGA TGAAGCCCGA TGAGGTGATC AGGGCGTGTG CCGAGCGTGG TGGTGTGATC
GGTCTGGAGG CGGCTCCGCA CACCACGCTG TCGGAGGAGC ATCGCGAGCA CTCGCTGGAG
TCGGTGATGG ATCACTTCAC CTACTGCGTG GACCTGGTGG GCATCGACCA CGTCACCTTC
GGCCCCGACA CGATGTTCGG CGACCACGTG GGGGTGCACA AGACCTACGC CGGCAACTAC
GCCCAGAACC GCGACGCCGC GCCCGACCAC CCGAACGTCG CCTACGTGGA CGGCCTGGAG
AACCCGGCGG AGAACTTCAC CAACATCGTC GGCTGGCTCG TCAAGCACGG CTACGGCGAT
GATGACATCA GCAAGGTCAT TGGCGGAAAC ACGCTCCGCG TGCTCGATCA TGTCTGGTAG
 
Protein sequence
MQSPSASYRG HQAYGYLEPG VDYADFELAE QIGRVPAYDG GVSAEQSERV SRIMAEHIVI 
SLHEHAVVLP KDVGELRRYN RTGRQRTGYE GLSRSGMTAV FDNFMAGASC VTSENGWKWN
DMIYALGLRL ADIAKQDYVV HALTVDDIRA AKRDGRMALV AGLESATMIE NELDRLDILY
GFGVRQIGVA YSQANQLGSG LAERADAGLT NFGRRAVERM NRLGMAIDIS HSGDRTCLEV
IEHSAVPVFI THAGARAVWP TNRMKPDEVI RACAERGGVI GLEAAPHTTL SEEHREHSLE
SVMDHFTYCV DLVGIDHVTF GPDTMFGDHV GVHKTYAGNY AQNRDAAPDH PNVAYVDGLE
NPAENFTNIV GWLVKHGYGD DDISKVIGGN TLRVLDHVW
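
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step together with a GC-content calculation and a stop-codon check; the function names are hypothetical, and the stop-codon set is the standard TAA/TAG/TGA set, which is also what translation table 11 uses.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCAGTCAC CATCCGCCTC TTACCGCGGT CACCAGGCCT ACGGCTATCT CGAGCCGGGC"
last_line = "GATGACATCA GCAAGGTCAT TGGCGGAAAC ACGCTCCGCG TGCTCGATCA TGTCTGGTAG"

print(join_blocks(first_line).startswith("ATG"))  # start codon present
print(ends_with_stop(join_blocks(last_line)))     # terminal stop codon present
```

Passing the full gene sequence through `join_blocks` and `gc_percent` would give a value to compare against the GC content field in the record.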