Gene Sros_8022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8022 
Symbol 
ID8671350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8833948 
End bp8835927 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content69% 
IMG OID 
ProductPeptidyl-dipeptidase Dcp 
Protein accessionYP_003343420 
Protein GI271969224 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTGAGA ACCCGTTCTT CTCCCCCAGC ACGTTGCCCT ACAAGCTGCC CAACTTCGCC 
GAGATCCGGG AGGAGCACTA CCTGCCCGCG TTCGAGCGCG GCATGACCGA CCAACTACGG
GAGGTCGAGG CGATCGCCGG CGACCCCGGC CCGGCGACGT TCGCCAACAC GGTCGAGGCC
CTGGAACGCT CGGGCCAGAT CCTGAAACGG GCGGCGACGG TGTTCATGAG CATCGCCTCC
TCCGACGCCA CCGACGGCAT CCGGGAGATC GAGACCGAGA TCCTCCCCAA GCTGACCAAG
CACGGCGACG CCATCCACCT CAACCGCGCC CTCTACGAGC GGATCAAGCA GGTCGCCACC
GAGGACCCGG AGGAGGCGTG GCTGCTGGAG AAGTACCGCG TGGACTTCGT CCGGGCCGGC
GCCGACCTGT CCGAGGCCGA CCAGGAGCGG CTCAAGGGGC TCAACGAGGA GCTGACCAAG
CTCGCCACCA CCTTCTCGCA GAACCTGCTG ACGGCCTCCA CCGCCTCCGC GCTGGTCGTC
GAGGACGTCA GCGAGCTCGA CGGGCTCTCC GAGAGCGCGA TCAAGGCGAT CGAGAAGGAC
GGCAAGTACG TCCTGCCGCT GCTCAACTTC ACCAACCAGC CCGCCCTGGC CGAGCTGACC
GACCGGCAGA CCCGCAGGAA GCTGTACGAG CTGTCGGTGA GCCGGGCGCC GGAGAACTTC
GACCTGGCCG TCAAGCTGGC GACGCTCCGC GCCGAGCGGG CCGCCCTGCT CGGCTACCCC
AGCCACGCCG CCTACTCCGT CGCCGACCAG ACGGCCAAGA CCACCGACGC CGTCGAGGAG
ATGCTCGGCA AGCTGGTGGG CCCGGCGGTG GCCAACGCCC GCCGCGAGGC CGAGCTGCTG
AGCGAGCAGG CCGGCTTCCC CATCGAGGCG TGGGACTGGT CGTTCTACGC GGAGAAGGTG
CGTAAGGCGC GCTACGACTT CGACAGCTCC GAGCTGCGCC CCTACTTCGA GCTGGACCAG
GTGCTGCGGG ACGGTGTCTT CCACGCCGCC GGCGAGCTGT ACGGGATCAC CTTCGCCGAC
CGGGACGACC TGGCCGGCTA CCACCCGGAC GTACGGGTCT TCGAGGTGTT CAACGCGGAC
GGCTCGCAGC TCGGCCTGTT CGTGTTCGAC CCCTACGCCC GGCCGACCAA GCGCGGCGGC
GCGTGGATGA ACAACCTCGT GGACCAGTCG CACCTGTTCG GCGAGCTGCC CGTGGTGATG
AACAACCTCA ACGTCACCAA GCCCGCCGAG GGCCCGACCC TGCTGACCTT CGACGAGGTC
AACACCGCCT TCCACGAGTT CGGCCACGCC CTGCACGGCC TGTTCTCCGA CGTCCGCTTC
CCCCGGGTCT CCGGCACCGA GGTGCCGCGC GACTTCGTGG AGTACCCCTC GCAGGTCAAC
GAGATGTGGG CGACCTGGCC GTCGGTCCTC GCCAACTTCG CCAGGCACCA CGAGACCGGG
GAGCCGGTTC CGGCGGAGCT GCTGGAGAAG ATGAAGGCCG CCGAGAAGTT CAACCAGGGC
TTCGCGACCG TGGAGTATCT CGCCGCGGCG CTGCTCGACT GGGCCTGGCA CAAGCTGGCC
CCCGGCGAGA CCGTCGAGGA CGCCGAGGCG TTCGAGGCCG CGGCCCTGGA GCGCGCGGGG
ATCGCCTTCG ACCTGGTCCG CACGCGCTAC CGGACCAACT ACTTCGCCCA CATCTTCTCC
AGCGGCGTGG GCGGCTACAG CGCCGGCTAC TACTCCTACA TCTGGAGTGA GGTGCTGGAC
GCCGAGAGCG TCGAGTGGTT CAAGGAGAAC GACGGCCTCA AGCGCGGCAA CGGCGACCAC
TTCCGGTCCG CCCTGCTGTC GGTCGGCGGC TCCATGGACG TGATGGCCGC CTTCCGCAAC
TTCCGCGGTC GCGACCCTCG CATCGAGCCC CTGCTGGAGC GCCGCGGGCT GCTGACCTGA
 
Protein sequence
MAENPFFSPS TLPYKLPNFA EIREEHYLPA FERGMTDQLR EVEAIAGDPG PATFANTVEA 
LERSGQILKR AATVFMSIAS SDATDGIREI ETEILPKLTK HGDAIHLNRA LYERIKQVAT
EDPEEAWLLE KYRVDFVRAG ADLSEADQER LKGLNEELTK LATTFSQNLL TASTASALVV
EDVSELDGLS ESAIKAIEKD GKYVLPLLNF TNQPALAELT DRQTRRKLYE LSVSRAPENF
DLAVKLATLR AERAALLGYP SHAAYSVADQ TAKTTDAVEE MLGKLVGPAV ANARREAELL
SEQAGFPIEA WDWSFYAEKV RKARYDFDSS ELRPYFELDQ VLRDGVFHAA GELYGITFAD
RDDLAGYHPD VRVFEVFNAD GSQLGLFVFD PYARPTKRGG AWMNNLVDQS HLFGELPVVM
NNLNVTKPAE GPTLLTFDEV NTAFHEFGHA LHGLFSDVRF PRVSGTEVPR DFVEYPSQVN
EMWATWPSVL ANFARHHETG EPVPAELLEK MKAAEKFNQG FATVEYLAAA LLDWAWHKLA
PGETVEDAEA FEAAALERAG IAFDLVRTRY RTNYFAHIFS SGVGGYSAGY YSYIWSEVLD
AESVEWFKEN DGLKRGNGDH FRSALLSVGG SMDVMAAFRN FRGRDPRIEP LLERRGLLT