Gene Sros_8451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8451 
Symbol 
ID8671785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9323006 
End bp9325075 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content74% 
IMG OID 
Productpeptidase, S9B (dipeptidyl peptidase IV) subfamily 
Protein accessionYP_003343838 
Protein GI271969642 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.695642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGTA TGAGTGAGAG CTTCCCTCGT CTGCACGCAA GGACCCGCCG ATTCACCCTC 
GGGGTGCCGC GTGGCTTCAC CATCGCCCCG GGCGGCGACC GCGTCGTCTT CCTCCGGACG
AAGGGCGGTT CCGACCCGGT CACCTGCCTG TGGGAGTTCG ACGTGGCCGG CGGCAAGGAG
CGGCTGATCG CCGACCCGCG CGCGCTGGCC GTGAACGAGG ACGACCTGCC GGCCGAGGAG
CGGGCCCGCC GCGAGCGCAG CCGGGAGCAG GCGGGCGGCA TCGTCGGTTA CAGCACCGAC
AAGGCGGTGA CCACCGCGGC CTTCGCGCTC TCCGGCGGCC TCTACGTCGC CGATCTCAAG
ACCAAGAGGA TCCGGCGGTT GAAGACGGCC GGCCCGGTGA TCGACCCGCG GGTCTCCCCC
ACCGGCGAGC ACGTCGGCTA CGTGACCGGC GGCGCGCTCC ACGTCCAGGA CCTCGAGGAC
GAGGTCGACC ACGTGCTGGC CCGCCCCGAG TCCCCCGAGG TGACGTACGG CCTGGCCGAG
TTCATCGCCG CCGAGGAGAT GAACCGCATG CGGGGCTACT GGTGGTCCCC GGCGGGCGAC
GCGGTGCTGG CCGAGCGGGC CGACGACTCG CCGGTCGACA GGTGGCACAT CGCCGACCCG
GCCAACCCCT CCCGCCCGGC CGCCGAGCAG CGCTACCCCG CCGCCGGCAC GGCGAACACC
AGGGTCGAGC TGTTCCTCCT GGCCCTGGAC GGCTCCCGCA CTCCCGTGCC CTTCGAGGAC
GAATACCTGG TCACCGCCTC CTGGGACGCC CACGGCCTGG CGATCGTCAC GATGCCGCGC
GACCAGAAGA GCATCCGCCT GCTCAAGGTC GACCCGGCCA CGGGCGGCTC GACGGTCGTC
CGCGAGGACA CCGACCCCGC CTGGGTGGAC ATCGTCCCCG GCGTGCCCGG TCACCTGTCC
GACGGCACCC TGGTCTGGGC GGCGAACGTC GGCGGTGGCC ACCGCCTGAT CATCGGTGAC
GAGCCCGTCA CGCCGGCCAC CCTCCAGGTC CGCGAGGTCG TCGACGTCGA CGGTGACGCC
GTGCTCTTCC GGGCCGGCGG CGATCCCACC GAGATCGCCC TGTGGGCCTA CAAGGACGGC
CGCATCACGC TGGTCAGCCC GGCCGAGAGC GGCGTCTACA GCGGCCACAC CGCGGGCGGC
ACGCTGGTGG TCACCGGCCA GACCCTCGAC AGCGAGGGCC CGGTCACGCG CGTCCTCCAC
CACGGCAAGC CCCGCGGCCA CATCGCCTCG CACGCCGAGC GGCACGGCCT GGACCTGCGG
GTCTCCCTGA TCCGCGCGGG CGCGCGCGAC CTGGCGACCG CCGTGCTCTT CCCGTCGGAC
CACGTGCCCG GCTCGGCGAG GCTCCCGGTC CTCATGGACC CCTACGGCGG GCCGCACGCC
CAGCGCGTCC TGGCCGCCTC CGGCGCCTAC CTGACCAGCC AGTGGTTCGC CGACCAGGGC
TTCGCGGTGG TCGTGGCCGA CGGCCGCGGC ACCCCGGGGC GCGGCCCCGA GTTCGAGCGG
GCCGTCCTGC ACGACCTGGC CACCCCGCCC CTGGAGGACC AGGTCGACGC CCTCCAGGGC
GCCGCCGCGC GGTTCCCCGA CGACCTGGAC CTGTCCCGGG TCGGCATCCG CGGCTGGTCC
TTCGGCGGCT TCCTGGCCGC CCTGGCCGTG CTCCGCCGCC CGGACGTGTT CCACGCGGCC
GTCGCCGGAG CGCCGGTGAC CGACTGGCGT CTCTACGACA CCTGCTACAC CGAGCGCTAC
CTCGGCCGGC CGGAGGAGGG CCACTACGAA TCGTCGTCCC TGTTCGCCGA CGCCGAGAAG
CTGGACCGCC CCCTCCTGCT GATCCACGGC CTGGCCGACG ACAACGTGGT CGCCGCCCAC
ACCCTCCGCC TGTCCTCCGC CCTCCTGGCG GCGGGCCGCC CGCACAACGT CCTGCCGCTG
TCCGGCGTCA CCCACATGAC CCCCCAGGAG GTCGTGGCGG AGAACCTGCT CCTGCTCCAG
GTCGACTTCC TCAAGAAGGC CCTGGGCTGA
 
Protein sequence
MTGMSESFPR LHARTRRFTL GVPRGFTIAP GGDRVVFLRT KGGSDPVTCL WEFDVAGGKE 
RLIADPRALA VNEDDLPAEE RARRERSREQ AGGIVGYSTD KAVTTAAFAL SGGLYVADLK
TKRIRRLKTA GPVIDPRVSP TGEHVGYVTG GALHVQDLED EVDHVLARPE SPEVTYGLAE
FIAAEEMNRM RGYWWSPAGD AVLAERADDS PVDRWHIADP ANPSRPAAEQ RYPAAGTANT
RVELFLLALD GSRTPVPFED EYLVTASWDA HGLAIVTMPR DQKSIRLLKV DPATGGSTVV
REDTDPAWVD IVPGVPGHLS DGTLVWAANV GGGHRLIIGD EPVTPATLQV REVVDVDGDA
VLFRAGGDPT EIALWAYKDG RITLVSPAES GVYSGHTAGG TLVVTGQTLD SEGPVTRVLH
HGKPRGHIAS HAERHGLDLR VSLIRAGARD LATAVLFPSD HVPGSARLPV LMDPYGGPHA
QRVLAASGAY LTSQWFADQG FAVVVADGRG TPGRGPEFER AVLHDLATPP LEDQVDALQG
AAARFPDDLD LSRVGIRGWS FGGFLAALAV LRRPDVFHAA VAGAPVTDWR LYDTCYTERY
LGRPEEGHYE SSSLFADAEK LDRPLLLIHG LADDNVVAAH TLRLSSALLA AGRPHNVLPL
SGVTHMTPQE VVAENLLLLQ VDFLKKALG