Gene Sros_5196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5196 
Symbol 
ID8668490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5711128 
End bp5713212 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content68% 
IMG OID 
ProductV8-like protein Glu-specific endopeptidase-like protein 
Protein accessionYP_003340713 
Protein GI271966517 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.710887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.178037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATATAC AGGAAAAAGT CCAGGAATTG GGTTTAGGCC ATGCGCAGGC CGCTGCCGAC 
CGGTATCGCC AAAGTGACGG AGCCCGGGAG GAGGTTGAGC GACGGCGGGA TCAGGGCGCG
GTCTTCCCCG ACTCGCCGGA GGCGCTCGCG GCCCGCATCA CTCGCCTTAT CCAGCGGAAC
GGAGTGCCGG TCGAGGCGGT GCTGGAGACC ACCCGGGCCG AGTCCTTGGA CCTCCCCGAG
ATGCGCGAGC GTATTCTGGG GATTTCCAAG GATCTGCAGG CATGGAGCTT CCTGCCCCGT
GGTGCCCGTG CCGCTCGGAC CGTCGCGCGG ATCTCGGTCA GCGAAAACGG CCGTGAACTA
CCCGTCGGCA CCGGCTTCCT GGTGTCGCCG AGGTTGCTGC TGACCAACCA CCATGTATTC
CCCGACGTCG AGGCGGCGCA CCGGGCCTTC GTGGAGTTCG ACGCTCAGGT CACCATCGAC
AACACCCCGG AGCCGGCCAA ACGTTTCCGC CTGGATCCGG ATACCTTCTT CGTCGCCGAC
CAGGACCTGG ACTTCGCCTT GGTCTTGGTC GGCGCCGATG CCGCAGGCCG GCTCGCGGGT
GAGACGTTCG GCTGGAACAG GCTCAGTGTC CAACTGGGCA AACTGGTCAT CGGAGAATCA
GTCAACATCA TCGGCCACCC CCGTGGCCGG TTGAAGGAGA TCTCCATCCG CGAGAATCGG
CTGGAGAACC GTTGGGATGA CTTCATCCAC TATCGGACCG ACACCGAGCC CGGCAGTTCC
GGCTCCCCGG TCTACAACGA CCAGTGGGAA GTGACGGCGC TCCATCACAG CGGCGTGCCT
AAAACCGACA GCCAAGGCCG TATCCTGCGA CGGGACGACC GGGTTTGGCA GCCCGGCGAC
GGTGACGACG CCATTGAATG GATCTCCAAC GAGGGCGTGC GCATCAGCGT CATCCTCAAG
CATCTGGCCA CGCTGCCGCT CGATGACAGC CGCCGGGCGT TCCTGACCGA GATGGGTCCG
GAGTCCGGGC TTCAGGACGG TGGCGCCCCG CAACCGGCCG TCGCGGGATC CGGTGCGCCC
TTCGCCGCAC GCCCGCTCGC CGAACCAGCC GTCGAGGTCG TCACGGACCT GGCCGCGACA
CGACCTTCGA CTACCCCGGC GGCTGACGTC CGTCGCGGCC TGACCGCCGG CGCGGCCGCG
TTCGGTGGTG CCCGGCACCT GGTCTTCCTG CACGGCCGCG CTCAGCAGGG ACGTGACCCT
GAGCGGCTGC GCCGATACTG GACCGCCGGA TTGAACGGCG GGCTCACCCG CGCAGGGCTG
GCTACGATCG AACCAGCCGA TGTCTGGTGG CCCTTCTACG GCGACAGGCT TGTTCAGGCC
CTGCAACCCC GTGAGGCGAT CTTCCGCTCG CTGGAGCGGC TCGTCGATCC GGCAGCGGTC
ATCGCGCCGG ACTCCGACGC CGCCCGGCGG CTGTACGAGC AGCTGATCAC CGAGGCCGCC
ACCCAAGCAG GCATGCCCGC CGAAGCCCCG ACCTCCCTGG AAGGGCTGGA CCGGACGGCC
GACGCCGTGC ACCGGGGACT GAGCTGGCTG GCCGCCACCA CCAGCCTGGA CCGGCTGACC
ATCGCCACCT TCTTCACAGA CGTCGCCGCC TACCTCGGTG ACCCACAAGT TCGCGAGACG
GTCTTGGACT GCGTGCTCCA AACGATGCCC GCGACCGGCA CATTGGTGCT GGTCAGCCAT
AGCCTGGGCA CGGTCGTCGC CATGGACCTG CTCACCCGGC TTGATCTCGG AGTCGACGTC
GAGCTTCTCG TCACCGCCGG CAGCCCATTG GGCATGGACG GCGTGTACCG CCACCTGCTC
ACCGGCGGCC CCAAACGCCC CGAACGGGTG GCCCATTGGT TCAACGCCTG GTGCCCGATC
GATCCGGTCA CCATCGGGTG CCCTCTGGGC GACCACTGGC AGGGAGAGCT GGCCGAAACC
CCCGTCACCA ACCCCGCCGG CCGAACCCAC GACATCGAGG AGTATCTCGG CCACCCCGAG
GTCGCCCAAG TGATCGGTGC CCGGCTGTTC GGAGCCAGGC CTTGA
 
Protein sequence
MNIQEKVQEL GLGHAQAAAD RYRQSDGARE EVERRRDQGA VFPDSPEALA ARITRLIQRN 
GVPVEAVLET TRAESLDLPE MRERILGISK DLQAWSFLPR GARAARTVAR ISVSENGREL
PVGTGFLVSP RLLLTNHHVF PDVEAAHRAF VEFDAQVTID NTPEPAKRFR LDPDTFFVAD
QDLDFALVLV GADAAGRLAG ETFGWNRLSV QLGKLVIGES VNIIGHPRGR LKEISIRENR
LENRWDDFIH YRTDTEPGSS GSPVYNDQWE VTALHHSGVP KTDSQGRILR RDDRVWQPGD
GDDAIEWISN EGVRISVILK HLATLPLDDS RRAFLTEMGP ESGLQDGGAP QPAVAGSGAP
FAARPLAEPA VEVVTDLAAT RPSTTPAADV RRGLTAGAAA FGGARHLVFL HGRAQQGRDP
ERLRRYWTAG LNGGLTRAGL ATIEPADVWW PFYGDRLVQA LQPREAIFRS LERLVDPAAV
IAPDSDAARR LYEQLITEAA TQAGMPAEAP TSLEGLDRTA DAVHRGLSWL AATTSLDRLT
IATFFTDVAA YLGDPQVRET VLDCVLQTMP ATGTLVLVSH SLGTVVAMDL LTRLDLGVDV
ELLVTAGSPL GMDGVYRHLL TGGPKRPERV AHWFNAWCPI DPVTIGCPLG DHWQGELAET
PVTNPAGRTH DIEEYLGHPE VAQVIGARLF GARP