Gene Sros_8934 details

Gene Information

Locus tag: Sros_8934
Symbol:
ID: 8672272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 9868512
End bp: 9870143
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: ABC-type oligopeptide transport system periplasmic component-like protein
Protein accession: YP_003344309
Protein GI: 271970113
COG category:
COG ID:
TIGRFAM ID:
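
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. The sketch below assumes the start and end positions are 1-based and inclusive (a common convention for such records, but not stated in this entry) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(9868512, 9870143)
print(length)                  # 1632
print(protein_length(length))  # 543
```

Both values match the Gene Length and Protein Length fields of this record.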


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTTA CTAAGGGCGC ACGGATCACC GCCGGCACCG CGCTGCTCGC TCTGGGGCTG 
GCCGCGTGCG GCAGCCAGGG CTCCACGGGC GGTGGGACGG CCTCCGCCGA CCAGCCCGTG
CGCATGGAGC TCGGCGAGCC CCAGAAGCTC TTCTACCCCG GTGACACCAC CGAGTCCGAG
GGCTCCGAGG TGCTGGCCGC CGTCTTCGCT CCGCTGGTGA GCTACGACGA GAACAAGCAG
GTCGTCAACG ACGTCGCCGA GTCGATCAAG ACGACCGACA ACAAGACGTG GACGATTGAG
CTCAAGCCGG GCTACACCTG GCACAACGAC GAGCCGCTGG TCGCGCAGAA CTACGTCGAC
GCGTGGAACT TCGCCGCCAA CCAGGACAAC GCCCAGGGTG CCAACGGCTT CTTCAGCCGC
GTCGAGGGCT GGGCCGACCT GAACCCGGGC GAGGGCAAGA CCGTCTCCAC CAAGGAGATG
AAGGGCCTCA AGGCCGTCGG CGAGAGCACG CTCAAGGTCA CCCTGACCAA GCCGTTCTCG
CAGTTCAAGA CGATGCTGGG CTACACGTCG TTCTACCCGC TGCCCAAGGC CGCCTTCGGT
GAGGACGGCA AGGTCACCGA GGCGTACGCC AAGCAGCCGA TCGGGCAGGG CTACTTCAAG
TTCGACAAGC CCTACAACAA GGGCACCGAC CAGGCGATCG ACCTGACCCG GTACGACAAG
TTCCCCGGGG ACAAGCCGAA GTTCGACAAG CTCCAGTTCA AGCTCTACGC CAGCGCCGAG
ACCGCGTTCA ACGACCTGCG CGCGGGCAAC CTGGACGTCC ACGACTCGCT GCCCCCCTCG
GCGATCGCCA GCGCCAAGGC CGAGCTCGGC GAGCGCTACA TGGACGAGGC CGACGCCGGC
GTCGGCTACA TCGGCTTCCC GATGCAGTAC AACAAGACCT ACGCGAACGT GAAGGTCCGC
GAGGCCATCT CCCTGGCCAT CGACCGCAAG ACGATCGCCG AGACGGTCTT CTCGGGCACC
CGCGCCCCGG CCGACGACTT CATCAACCCG CTGCTCGACG GCTACCGTCC GGGCGCCTGC
GCGGTCTGCA CCTACGACCC GGCCAAGGCC AAGACGCAGT ACGCCGACAA CGGTGGCCCG
AAGACGCTGG AGCTGGGCTA CAACTCCGAC GGCCCGCACA AGGAGTGGAT CGAGGCGGTC
GCCAACAACC TCCGCGCCAA CCTCGGCGTC CAGGTCACGG TGAAGCCGTT CGAGAAGTTC
GCCTCGATCC TCGACGAGCT CGACAAGAAG ACCTACGGCG GCATGTTCCG CATGGGCTGG
GCGATCGACT ACCCGTCCGC GGAGAACTAC CTGACCCCGG TCTTCTCCAC CGTCGCGATC
AAGACCGGCT CCAACTACGC CGGCTGGTCC AACAAGGCGT TCGACGACCT CCTCGCCAAG
GGCGACAGCG CCGCGACGCA GGCGCAGGGC CTGAAGTACT ACCAGCAGGC CGACGACATC
CTGATCAAGG AACTGCCGTA CATCCCGGTG TACTTCTACC GGACGAACGC CGCGTTCTCC
CAGCATGTCA AGGGCATCAA GATCAACCTC CTCAACCAGG TCGAGTGGGC CCAGGTGGAG
AAGGTCGCCT GA
 
Protein sequence
MRVTKGARIT AGTALLALGL AACGSQGSTG GGTASADQPV RMELGEPQKL FYPGDTTESE 
GSEVLAAVFA PLVSYDENKQ VVNDVAESIK TTDNKTWTIE LKPGYTWHND EPLVAQNYVD
AWNFAANQDN AQGANGFFSR VEGWADLNPG EGKTVSTKEM KGLKAVGEST LKVTLTKPFS
QFKTMLGYTS FYPLPKAAFG EDGKVTEAYA KQPIGQGYFK FDKPYNKGTD QAIDLTRYDK
FPGDKPKFDK LQFKLYASAE TAFNDLRAGN LDVHDSLPPS AIASAKAELG ERYMDEADAG
VGYIGFPMQY NKTYANVKVR EAISLAIDRK TIAETVFSGT RAPADDFINP LLDGYRPGAC
AVCTYDPAKA KTQYADNGGP KTLELGYNSD GPHKEWIEAV ANNLRANLGV QVTVKPFEKF
ASILDELDKK TYGGMFRMGW AIDYPSAENY LTPVFSTVAI KTGSNYAGWS NKAFDDLLAK
GDSAATQAQG LKYYQQADDI LIKELPYIPV YFYRTNAAFS QHVKGIKINL LNQVEWAQVE
KVA
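
The sequences above are printed in blocks of ten characters, so whitespace has to be stripped before they can be processed. The following is a minimal sketch, not the method used to produce this record: it cleans a pasted sequence, computes its GC fraction, and translates it codon by codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch ignores). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a pasted sequence and uppercase it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq_text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence listed above.
print(translate("ATGCGCGTTA CTAAGGGCGC ACGGATCACC"))  # MRVTKGARIT
print(translate("AAGGTCGCCT GA"))                     # KVA
```

The two printed fragments correspond to the beginning and end of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked the same way.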