Gene Sros_4939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4939 
Symbol 
ID8668233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5466693 
End bp5469290 
Gene Length2598 bp 
Protein Length865 aa 
Translation table11 
GC content69% 
IMG OID 
Productphosphoenolpyruvate synthase 
Protein accessionYP_003340489 
Protein GI271966293 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0930681 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGC AGTACGTGTT GGATCTTCAA GAGGTTGACG AGACGCAGGT CGCGGTCGTT 
GGCGGCAAGG CCGCGCACCT GGGCGGGCTG TCGCGGATCG AAGGCATCCG CGTGCCGGCT
GGGTTTTGCG TGACGACGGC CGCCTTCCGG CGGATCATGG CGGAAGCACC GTCGATCGAC
GATCGGCTCG ATCAGCTGTC GCGCCTGAAC CCGGACGACC GGGAGGCGAT CCGCACGCTC
AGCGCGGAGA TCCGCCGGAC CATCGAGGGG ATTGCCATCC CGGGCGATCT GGCGGCGGCG
ATCACCCGCG CGCTCGCCCA GCTCGGCGAT CAAGCCGCCT ACGCCGTACG ATCCAGCGCG
ACGGCAGAGG ACATGCCGAC GGCCTCCTTC GCGGGCCAGC AGGACACGTA CCTGAACGTC
ATGGGGCCGG CGGCGATATT CCAGCACATC AGCCGGTGCT GGGCGTCGCT GTTCACCGAG
CGGGCCGTGA CCTACCGCGT GCGGAACGGC TTCGACCACC GGAAGGTTCA CATGGCCGTG
GTCGTGCAGC AGATGGTCTT CCCGGATGCG GCCGGCATCC TGTTCACGGC CGACCCCGTC
ACGGGCAACC GGAAGGTCGC CACCGTGGAC GCCGGCTTCG GGCTCGGCGA GGCCCTGGTC
TCCGGCCTGG TGAACCCGGA CGTCTTCAAG GTGCGCGACG GCGAGGTCGT CGCCGGGACG
GTCGCCGCCA AACAGCGTGC CGTTCACGCC CTGCCGGCTG GCGGGACGCA GGAGGTGGCG
ATCGACCCGC AGCGGCAGGG GCAGCCGGCG CTGACGGATG CGCAGGTCGT GCGGCTCGTG
CAGCTCGGGC GGCGGATCGA AGCGCATTTC GGCCGCCCGC AGGACATCGA ATGGTGCCTG
GTCGGCGATG ACTTCCGGAT CGTGCAGAGC CGGCCAATCA CCACGCTGTT CCCCATCCCC
GCGGCCCGCG ACCGGGAGAA CCACGTCTAC CTCTCCGTTG GTCATCAGCA GATGATGACC
GACCCCATGA AGCCACTGGG GCTCTCCGTG TGGCAGCTGA CGGCCATGGC GCCGATGCAC
GAGGCCGGCG GGAGGCTGTT CGTCGACGTC ACCCGGCACC TGGCCTCGCC CGCGAGCCGC
GCCGGACTCC TGGAGATGGC GGAGAGGTCC GATCCGCTGA TCAGGGACGC GCTGGAGACC
GTCCTCGACC GCCACGACTT CGTCCCTGCG CTCCCGGACG GCGGTCCCGG CGGGCCGCCG
GTCGGCGGCG CGTCCGCCCC GATCGAGACC GATCCGGCCG TCGTCACCGA GCTGATCGAG
CGCAGCCAGG CGTCCATCGC CGCCCTGCGG CGCGACATCC GGACGAAGAC CGGACCGGCG
CTGTTCGACT TCCTGCCGGA GGCCTTCCAG GAGCACAAGC GGGTCCTCAG TGATCCGTTG
AGCATGCAGG CGATCATGGC GGGGATGGAG GCCACCTGGT GGCTCAACGA CCGGCTGCGG
GACTGGCTGG GCGAGAAGAA CGCGGCCGAC ACCCTCACGC TGTCCGCCCC CGGCAACGTC
ACGTCGGAAA TGGGACTGGC GCTGCTCGAC GTCGCGGACG TGATCCGCCC GCATCCGGAG
GTGGTGGCGT TCCTGCAGGG CGTCGAGGCC GACGGCTTCC TGGACGAGCT GCCGAAGCTC
GCGGGCGGGA CCGAAGCGCG CGACGCCATC GAGGCCTACC TCGACCGGTA CGGCATGCGC
TGCGTCGGCG AGATCGACAT CACGAGGCCG CGTTGGCGCG AACGCCCCAC CACGCTCGTG
CCCGTGATCC TCGACAACGT CAGGAACTTC GAGCCGGGCG CCGCCGAGCG GCGCTTCGAG
CAAGGGCGGC AGGAGGCGCA GAAGAAGGCG CAGGAGGTGC TCGAACGCCT GCGGGCACTG
CCGGACGGGG AGCAGAAGGC CGACGAGACC AAGCGGATGA TCGACCGGGT CCGGACCTTC
ATCGGCTACC GGGAGTACCC GAAGTACGGC ATCGTCAGCC GCTACTTCGT CTACAAGCAG
GCCTTGCTGG AAGAGGCCGA GCGCCTCGTG CAGGCCAACG TGCTCCCCGA GAAGGAGGAC
GTCTTCTACC TCACGTTCCA GGAGCTCCAC GACGTCGTGC GCTCGAACCA GGTGGATGAC
GAGCTGATCC GGCAGCGTAA GGACGCGTTC CGGTCGTACC ACGCGCTCAC GCCGCCCCGG
GTGCTGACAT CGGATGGCGA GGTCATCACC GGGGCGTACC GACGCGACGA CGTGCCGACC
GGCGCCCTGA TCGGCCTACC GGTCTCCGCC GGGACCATCG AAGGGCGCGC CCGCGTCATC
CTGGACATGG CCCAGGCTGA TCTCGAACCG GGTGACATCC TCGTCACGGC CCACACGGAC
CCGAGCTGGA CGCCCCTCTT CGTCGCCGTC ACAGGCCTGG TGACGGAGGT CGGAGGATTG
ATGACACACG GCGCAGTGAT CGCACGGGAG TACGGCCTGC CGGCTGTCGT CGGTGTGGTG
GATGCAACCC GGCTGATTCC CGATGGGCAG CGGATCCGCG TGCACGGAAG CGACGGGTAC
GTCGAGATTC TGCCTTGA
 
Protein sequence
MIEQYVLDLQ EVDETQVAVV GGKAAHLGGL SRIEGIRVPA GFCVTTAAFR RIMAEAPSID 
DRLDQLSRLN PDDREAIRTL SAEIRRTIEG IAIPGDLAAA ITRALAQLGD QAAYAVRSSA
TAEDMPTASF AGQQDTYLNV MGPAAIFQHI SRCWASLFTE RAVTYRVRNG FDHRKVHMAV
VVQQMVFPDA AGILFTADPV TGNRKVATVD AGFGLGEALV SGLVNPDVFK VRDGEVVAGT
VAAKQRAVHA LPAGGTQEVA IDPQRQGQPA LTDAQVVRLV QLGRRIEAHF GRPQDIEWCL
VGDDFRIVQS RPITTLFPIP AARDRENHVY LSVGHQQMMT DPMKPLGLSV WQLTAMAPMH
EAGGRLFVDV TRHLASPASR AGLLEMAERS DPLIRDALET VLDRHDFVPA LPDGGPGGPP
VGGASAPIET DPAVVTELIE RSQASIAALR RDIRTKTGPA LFDFLPEAFQ EHKRVLSDPL
SMQAIMAGME ATWWLNDRLR DWLGEKNAAD TLTLSAPGNV TSEMGLALLD VADVIRPHPE
VVAFLQGVEA DGFLDELPKL AGGTEARDAI EAYLDRYGMR CVGEIDITRP RWRERPTTLV
PVILDNVRNF EPGAAERRFE QGRQEAQKKA QEVLERLRAL PDGEQKADET KRMIDRVRTF
IGYREYPKYG IVSRYFVYKQ ALLEEAERLV QANVLPEKED VFYLTFQELH DVVRSNQVDD
ELIRQRKDAF RSYHALTPPR VLTSDGEVIT GAYRRDDVPT GALIGLPVSA GTIEGRARVI
LDMAQADLEP GDILVTAHTD PSWTPLFVAV TGLVTEVGGL MTHGAVIARE YGLPAVVGVV
DATRLIPDGQ RIRVHGSDGY VEILP