Gene Sros_5147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5147 
Symbol 
ID8668441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5659831 
End bp5661075 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content71% 
IMG OID 
Productphosphoenolpyruvate synthase 
Protein accessionYP_003340668 
Protein GI271966472 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGTAC CTCTTATGCA GGCCGGCGCC GAGACCTGCG GGGGCAAGGC CGGCGCACTC 
GGCGCGTTGC TCCGCGCGGG TCTGCCGGTC CCTGACGGCT TCGTCATCCC GTTCGCCGCT
TACCTCGCCG CTGTCCGCGA CCTGGACCTC GGGCGGTTCG CCGACGAGTC GGACGATCTT
GACGCGACGC GGCGGGCGAT CGAGGCCCGC CCGGTCCACG CCACCGTGAT CGATGCGCTG
GGACGCGCAC TCGACGAGCT CGGCGATCCA CCCGTGGCGG TGAGATCATC GGCAGCAAGC
GAAGACACCG GTCAGGCATC GGCGGCCGGT CAGCACGAGA GCTTTCTCGC TGTGCACGGA
GTCAGCGAGG CCGCCGACGC CGTACGCGCC TGCTGGGCCT CGCTGTTCTC TCCACGTGCC
ATCGACTACC GGGGGGCCTC CGGCCGCGAC GACCGGCCAT CCGACGATCT TGTGATGGCC
GTCATCGTCC AGCGCCATCT GGATGCCGAG GTGTCCGGGG TCATGTTCAC ACCCGCGGAC
CCGGACGGCG CGACCGAGAT CGAGGCGTCC TGGGGCCTCG GCCCCAGCAT CGTCGGGGGC
AAGGTCACCC CTGACGCCTA TCGCGTCGCC GAGGACGGGT CGGTCACACG CACCGTCGCA
GACAAACGGA CCCGCCTTGA CCGGCGCGGC ACGCAGCTCG TCATTCGCGA CGTGCCCACC
CCTGCCCGGA ACCAACCGAC GCTCGACGAC GCGACCGCCA CGCGGCTGGC CAAGCTGGGC
GAGAAGATTG CCGCCGTACT CGGTGGACCG CAGGACATCG AGTGGGCGAT CGCCGACGGC
CGCACCTGGG TTCTGCAAGC ACGGCCGGTC ACAGCCGCAC CCCCACCGCC ATCGCTTTCC
GGCGCCTCGG ACACCCCAGC CGCCGCACTC ACCGGAACAC CAGGCAGCCG CGGGACCGTG
ACCGGCACCG CGAGGATCGT CCGCGGTCCC GGCGACTTCG CGCGCGTGCA CCCGGGCGAC
ATCCTCGTCT GCCCTTTCAC CGACCCCGCT TGGACGCCGC TGCTGCGCAT CGCCGCCGGC
GTCGTCACCG AAACCGGAGG CGTGCTCTCC CACGCCGCGA TCGTCGCTCG CGAGCACGCC
ATCCCCGCCG TCCTCGGCAT CCCGAACGCG ACGAGCAGGC TCCACGACGG CACCGTCATC
ACCATCGACG GCACCACCGG CACCGTCACG GCGACAAACG CGTGA
 
Protein sequence
MIVPLMQAGA ETCGGKAGAL GALLRAGLPV PDGFVIPFAA YLAAVRDLDL GRFADESDDL 
DATRRAIEAR PVHATVIDAL GRALDELGDP PVAVRSSAAS EDTGQASAAG QHESFLAVHG
VSEAADAVRA CWASLFSPRA IDYRGASGRD DRPSDDLVMA VIVQRHLDAE VSGVMFTPAD
PDGATEIEAS WGLGPSIVGG KVTPDAYRVA EDGSVTRTVA DKRTRLDRRG TQLVIRDVPT
PARNQPTLDD ATATRLAKLG EKIAAVLGGP QDIEWAIADG RTWVLQARPV TAAPPPPSLS
GASDTPAAAL TGTPGSRGTV TGTARIVRGP GDFARVHPGD ILVCPFTDPA WTPLLRIAAG
VVTETGGVLS HAAIVAREHA IPAVLGIPNA TSRLHDGTVI TIDGTTGTVT ATNA