Gene Sros_5272 details

Gene Information

Locus tag: Sros_5272
Symbol:
ID: 8668566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5789757
End bp: 5790866
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: O-succinylbenzoate synthase
Protein accession: YP_003340784
Protein GI: 271966588
COG category:
COG ID:
TIGRFAM ID:
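
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 5789757
end_bp = 5790866
gene_length_bp = 1110
protein_length_aa = 369

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, total_codons, coding_codons)  # 1110 370 369
```

Under these assumptions the reported gene length (1110 bp) and protein length (369 aa) agree with the coordinates.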


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.132905
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.154235
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGCTCG ACGCCGTCGA ACTGCGCGAG GTCGCGCTGC CGCTGGTCAG CCCGTTCCGT 
ACCTCGCTCG GCACCCAGAC GGTACGCACC GCGCTGCTCG TGCGAGTGCT CAGCGACGAG
GGCGAGGGCT GGGGCGAGTG CGCGGCCGAG GACGAGCCGA CATACTGTCC CGAATACCTC
GCCGGGGCGG CCGACGTGAT CAAGCGGTTC ATGCTCCCGG CGCTGGCTCC GCTCGACCTG
GAACCGGCGG CGGTCGGCCC CGCGCTGCGG CACCTGCGCG GGCACCCGAT GGCCAGGGCG
GCCGTGGAGA CGGCGGTGCT GGACTGCTGG CTGCGGGCAC GACGCACGAG CCTGGCCGCT
CACCTGGGAG GGGTACGGCG GCGCGTGCGC GTGGGCGTTT CCGTGGGCAT CACCGAGACT
GTCGACGCGC TGCTGGACAC GGTCGACCGG CATCTGACCG CCGGCTACGC CAGGATCAAG
CTGAAGATCG AGCCCGGCTG GGACCTCGAA CCGGTCAGGG CGGTGCGCGA GACGTTCGGC
GCCGGCGTGA TGCTCACCGT CGACGCCAAC ACCGCCTACC AGCCCGGTGA CCTCAGGCAC
CTGGCGAAAC TCGACGCGTA CGGGCTCGCC CTGGTCGAGC AGCCCTTCGC CCCCGACGAC
CTGCACGCGC ACGCCGCGCT CGCCGCCCGC ATGGACACGC CCGTCTGCCT GGACGAGAGC
ATCACCAGCG CGCGCGACGC GGCCGAGGCC ATCAGCCGGG GTGCCTGCTC GATCATCAAC
ATCAAGCCCG GCCGCGTCGG CGGCTACCTG GAGGCGCGCC GCATTCACGA CCTGGCTCAG
GCGAACGGCG TGCCGGTCTG GTGCGGCGGC ATGCTGGAGA CCGGCCTCGG CCGGGCCGCC
AACCTGGCGC TGGCCAGCTT GCCCGGCTTC ACGCTGCCGG GCGACATCTC CGCCACCGAG
CGGTACTACC ACCGCGACAT CACCCGCCCG TTCGTGCTCG ACGGCGGCGA GCTGCCGGTT
CCGGTCGAGC CCGGCCTCGG CGTACTGCCC GAGGAAGCGG ACCTGCAGGC ATGCACGACG
GCGGTGGAGA CGGTGAGCTG CACATTCTGA
 
Protein sequence
MKLDAVELRE VALPLVSPFR TSLGTQTVRT ALLVRVLSDE GEGWGECAAE DEPTYCPEYL 
AGAADVIKRF MLPALAPLDL EPAAVGPALR HLRGHPMARA AVETAVLDCW LRARRTSLAA
HLGGVRRRVR VGVSVGITET VDALLDTVDR HLTAGYARIK LKIEPGWDLE PVRAVRETFG
AGVMLTVDAN TAYQPGDLRH LAKLDAYGLA LVEQPFAPDD LHAHAALAAR MDTPVCLDES
ITSARDAAEA ISRGACSIIN IKPGRVGGYL EARRIHDLAQ ANGVPVWCGG MLETGLGRAA
NLALASLPGF TLPGDISATE RYYHRDITRP FVLDGGELPV PVEPGLGVLP EEADLQACTT
AVETVSCTF
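
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool that produced this record. It assumes NCBI translation table 11 assigns the same amino acids as the standard code, and that an alternative start codon such as GTG is read as methionine (M) when it is the first codon. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (assumed shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: start codons permitted by table 11; each is read as M at position 0.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the 10-base layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for index, pos in enumerate(range(0, len(seq) - 2, 3)):
        codon = seq[pos:pos + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence listed above.
first_block = "GTGAAGCTCG ACGCCGTCGA ACTGCGCGAG"
last_block = "GCGGTGGAGA CGGTGAGCTG CACATTCTGA"
print(translate(first_block))  # MKLDAVELRE
print(translate(last_block))   # AVETVSCTF
```

The two fragments reproduce the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare with the reported GC content.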