Gene Sros_4019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4019 
Symbol 
ID8667313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4472939 
End bp4474594 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content75% 
IMG OID 
Product2-succinyl-5-enolpyruvyl-6-hydroxy-3- cyclohexene -1-carboxylic-acidsynthase 
Protein accessionYP_003339670 
Protein GI271965474 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.207997 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0874374 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCCCG CCACCGCGCT CGCGACGGTG CTGATCGACG AGCTCGTGCG CTGCGGCCTT 
ACGGATGTGG TCCTCGCGCC CGGCTCACGC TCGGCCCCGC TGGCCCTGGC CGTCCACGCC
GACAGCCGGA TCCGGCTGCA CGTGCGCGTC GACGAGCGCT CGGCCTCCTT CCTCGCGCTG
GGCCTGGCCC GGCGCGGCGA GCGCCCGGTC GCCCTGATCT GCACCTCCGG CACCGCGGCC
GCGAACTTCC ACCCGGCGGT CATCGAGGCG CACGAGTCGG GGGTGCCGCT GCTGCTGCTG
ACCGCCGACC GGCCGCCCGA GCTGCGCGAC ACCGGCGCCA GCCAGACCAT CGACCAGATC
AAGCTGTACG GTACGGCGGT CCGCTGGTTC AGCGAGGTCG GCCTGCCCGA GGACCGTCCC
GGCCAGGTCG CCTACTGGCG GTCGCTGGCC TGCCGCGCCT ACCAGCGCTC GCTGGGCCCG
ACCGACCCCG GCCCGGTCCA GCTGAACCTG TCCTTCCGCG AGCCGCTGAT CCCCGACGGC
GACGCCTCCT GGTGCGAGTC CCTGGAGGGT GACGCCAACG GGCCCTGGAT CCGGGCCCGG
GTGGCGCCCC CCGCGGTGGC GCTGCACCTG CCGCCGACCC GGCGGGGCGT GCTGGTCGTC
GGCGACGGCG CCTCCAACGT GCGCAGATAC GTCGCGGCGG CGGGCATGGC CGGCTGGCCG
GTCCTGTCGG AGCCGAACGG CGGCGCCCGC TACGGCGACC ACGCAATGTC GACCTACCAC
TTCCTGCTCG GCACCCCCGA GTTCGCCGAC CGGCACCGGC CCGAGCTGGT GGTCACCCTG
GGCCGCCCGG GCCTGTCCCG GTCGCTGCTG AACTGGCTGA GGCACGCCGA CGAGCACATC
GTGGTCGCCC CCGACCTGAC CCGCTGGCCC GACCCGACCC GCTCGGCCAC CCAGGTGGCG
CAGGCGGTGG AGATCCCGGT CGCGGCCGGC GACGACGCCT GGCTGCACTC CTGGCGCCGC
GCCGACAACG CGGCCAGGGC CGCGATCGAC GAGGTGCTGG ACGGCGCCGG GCTCAGCGAG
CCGCGCCTGG CCCGCGACCT GGTGGACATG CTGCCCAACG GGTCGCTGCT GTTCTCCGGC
TCCTCCATGC CCATCCGCGA CCTGGACCAG GCGATGCGCC CCCGCAGGGG CCTGCGGATC
CTGGCCAACC GGGGCACCGC CGGGATCGAC GGCGTGGTCT CCACCGCGAT GGGCGCGGCC
CTGGCGCACA ACGGCCCCGC CTACGCGCTG ATGGGCGACC TGACCTTCCT GCACGACCAG
AACGGGTTAA TCCTCGGCCC CCGCGAGCCC CGCCCCGACC TGTGCGTGGT CGTGGTGAAC
AACGACGGCG GCGGGATCTT CTCGCTGCTG CCCCAGGCCG CGCTGCGCGA CCCGTTCGAG
CGGGTCTTCG GCACCGCGCA CGGGGTGGAC CTGGCCCACG TGGCCGCCGC CACCGGCACT
CCGTACACCT TCGTCAGCGA GCCCGACCAG CTCTCCAAGG CGCTGCGCGG GGAGGGGCTG
CGGATCGTCG AGGTCCGCAG CGACCGGGAG TCCAACGCGG TCCTGCACGC CATGATGCGC
GACGCCGCCC ACGCCGCGAT CCGCGAGGTC ATGTGA
 
Protein sequence
MNPATALATV LIDELVRCGL TDVVLAPGSR SAPLALAVHA DSRIRLHVRV DERSASFLAL 
GLARRGERPV ALICTSGTAA ANFHPAVIEA HESGVPLLLL TADRPPELRD TGASQTIDQI
KLYGTAVRWF SEVGLPEDRP GQVAYWRSLA CRAYQRSLGP TDPGPVQLNL SFREPLIPDG
DASWCESLEG DANGPWIRAR VAPPAVALHL PPTRRGVLVV GDGASNVRRY VAAAGMAGWP
VLSEPNGGAR YGDHAMSTYH FLLGTPEFAD RHRPELVVTL GRPGLSRSLL NWLRHADEHI
VVAPDLTRWP DPTRSATQVA QAVEIPVAAG DDAWLHSWRR ADNAARAAID EVLDGAGLSE
PRLARDLVDM LPNGSLLFSG SSMPIRDLDQ AMRPRRGLRI LANRGTAGID GVVSTAMGAA
LAHNGPAYAL MGDLTFLHDQ NGLILGPREP RPDLCVVVVN NDGGGIFSLL PQAALRDPFE
RVFGTAHGVD LAHVAAATGT PYTFVSEPDQ LSKALRGEGL RIVEVRSDRE SNAVLHAMMR
DAAHAAIREV M