Gene Sros_2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2202 
Symbol 
ID8665484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2373514 
End bp2374965 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content73% 
IMG OID 
Productputative O-succinylbenzoate--CoA ligase 
Protein accessionYP_003337928 
Protein GI271963732 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAACC TGACCGTTCC CCAATTGCTC AGAGACCGGG CCTCCGCCGA CCCCGACGGG 
GTCGCGCTGA TCGTGCACGG CTCGGCACCG CTCACCTATC GCCAGTGGTA CGAGCGGTCC
GCTGTGATCG CGAACGGCCT GGTCGCGCGC GGTGTCACCA AGGGCGATCG GGTGGGTCTG
GTCTTCGGGA GCGCGGACTG GGCGGACTAC GCGGTGGCGT TCTGCGGGGT ACTGGCGGCC
GGAGCCGCCG CGGTGCCGCT GTCGGACCGC CTCGCCTCCG CTGACCTGCG CTTCATGCTG
GAACACTGCG GTGCGGCCGG AGTGATCCAT GCCGGCGGGC TCACCGTGCC CGAGATCGGA
TGGAGCGCCA CGCTCGACGG CCTGCGGACC GAAACGTCCT ATGAGACGGC CGATCCCGCG
CCCGGAGACC TCGCGCAGAT CCTCTACACC TCCGGCACGA CGGGCCGCCC CAAGGGGGTG
TCGGCCACCC ATGCCAACCT GGCCTACGGC TGCGGCGCCA GGCGCCGCCC GTTCGCGCAC
TCCGCGCACC TGCTCCACGC CTTCCCGATC GGCACGAACG CCGGCCAGGC CATGCTGATC
AACGCGCTCG ACGCCCACCC GGCCGTGGTG ACGCTGCCGC GCTTCACCCC CGGCCGCTTC
GCCGCGCTGA TCGAGTCCTA CCGGGCGGGG ACGGTCTTCC TCGTCCCCGC GATGGCCATC
GAGCTGATCA ACGCCGGAGT CGGCGAGCGG CACGACCTGT CCGGCGTGCT GCTGCTCGGC
TCGGCCGCGG CGCCGCTGCC CTCGGCCGTC GCCCAGGCGC TGACCACGGC CTTCCCCAAC
GCGACGATCA CCAACTACTA CACCTCGACC GAGGCCGCCC CCGCTCAGAC GATCATGATG
TTCGACCCGG AGCGGCCCGG CAGCGTCGGC CGCGCGGTGG CCGGCGGACG GGTCAGGATC
GCCACCGCGG AAGGCACGCC GCTCGCGCCG GGGGAGACCG GCGAGGTCTG GATGCGGTCC
CCGGCCGTGC CGCGCGCCTA CTACGGCGAT CCCGGGAGCG GGACCTTCCG CGACGGCTGG
GTGCGGATGG GCGACCTCGG CCGCATCGAC GCGGAGGGCT ACCTCTACCT CGTCGACCGG
GAGGGCGACG TGGTGAAGTC GGGCGCCTAC AAGGTCTCCA CGATCCACGT CGAGGAGGCC
GTCTACCAGC ACCCCGACGT GGTCGAGGCG GCGGCCTTCG GGGTCCCGCA CCCCGTGCTC
GGAACCGTCG TGGCCGTCGC CATGGCCACC CGAGCCCCGC TCACCCCCGA GGAACTGCGG
ATCTTCCTCA AGGACCGGCT CGCCCCGCAC GAGCTGCCCG CGCACGTGAC CACCGTCGAC
GCCCTGCCGC GCAACAACGG CGGGAAGGTC GACAAGCGGG CCCTGCGCCG GAGCATGGAG
GACGTGCGAT GA
 
Protein sequence
MVNLTVPQLL RDRASADPDG VALIVHGSAP LTYRQWYERS AVIANGLVAR GVTKGDRVGL 
VFGSADWADY AVAFCGVLAA GAAAVPLSDR LASADLRFML EHCGAAGVIH AGGLTVPEIG
WSATLDGLRT ETSYETADPA PGDLAQILYT SGTTGRPKGV SATHANLAYG CGARRRPFAH
SAHLLHAFPI GTNAGQAMLI NALDAHPAVV TLPRFTPGRF AALIESYRAG TVFLVPAMAI
ELINAGVGER HDLSGVLLLG SAAAPLPSAV AQALTTAFPN ATITNYYTST EAAPAQTIMM
FDPERPGSVG RAVAGGRVRI ATAEGTPLAP GETGEVWMRS PAVPRAYYGD PGSGTFRDGW
VRMGDLGRID AEGYLYLVDR EGDVVKSGAY KVSTIHVEEA VYQHPDVVEA AAFGVPHPVL
GTVVAVAMAT RAPLTPEELR IFLKDRLAPH ELPAHVTTVD ALPRNNGGKV DKRALRRSME
DVR