Gene Sros_4439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4439 
Symbol 
ID8667733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4954840 
End bp4956270 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content71% 
IMG OID 
Productadenylosuccinate lyase ; K01756 adenylosuccinate lyase 
Protein accessionYP_003340052 
Protein GI271965856 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00550594 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000234914 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACTGCCA AGCCGCGTAT CCCCAATGTC CTGGCCGCCC GCTATGCCTC GGCGGAGCTG 
GCCCGTCTGT GGTCTCCCGA GTACAAGGTG GTGGCCGAGC GCCGCCTGTG GCTGGCGGTG
CTGGCCGCGC AGGCGGAGCT GGGAGTGTCC GTCGCCCCGC AGGCGGTGGC CGACTACGAC
AAGGTCGTCG AGCAGGTGGA CCTGGCCTCG ATCGCCGAGC GGGAGCGGGT CACCCGCCAT
GACGTGAAGG CCCGGATCGA GGAGTTCAAC GCCCTGGCCG GGCATGAGCA CGTGCACAAG
GGCATGACGT CGCGGGATCT GACCGAGAAC GTCGAGCAGT TGCAGATCCG CGACAGCCTG
CTGGTGGTGC GTGATCGGTG TGTGGCGTTG CTGGCGCGGC TGGGCGGGCT GGCGGCCGAG
CATTCGGGCA CGGTGATGGC GGGCCGGTCG CACAACGTGG CCGCGCAGTC GACCACGCTG
GGCAAGCGTT TCGCCTCGGC GGCCGATGAG CTGCTGGTGG CCTTCGCCCG GTTGGAGGAG
CTGATCGCCC GTTACCCGCT GCGCGGTATC AAGGGTCCGG TCGGCACCGC TCAGGACATG
CTGGATCTGC TGGGCGGCGA CCGGGGCAGG CTGGCGGAGC TGGAGGATCG GGTGGCCGGG
CACCTGGGGT TCGCCCGGCG CCTGACCAGC GTGGGGCAGG TCTATCCCCG GTCGCTGGAC
TACGAGGTGG TCACGGCGCT GGTGCAGCTG GCGGCCTCTC CTTCCTCGCT GGCCAAGACG
GTCCGGCTGA TGGCCGGGCA CGAGCTGGTC ACCGAGGGGT TCGCCGAGGG GCAGGTGGGG
TCTTCGGCGA TGCCGCACAA GATGAACACC CGTTCCTGTG AGCGGGTCAA CGGGCTGACG
GTGGTCTTGC GCGGTTACGC CTCCATGGTC GGGGAGCTGG CGGGCGACCA GTGGAACGAG
GGGGACGTGT CGTGCTCGGT GGTGCGGCGG GTGGCGCTGC CGGATGCGTT CTTCGCCTTC
GACGGGCTGG TCGAGACGAT GCTGACGGTG CTGTCGGAGT TCGGCGCGTT CCCGGCGGTC
ATCGCCGCCG AGCTGGACCG GTATCTGCCG TTCCTGGCGA CGACGAAGAT GCTGATGGCC
GCGGTGCGGG CCGGGGTGGG CCGGGAGAGC GCCCATGAGC TGATCAAGGA GCACGCGGTC
GCCTCGGCGC TGGCGATGCG TGAGCGGGGC GCCGGAAACG AGCTGCTGGA GCGCCTGGCG
GCCGACGAGC GTTTCCCGCT GGACGCCGCG CAGCTGGCCG AGCTGCTGGA GCAGCGGATC
GCGTTCACCG GTGCGGCGGC CGAGCAGGTG GAGGCGGTCG TGGCGCGCGT CGGTGAGGTC
GTCGCCCGCT ATCCCGCCGC GGCCGCCTAC GCTCCCGGGG CGATCCTCTG A
 
Protein sequence
MTAKPRIPNV LAARYASAEL ARLWSPEYKV VAERRLWLAV LAAQAELGVS VAPQAVADYD 
KVVEQVDLAS IAERERVTRH DVKARIEEFN ALAGHEHVHK GMTSRDLTEN VEQLQIRDSL
LVVRDRCVAL LARLGGLAAE HSGTVMAGRS HNVAAQSTTL GKRFASAADE LLVAFARLEE
LIARYPLRGI KGPVGTAQDM LDLLGGDRGR LAELEDRVAG HLGFARRLTS VGQVYPRSLD
YEVVTALVQL AASPSSLAKT VRLMAGHELV TEGFAEGQVG SSAMPHKMNT RSCERVNGLT
VVLRGYASMV GELAGDQWNE GDVSCSVVRR VALPDAFFAF DGLVETMLTV LSEFGAFPAV
IAAELDRYLP FLATTKMLMA AVRAGVGRES AHELIKEHAV ASALAMRERG AGNELLERLA
ADERFPLDAA QLAELLEQRI AFTGAAAEQV EAVVARVGEV VARYPAAAAY APGAIL