Gene Sros_5057 details

Gene Information

Locus tag: Sros_5057
Symbol:
ID: 8668351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5581389
End bp: 5582579
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: Erythromycin esterase like protein
Protein accession: YP_003340590
Protein GI: 271966394
COG category:
COG ID:
TIGRFAM ID:
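The coordinate and length fields in this record are related by simple arithmetic, which makes for a quick consistency check. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5581389
END_BP = 5582579


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS: number of codons minus one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1191
print(protein_length(length_bp))  # 396
```

Both results agree with the Gene Length (1191 bp) and Protein Length (396 aa) fields.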


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.581564
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.641423
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAACG ATACCGTTGA AGGCATGGAT ATCAAGGACA TGGCCCGGCC GTTCGATGGC 
ACGGGCATCT CGGCCTTCCT CCGGTCGCTT CCCGCCAAGC CCCTGCTGCT CGGCCTGGGC
GAGGCCAGGC ACTTCGTGGA GGAACTGGGC GACCTGCGCA ACGAGATCTT CCGGCATCTG
GTCGAGCACG AGGGCTACCG GTCGTTCGCC ATCGAGAGCG ACTGTCTCAT GGGCCTCGTG
GTCGACGACT ACGTCACGAC GGGTACGGGC ACGCTCGACG ACGTCATGGA ACGCGGCTTC
AGCCACGACT TCGGCACGTC CCCGGCCAAC CGCGACCTCG TACGCTGGAT GCGCGCGTAC
AACGAGGAGC ATGACGAAAA GCTCCGGTTC TTCGGCTTCG ACGGTCCGCT GGAGTATTGG
GCCGAGAGCC CGCGCCAGGC GCTCACCGCC CTCTACGCCC TCCTCGACGG CCCGCTCCCC
TGCACCATGG AAACCCTCGA CGCGCTGCTC GGCCCGGACG GCCGGTGGAC GGACGAGGCC
ACGGTCATGG ACCCGTCCCG GTCGATCGGC CAGTCCGCCG ACGCCCAGCG GCTGCGGCTG
CTCGCCGACG ACCTGGTGGC ACTGCTCGAC ACTCAGGTGC CGCGGCTGAG CGCGGAGGAC
AGGGAGCGGG CGGAGCTGTA CGGGCGCACC GCCGTCGGCC TGCTCCGCTA CCACCACTGG
ATGGCCGACA CGTCCCCGGC CCGGATAGCC AAGCTGTCGG GCCTGCGGGA CGCGATGATG
GCCGCCAACC TGCGTGCCGT CGCCGAGCGC GGCCCGGCCC TGGTCTTCTC CAGCAACCTC
CACCTGCAGC GGAACAAGAG CTTCATGCTG CTCGGCGACC AGCCGCTGGA GTGGTGGAGC
GCGGGGGCGA TCACCGGAGC GCACCTGGGC GACCGGTACG CCTTCCTGGC CTCAGCCCTC
GGCACGGTCG GCGACGACAC CCCGCCCCCC GACACCGTCG AGGGCATCCT GTCCACGCTC
CCGTGGGACC ACTCTCTCAT CGACGCCCGC CGCCTGGCCG AGGCCACCAC GGAGCCCGCC
CAGCGCATCT CCCACGATTT CGCCTATTTC CCGCTGGACC CGGCCCAGCT CGACATGATC
GACGGCGTCG TCTTCCTCAA GCAGGCTGTC AGGCTGAAAA AGCTGGGTTG A
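The GC content field can be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C among the A, C, G, and T bases, with whitespace between the 10-base blocks ignored. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C among A, C, G, T bases; spaces and newlines are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base blocks of the gene sequence, as an illustration only.
first_blocks = "ATGTTCAACG ATACCGTTGA"
print(gc_content(first_blocks))  # 0.4
```

Passing the full 1191 bp sequence to the same function gives a value that can be compared against the 69% reported in the record.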
 
Protein sequence
MFNDTVEGMD IKDMARPFDG TGISAFLRSL PAKPLLLGLG EARHFVEELG DLRNEIFRHL 
VEHEGYRSFA IESDCLMGLV VDDYVTTGTG TLDDVMERGF SHDFGTSPAN RDLVRWMRAY
NEEHDEKLRF FGFDGPLEYW AESPRQALTA LYALLDGPLP CTMETLDALL GPDGRWTDEA
TVMDPSRSIG QSADAQRLRL LADDLVALLD TQVPRLSAED RERAELYGRT AVGLLRYHHW
MADTSPARIA KLSGLRDAMM AANLRAVAER GPALVFSSNL HLQRNKSFML LGDQPLEWWS
AGAITGAHLG DRYAFLASAL GTVGDDTPPP DTVEGILSTL PWDHSLIDAR RLAEATTEPA
QRISHDFAYF PLDPAQLDMI DGVVFLKQAV RLKKLG
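The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a hand-rolled translator, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is in frame and that translation stops at the first stop codon.

```python
from itertools import product

# Codon table in TCAG order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq.upper().split())  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGTTCAACG ATACCGTTGA AGGCATGGAT"))  # MFNDTVEGMD
```

The output matches the first ten residues of the protein sequence listed above.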