Gene Sros_4878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4878 
Symbol 
ID8668172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5401066 
End bp5402658 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content69% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003340438 
Protein GI271966242 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0148671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTCCT ACTCGAGCGC GGCGGCGGCG ACCCCGCTGC TGGGCGAGAC CATCGGCGAC 
AACTTCGAGC GGACCGTGCG GCGCTTCGGC GACCGCGAGG CACTCGTCGA CGTGCCCTCC
GGCCGTCGCT GGACCTACGC CGAGCTCGAC GCCGACGTGA ACCGGCTGGC CCTCGCGCTC
CTCGCCTCCG GCATCGCCAA GGGCGACCGG ATCGGCATCT GGGCGCCCAA CTGCGCCGAA
TGGATGATCG TCCAGTACGC CACCGCGAAG ATCGGCGCGA TCCTGGTCAA CGTCAACCCC
GCCTACCGCG GCCACGAGCT CGACTACGTC GTCAGGCAGT CGGGCCTGCG CCTGCTCATC
AGCGCGCTCA CGCACAAGGG CAGCGACTAC CGCGCGATGG TCGAGGAGAT CGGGTTCGCC
GACGTCGTCT ACCTCGGCGA ACCCGGCTGG GACCGGCTGC TGGCGCTGAC CGCCCCCGAG
GAGCGGCTGC GCGAGCGCAT GGCGTCGCTG AGCGCCGACG ACGCGATCAA CATCCAGTAC
ACCTCCGGCA CGACCGGTTT CCCCAAGGGC GCCACGCTCT CGCACCACAA CATCCTCAAC
AACGGCTTCT TCGTCGGCGA GCTCATCCAC TACGACGAGC ACGACAGGGT GTGCCTGCCG
GTGCCCTTCT ACCACTGCTT CGGCATGGTG ATGGGCAACC TCGGCGCCAC CTCCCACGGC
GCGTGCGTGG TCATCCCCGC GCCCGGCTTC GACCCCGAGG CGACGCTGCG GGCCGTACAG
CAGGAACGAT GCACCTCCCT GTACGGTGTG CCGACCATGT TCATCGCCGA GCTCACTCTC
GCAGGGCAGT ACGACCTGTC CAGCCTGCGC ACCGGCATCA TGGCGGGCTC GCCCTGCCCT
GTCGAGGTGA TGAAGCGCGT CGTCACCGAG ATGAACATGG CCGAGGTCGC CATCTGCTAC
GGCATGACCG AGACCTCGCC CGTCTCCACC ATGACCAGGT CCGACGACAG CCTGGAGCGC
CGCACCGAGA CCGTCGGGCA GGTGATGCCG CACGTCGAAG TCAAGATCAC GCACCCGGAG
ACCGGGCTGA CCGTGCCGCG CGGCGAGCCC GGCGAGCTGT GCACGCGCGG CTACTCGGTG
ATGCTCGGCT ACTGGAACGA GCCGGAGCGC ACGGCCGAGG CCATCGACAC CGCCCGCTGG
ATGCACACCG GCGACCTGGC CACCATGGAC GCCGACGGCT ACGTCAACGT GGTCGGCCGG
ATCAAGGACA TGGTCATCAG GGGAGGCGAG AACGTCTACC CGCGCGAGGT CGAGGAGTTC
CTCTACCGCC ACCCCGACAT CGCCGACGTG CAGGTGATCG GCGTGCCCGA CGAGAAGTAC
GGCGAGGAGC TCATGGCCTG GGTCGTCATC CGCCAGGGCG GCACACCCCT GACCGCCGAG
GCCGTCAGGG AGTTCTGCGC CGGCAAGCTC GCCCACTACA AGATCCCGCG CTACGTCCAC
GTCGTCGACG GGTTCCCGAT GACGGTCACC GGCAAGATCC GCAAGGTCGA GATGCGCGAG
GAGGGCGTGC GCCTGCTCGG GCTCGACCAG TGA
 
Protein sequence
MQSYSSAAAA TPLLGETIGD NFERTVRRFG DREALVDVPS GRRWTYAELD ADVNRLALAL 
LASGIAKGDR IGIWAPNCAE WMIVQYATAK IGAILVNVNP AYRGHELDYV VRQSGLRLLI
SALTHKGSDY RAMVEEIGFA DVVYLGEPGW DRLLALTAPE ERLRERMASL SADDAINIQY
TSGTTGFPKG ATLSHHNILN NGFFVGELIH YDEHDRVCLP VPFYHCFGMV MGNLGATSHG
ACVVIPAPGF DPEATLRAVQ QERCTSLYGV PTMFIAELTL AGQYDLSSLR TGIMAGSPCP
VEVMKRVVTE MNMAEVAICY GMTETSPVST MTRSDDSLER RTETVGQVMP HVEVKITHPE
TGLTVPRGEP GELCTRGYSV MLGYWNEPER TAEAIDTARW MHTGDLATMD ADGYVNVVGR
IKDMVIRGGE NVYPREVEEF LYRHPDIADV QVIGVPDEKY GEELMAWVVI RQGGTPLTAE
AVREFCAGKL AHYKIPRYVH VVDGFPMTVT GKIRKVEMRE EGVRLLGLDQ