Gene Sros_5389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5389 
Symbol 
ID8668683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5906565 
End bp5908154 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content75% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003340894 
Protein GI271966698 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.539443 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCC TGACGGGAAC CGGCGTACTC GCCCGGCTCG TCGTCCGCCG CGACCGCCTC 
ATCCTCCCCC TGTGGGGCCT GCCCGCCGCG CTCTACCCGT CGAGCATCGC CGGCAGCACC
GCCGGGCTCT ACCCGACCGC CGAGGCCCTG CGCGGCGTGG GCGACGCGGC GATGGCCAAC
CCCTCGCAGG CGGCCATGCG CGGCCCGGTG TTCGAGGCGA GCGTGGGCGG GCTGACCGCA
CACACCGTGA CCAGTTCGGG TGGCATGCTG CTCGGGCTGG TGAGCATGCT CCTGATGATC
CGGCACAGCA GGGGCGAGGA GGAATCGGGC CGCCGCGAAC TCGCCGCCGC GGGAGTGGTG
GGCAGGCACG CGCCGTTGAC CGCGGCGCTG GCCGTGGTGC TCGCCGCCAA CCTGGTGATC
GCCGTCCTCA TGGCGGGCGC GCTCACCGGC GTGGGGCTGC CCGCCTCCGG CTCCTTCGCG
CTGGCCCTGT CGCTGGCCGC CGCCGGATGG ACCTTCGCCG CGGTCGGCGC CCTCGCCGCG
CAGCTCACGG AGAGCGTGGC CGCGGCCAGG GGGATAGGCA TCGGGGTGTT CGCCGTCTTC
TTCCTGATCC GAGCGGTCGG CGACGCGGGC GGGGTGGCCT GGCTGTCGTG GGCGTCGCCG
CTGGGCTGGA CGCTGCGGGT ACGGCCGTTC GCGGGCGAGC GCTGGTGGGT CTTCGCGCTC
CTGCTGGCCC TGGTCGTCGC GCTGGCAGGC ACGGCCTACC GGCTGTCGTC CCGCCGCGAC
CTGGCCGCCG GTGTGCTGCC TGCACGGCTC GGGCCCGTAG CGGCGGCGCC CGGGCTGCGC
AGCGCGCCGG CCCTGGCCTG GCGGCTGCAC CGCGGTCAGC TGGTGGCCTG GATCGCCGGG
TTCGCGGCGG GCGGCCTCGC GCTCGGCGGT GCCGTCTCCG GCGGGATCGA AGGCCAGATC
GACGCTCCGC AGATCATGGA GATGATCGCC AGGGTGGGGG GCGGGGACGC CGAGCCGGCC
GACTTCTTCG TCAACTACCT GCTGTCCATG CTCGCCTGGA TCATCGCCGC CTACGGCATC
CTGTCCGCGC TCCGGCTGCG GACGGAGGAG ACGGCGGGGC GCGCCGACCT GGTGCTGGTG
ACCCCGACGA GCCGCATCCG GTGGGCGCTC AGCCATCTGT TCATGGCGGT GGTCGCGCCC
GCGGCGGCAA TGGTGGCGCT GGGCGCGGCC ACCGGACTCG CCTACAGCGC GCGCGGCGGC
GACCCGGGCA AGTTCCCGCT GGTGCTGGGG GCCGCGCTGG CCTACCTGCC CGCCGTCTGG
GTGATGACGG GGATCGCGGT CGTCCTGGCC GGGCTGCTGC CCCGGCTGTC CACGGCCGCG
TGGGGGATCT GGGTGGCGTT CATCCTGCTC GACGTGCTCG GCACTCTGGG GCAGGTCGAC
GAGTCGGTGC TGAACATCAT CCCGTTCGTG CACGTGCCGT GGATCATCCT CGGCCAGACG
GCAGTGGCAC CGCTGCTCCT AATGACCGTG GTCGCCGTCG CCCTGGGCGC CACCGGGCTG
GCCGGTCTGC GCCGCCGCGA CATCGCGTGA
 
Protein sequence
MSALTGTGVL ARLVVRRDRL ILPLWGLPAA LYPSSIAGST AGLYPTAEAL RGVGDAAMAN 
PSQAAMRGPV FEASVGGLTA HTVTSSGGML LGLVSMLLMI RHSRGEEESG RRELAAAGVV
GRHAPLTAAL AVVLAANLVI AVLMAGALTG VGLPASGSFA LALSLAAAGW TFAAVGALAA
QLTESVAAAR GIGIGVFAVF FLIRAVGDAG GVAWLSWASP LGWTLRVRPF AGERWWVFAL
LLALVVALAG TAYRLSSRRD LAAGVLPARL GPVAAAPGLR SAPALAWRLH RGQLVAWIAG
FAAGGLALGG AVSGGIEGQI DAPQIMEMIA RVGGGDAEPA DFFVNYLLSM LAWIIAAYGI
LSALRLRTEE TAGRADLVLV TPTSRIRWAL SHLFMAVVAP AAAMVALGAA TGLAYSARGG
DPGKFPLVLG AALAYLPAVW VMTGIAVVLA GLLPRLSTAA WGIWVAFILL DVLGTLGQVD
ESVLNIIPFV HVPWIILGQT AVAPLLLMTV VAVALGATGL AGLRRRDIA