Gene Sros_8566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8566 
Symbol 
ID8671900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9451403 
End bp9452773 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content69% 
IMG OID 
Productcystathionine beta-synthase 
Protein accessionYP_003343951 
Protein GI271969755 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGTAC ATGATTCACT CGTGGAGCTG ATAGGCAACA CTCCACTCGT CCGGCTGCAC 
AAGGTGACGG CGGGGCTGCC GGCTCAAGTG CTGGCCAAGG TGGAGTATTT CAACCCGGGC
GGCTCGGTGA AAGACCGGAT CGCGGTGCGG ATGATCGATG CCGCCGAGAA GTCGGGCGCG
CTGCGCCCCG GCGGCACGAT CGTGGAGCCC ACGTCGGGCA ACACCGGGGT CGGGCTGGCC
ATCGTGGCCC AGCAGCGGGG CTACAAGTGC CTGTTCGTGG TGCCCGACAA GGTCGCCCAG
GACAAGATCG CGGTCTTGCG CGCCTACGGT GCGGAGGTCG TGGTCTGCCC GACGGCGGTC
TCTCCCGACC ACCCGAGTTC CTACTACTCC GTCTCCGACC GGCTGGCCCG GGAGACTCCG
AACGCCTGGA AGCCGGACCA GTACTCCAAC CCGAACAACC CCGACAGCCA CTACCACTCC
ACCGGCCCGG AGATCTGGGA GCAGACCGAG GGCCGGCTCA CCCACTTCGT GGCGGGCGTC
GGCACGGGCG GCACCATCAG CGGTATCGGT CGCTACCTCA AGGAGGTCTC CGACGGCCGG
GTGAAGATCA TCGGAGCGGA CCCGGAGGGC TCGGTCTACT CCGGCGGCAG CGGACGGCCC
TACCTGGTGG AGGGCGTCGG CGAGGACATC TGGCCGGCCA CCTACGACAC CACGATCTGC
GACGAGATCA TCGCCGTCTC CGACAAGGAC TCCTTCGGCA TGACCCGTCG CCTGGCCCGC
GAGGAGGCGC TGCTGGTGGG CGGCTCCTGC GGCATGGCGG CGGTCGCGGC ACTGCGCGTG
GCCAAGCAGG CCGGCCCGGA CGACGTGGTC GTGGTGCTGC TGCCCGACGG CGGCCGGGGC
TACCTGTCGA AGATCTTCAA CGACGACTGG ATGGCCGACT ACGGCTTCCT GACCACCTCC
AGCGACGAGG GCCTGGTCAA GGACGTGCTG ACCCGCAAGG GATCCGGCAT GCCGGAGTTC
GTGCACACCC ACCCGCACGA GTCGGTGGAC ACGGCCATCT CCATCATGCG GGAGTACGGC
GTCTCGCAGC TCCCGGTGAT GAAGGAGGAG CCGCCGGTCA TGGCCGCCGA GGTGGTCGGC
TCGATCCTGG AGCGCGACCT GCTCGACGCC CTCTACCGCG GCCGGGTGCG GCCGAACGAC
CCGCTGGCCG ACCACATGTC CCAGCCGCTG CCTATGATCG GTGCGGGGGA GCCGGTCTCC
ATCGCGGTCG AGGCGCTGGA GAAGGCCGAC GCCGCGGTCG TCCTCGACGA CGGCAAACCC
GTCGGACTGG TCACCCGTCA GGACCTGCTG GCCTTCCTCG CCAACCACTA G
 
Protein sequence
MRVHDSLVEL IGNTPLVRLH KVTAGLPAQV LAKVEYFNPG GSVKDRIAVR MIDAAEKSGA 
LRPGGTIVEP TSGNTGVGLA IVAQQRGYKC LFVVPDKVAQ DKIAVLRAYG AEVVVCPTAV
SPDHPSSYYS VSDRLARETP NAWKPDQYSN PNNPDSHYHS TGPEIWEQTE GRLTHFVAGV
GTGGTISGIG RYLKEVSDGR VKIIGADPEG SVYSGGSGRP YLVEGVGEDI WPATYDTTIC
DEIIAVSDKD SFGMTRRLAR EEALLVGGSC GMAAVAALRV AKQAGPDDVV VVLLPDGGRG
YLSKIFNDDW MADYGFLTTS SDEGLVKDVL TRKGSGMPEF VHTHPHESVD TAISIMREYG
VSQLPVMKEE PPVMAAEVVG SILERDLLDA LYRGRVRPND PLADHMSQPL PMIGAGEPVS
IAVEALEKAD AAVVLDDGKP VGLVTRQDLL AFLANH