Gene Sros_3348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3348 
Symbol 
ID8666636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3673994 
End bp3675586 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content74% 
IMG OID 
Productnon-ribosomal peptide synthetase 
Protein accessionYP_003339030 
Protein GI271964834 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00415757 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGACCGCCC CGACCACCGA GACGGAGCTC GGCGCCCTGT GGCGCCAGGT GCTCCGCGTG 
CCCGCCGTCG CCCCCGGCGA CAACTTCTTC GAGCTGGGCG GCGACTCCTT CCGGGCCGCC
CAGCTCGCCG GCCTGGTCGG CACCCGCCTG GGGGTGCCCG TCACGCCGGC CCTCGCCTTC
GACCGGCCGG AGCTGGCCGG GCAGGCCCGC TGGATCGACG ACGCCCGCGC CGCGGGGCTC
TCGGCCGCCG CCGGGCCCGC GACCGGCGGG GGAGCTCCGC TCAGCACCCA GCAGGAGGAC
TTCCTGTACT GGATGTTCGA GAGCGAGCCC GTCCGCGACA TCGGGTCCTG CGCCACCGCG
ATCCGGATCC GCGACTCCTT CGACGTGGCC GTCCTCACCC GCGCGCTGGA GGCGGTGATC
GCGCGGCACG AGCCGCTGCG CAGCGTCGTC ACCGCGTCGG GGGAGGTGAT CGTCGCTGAC
GAGCTGCCGC CCGAGGTCGC CGAGGCCGTG GCCGAGGGCC GGACGCCGCA GGAGCGCGAG
CGCGACGCCG AGCGGATCGT CTGGCACGAG CGCATGCGTC TCGACGACGT TCTGCGCGGC
CCCCTCGTGC GGGCCCTCGT CGTGCACCTC GGCGAGGACG ACCACGTGCT GGTCCTCGCC
GTGCACCACT TCGCCTTCGA CGGCTTCTCC CTGGGCGTCA TGCTCCGCGA GCTGGGCATC
GTCTACTCGG CCCTGCGTAC GGGCTACCCC AGCCCGCTGC GCCCGCTGCC GATGTCCTAC
GCCGACTACT GCGCCTTCAC CCGCGAGCAG TGGCCGCGCA ACCAGGCGTA CTGGGACCTG
GTCCTGGAGG GTGCCCCCCG CGAACTGACG CCGTTCCCCG GCCGCAGGGA GACCACCCTG
TTCTCCCGCC GCAGGCACGC CTTCGAGATC GACGCGGAGC TGGCCGGCCG GCTGGGGGAG
ACCGCCAGGG CGCGCGGCGC GACCACGTTC ATGGCGGTGG CCGCGTGCTG GACCTGGCTG
CTGCGCCAGT GGACGGGGAT GACCGACCTG GTGGTGATGT CGCCCGTGCC CGGCCGTACC
GCGCCCGAGC ACGAGACGCT GATCGGCTGC CTGGTCCAGT CGCTCATCCT GCGCCTGGAC
GCCTCGGGCG ACCCCTCCTA CGGCGAGCTG GTCGACCGGG TCCGGGAGGT GTCCGTGGGG
GCGGTGGCGC ACCAGTTCCA CGCCTACCAG GACGCCCGGC TCCGGGTGCC CTTCCCCTCG
CGGATCCACT ACGAGAGCTT CGGCGCCCCG CACTTCCCCG GCCTCATGTC CGAGGCCTTC
CCCTTCCCCC GGGAGCAGGA GGGGCTGGAC TGGAGCGCCA ACCCGGGCGA GGTCGACCTC
AGCGCCCCGG AGCTGATCGT CGAGGAGCAG CGGGACGGCT CCATGCTGGC CGCCGTGGTC
TACAACCACT ACGGTTACGA CCCCGCGACG GCCGCCGAGC TCGCCGAGTC CTTCCAGGAG
TACGTCAGGG CCGCCGTGGC CGTTCCTGAC TCCCCGCTGC CGCCGCTGCC CGCGACAGCC
AGCCACGCCG GGGCGGAGGC CAGCCAGGGC TGA
 
Protein sequence
MTAPTTETEL GALWRQVLRV PAVAPGDNFF ELGGDSFRAA QLAGLVGTRL GVPVTPALAF 
DRPELAGQAR WIDDARAAGL SAAAGPATGG GAPLSTQQED FLYWMFESEP VRDIGSCATA
IRIRDSFDVA VLTRALEAVI ARHEPLRSVV TASGEVIVAD ELPPEVAEAV AEGRTPQERE
RDAERIVWHE RMRLDDVLRG PLVRALVVHL GEDDHVLVLA VHHFAFDGFS LGVMLRELGI
VYSALRTGYP SPLRPLPMSY ADYCAFTREQ WPRNQAYWDL VLEGAPRELT PFPGRRETTL
FSRRRHAFEI DAELAGRLGE TARARGATTF MAVAACWTWL LRQWTGMTDL VVMSPVPGRT
APEHETLIGC LVQSLILRLD ASGDPSYGEL VDRVREVSVG AVAHQFHAYQ DARLRVPFPS
RIHYESFGAP HFPGLMSEAF PFPREQEGLD WSANPGEVDL SAPELIVEEQ RDGSMLAAVV
YNHYGYDPAT AAELAESFQE YVRAAVAVPD SPLPPLPATA SHAGAEASQG