Gene Sros_2016 details

Gene Information

Locus tag: Sros_2016
Symbol:
ID: 8665298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2167632
End bp: 2169236
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: AMP-dependent synthetase and ligase
Protein accession: YP_003337747
Protein GI: 271963551
COG category:
COG ID:
TIGRFAM ID:
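
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the listed Gene Length) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2167632
end_bp = 2169236

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1605 bp) and Protein Length (534 aa) fields.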


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.279533
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCACC GGTTGACCCT GTCCGACGTC CTCGCCGAGC ACGCCCGCAG CCGCCCCCGG 
GTGACGGCGG TGGTGGACGG CGAGGTGCGG CTCACCTATC CCGAACTCGA CGAGCGGGTC
ACCCGCCTGG CGGACGCGCT GGCCGCGCGG GGCGTCACGG CCGGCGACCG GGTGTTGTGG
CTCGGCCGCA ACGGCCATCC GGTGCTGGAG CTGCTGCTCG CCTCCTCCCG GCTCGGCGCG
ATCTTCTGCC CGGCCAACTG GCGGCAGTCC ACCGACGAGC TGTCCTTCGT CGTGGACGAC
CTCACCCCGA AAGTCGTCGT CTGGGAGCGG TCCGAGGCCG CCGCCCCCCT GAGCGACGAC
GGCTGGATCC TCGCGGGCGA CGGCTACGAG GCCTTCCTGT CGAGCGGCTC CCCCGCCGCC
CACCGGGATG ACGCCGCGGA CGCCCTCCCG GGCGACCCCG CCGGCGCCCG CGCGGAGGTG
CCCGGAGGAC CCCTCACGGG CGGGCCCCTC ACGGGCGGGC CCCTCACGGG TGGGCCCGCG
CATCCGGCGG AGCCCGGCGA CGACACGCTC CCGGTGCTCG CGCTCTACAC CGCCGCCTTC
GACGGCCGGC CGAACGCGGC CCTGCTCAGC AGCGCCGCGC TGGTCGCGCA CAGCACCGCG
CTGCTGGTGG TCCGCCAGAT GGAGGACGGC TTCACCTTCC TGAACAACGG CCCGCTGTTC
CACGTGGGCA CGATGATGTT CTGCCTGGCC ACCCTGCAGA TCGGCGGGAC GAACGTGTTC
ACCCCGGCGT TCGACGCCGA GGAGGTCTGC CGCCTGATCG ACGCGGAGAA GGTCACCCAG
GCGTTCCTGT TCGGCCAGAT GATCGACGCG GTGACGGACG CGAACAAGGA CGGCAAATAC
GACCTGACAT CGCTCCGCTT CGTCTCCCAC TCGGCCGAGT GGGACGCGAT GACGACGGTG
GACGACTCCC CCTGGTGCCG GTCCAAGATG GGCGGCTACG GCCAGACCGA GGTGGGCGGC
ATGCTGACCT TCCTCGGCCT GGCCCCGGGC GCCGCCGGAT TCGCCGGACG GCCGTCGCCA
CTGGTGCAGG TACGGCTGCT CTCCGCCGAC GGGAGCGAGG TGCCGTCCGG GGAGGTGGGC
GAGATCTGCG CCCGCGGCAA GTCGCTTTTC TCCGGATATT TCAACAGACC TGAGCTCAAT
GCGGAGAAGA CACGCGGCGG CTGGCACCAC ACCGGCGATC TCGGCCGCCG CGAGCCCGAC
GGCACGATCA CCTTCATCGG CCCCAGGCTC CGCATGATCA AGTCCGGCAA CGAGAACGTC
TACCCGGCCG AGGTCGAGCG GGTGCTGAAG ACCCACCCCG CGGTCGCCGA CGCCGCCGTC
ATCGGCGTCC CCGACGACCG GTGGCACCAG GCCGTGAAGG CCGTCGTGGT CCTCAAGGGC
GAGGCCACGG CCGAGGACGT CGTCGAACAC GTCCGCACCC GGCTCGCCTC GTACAAGAAG
CCCAGGCACG TCGACTTCGT CGAGGCCATC CCCAGGAAGG GCTTCGCCCC CGACTACGAC
GCCCTCGACG CCGCCCACGG CGGCGGCGGC TACCCCGGCT CCTGA
 
Protein sequence
MIHRLTLSDV LAEHARSRPR VTAVVDGEVR LTYPELDERV TRLADALAAR GVTAGDRVLW 
LGRNGHPVLE LLLASSRLGA IFCPANWRQS TDELSFVVDD LTPKVVVWER SEAAAPLSDD
GWILAGDGYE AFLSSGSPAA HRDDAADALP GDPAGARAEV PGGPLTGGPL TGGPLTGGPA
HPAEPGDDTL PVLALYTAAF DGRPNAALLS SAALVAHSTA LLVVRQMEDG FTFLNNGPLF
HVGTMMFCLA TLQIGGTNVF TPAFDAEEVC RLIDAEKVTQ AFLFGQMIDA VTDANKDGKY
DLTSLRFVSH SAEWDAMTTV DDSPWCRSKM GGYGQTEVGG MLTFLGLAPG AAGFAGRPSP
LVQVRLLSAD GSEVPSGEVG EICARGKSLF SGYFNRPELN AEKTRGGWHH TGDLGRREPD
GTITFIGPRL RMIKSGNENV YPAEVERVLK THPAVADAAV IGVPDDRWHQ AVKAVVVLKG
EATAEDVVEH VRTRLASYKK PRHVDFVEAI PRKGFAPDYD ALDAAHGGGG YPGS
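
The sequence blocks above are printed in groups of ten characters, so they need whitespace stripped before any analysis. The following is a minimal sketch of how one might clean a block, compute its GC percentage, and translate it. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons). Only the first line of the gene sequence is used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a printed sequence block."""
    return "".join(block.split()).upper()


def gc_percent(block: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(block)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(block: str) -> str:
    """Translate complete codons from the first base onward."""
    seq = clean(block)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence as printed above.
first_line = "ATGATCCACC GGTTGACCCT GTCCGACGTC CTCGCCGAGC ACGCCCGCAG CCGCCCCCGG"

print(translate(first_line))
print(round(gc_percent(first_line), 1))
```

The translation of this first line can be compared against the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.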