Gene Sros_3489 details


Gene Information

Locus tag: Sros_3489
Symbol:
ID: 8666777
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3859571
End bp: 3860725
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: rRNA (guanine-N(2)-)-methyltransferase
Protein accession: YP_003339168
Protein GI: 271964972
COG category:
COG ID:
TIGRFAM ID:
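
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch in Python. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The record does not state either assumption, but both are consistent with its values. The variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 3859571
end_bp = 3860725

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

With the values in this record the sketch gives 1155 bp and 384 aa, matching the listed gene and protein lengths.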


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.180285
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.478822
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGTT TGACGACGTC AAGGGGTGAG TTCGACCTCG CCCGCTTTCC CGATGATCCC 
CGTGACCTGC TCCGCGCCTG GGACGCCGCC GACGAATACC TGCTGCGGCA CCTGGACGGA
ATCGACGGTG AGCCGACGGA CCTGTCGGGC ACCGTGGTCG TGGTGGGCGA CCGATGGGGT
GCCCTGGTCA CCGCGCTGGC CGATCACCGC CCCGCGCAGA TCACCGACTC CTTCCTCACC
CAGGAGGCGA CCCGGGGCAA CCTGAAGCGC AACGGGGCCG ACGCGGACAT GGTGCGGCTG
CTGTCGACCA GGGACACGCC GCCGGACCGG ATCGACGTGC TGCTGATCCG GGTGCCCAAG
AGCCTCGCGC TCCTGGAGGA CCAGCTGCAC CGGCTCGCAC CCCGCGTCCA CACGGGCACC
GTCGTCATCG GCACGGGGAT GGTCGCCGAG ATCCACACCT CGACCCTGAA GCTGTTCGAG
CGGATCCTCG GGCCGACCCG GACGTCGCTG GCGGCCAGGA AGGCACGGCT CATCTTCTGC
TCGCCGGCCC CTCAGCCCCT CCGGGGCGCC AGCCCGTGGC CGCGGAGCTA CGCGCTGCCC
ACCGGCATCG GAACGGCCTC GGGCCTGACC GTCACCAACC ACGCGGGCAT CTTCTGCGCC
GACCGCCTCG ACATCGGCAC CCGCTTCCTC CTGCGGAACC TCCCCCGCCG CCGGGGCCCC
GAGCGCGTCG TGGACCTGGG CTGCGGCAAC GGGGTGGTCG GAGTGGCCGC GGCGCTGGCC
AACCCCGAGG CCGAGGTGAT GTTCATCGAC GAGTCCTACC AGGCGGTGGC CTCTGCGGAG
GCCACGTTCC GGGCGAACGC CGGCGCGGGC ACCACGGCGC GGTTCGTGGT GGGTGACGGC
CTGTCCGGTG TACCGGCCGG GACGGTGGAC CTGGTGCTGA ACAACCCGCC GTTCCACACC
CACCAGGCGA CGACCGACGC GACGGCCTGG CGCATGTTCA CCGGGTCGCG CGCCGCGCTA
CGCCGTGGCG GCGAACTGTG GGTGATCGGC AACCGGCACC TCGGCTACCA CGTGAAACTG
CGCCGGATCT TCGGTAACTG CGAGGTCGTC ACGAGCGACC CGAAGTTCGT CATCCTGCGG
GCCGTCAGAA GCTGA
 
Protein sequence
MNRLTTSRGE FDLARFPDDP RDLLRAWDAA DEYLLRHLDG IDGEPTDLSG TVVVVGDRWG 
ALVTALADHR PAQITDSFLT QEATRGNLKR NGADADMVRL LSTRDTPPDR IDVLLIRVPK
SLALLEDQLH RLAPRVHTGT VVIGTGMVAE IHTSTLKLFE RILGPTRTSL AARKARLIFC
SPAPQPLRGA SPWPRSYALP TGIGTASGLT VTNHAGIFCA DRLDIGTRFL LRNLPRRRGP
ERVVDLGCGN GVVGVAAALA NPEAEVMFID ESYQAVASAE ATFRANAGAG TTARFVVGDG
LSGVPAGTVD LVLNNPPFHT HQATTDATAW RMFTGSRAAL RRGGELWVIG NRHLGYHVKL
RRIFGNCEVV TSDPKFVILR AVRS
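
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The sketch below, in plain Python with no external libraries, strips the whitespace, computes GC content, and translates a nucleotide string. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical and not part of any database tool. The codon table is built from the standard NCBI amino acid ordering, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G; third position varies fastest).
# Tables 1 and 11 use this same codon-to-amino-acid assignment.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean_sequence(raw)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three and last two blocks of the gene sequence listed above.
print(translate("ATGAACCGTT TGACGACGTC AAGGGGTGAG"))
print(translate("GCCGTCAGAA GCTGA"))
```

The first call covers the first ten codons and can be compared with the start of the protein sequence; the second covers the final five codons, ending in the stop codon. Passing the full gene sequence to `gc_percent(clean_sequence(...))` gives a value to compare with the GC content field.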