Gene Sros_3797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3797 
Symbol 
ID8667087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4232371 
End bp4233660 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content67% 
IMG OID 
ProductIS605 family transposase OrfB 
Protein accessionYP_003339460 
Protein GI271965264 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.9215 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.414505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTACCG GACGCAAGTA CCGCCTGGAC TTCACCCCCG AGCAGGGTGA ATTCGCCGAA 
CGCATCGGCG GGGCGTGCCG GTCGGTGTGG AACACCGCGC TGGAACAGCG CCGGATCTAC
CGTCGGCGTG GTGGGTGGAT CGGCTATCAC GACCAGGCCC GCCAAGTGGC TGAGGCGAAA
GATGACTTCC CCTGGCTGGC CGAGGTGCCC GGTCACTGCC TGCAGCAGGC GTTGATCGAC
CTGGATCAGG CGTGCGCCAG GCACGGCACG TGGAAGGTCC GCTGGAAGTC GAAGGTCGCC
AACCCGCCGA GCTTCCGATT CCCTGAGGGC GGGAAAATCA CGGTCGAGCG GCTCAACCGG
CGCTGGGCGC GAGTGAAGCT GCCGAAACTC GGTTGGGTCC GCTTCCGCCT CACCCGCCCG
CTCGGCGGGA AGGCCAAGAA CGCCACCGTC AGCCGGGACG GTGAGCATTG GTACATCAGC
TTTCTCGTCG AGGACGCAGT CACCCCGCCT GAGCGGCACG CCGACCCCGG CAGCGCCGTG
GGGATCGACC GGGGCGTGGT CAAGGTCGTG ACCCGCTCGG ACGGCCGCTT CCACCATCGG
GTGTTCGCCC GTGATCGGGA AGTCGAGCAT GCCAAGAAGC TTCAGCGAGA CTTCGTCCGG
ACCGCGAAGG GATCGGCCCG GCGCAAGGAA GCTGCCGGGC GGGTCGCCGC TATGGCGCGG
AAGGTCCGCA GACGCCGGGA GGACTTCGCC GCCAAGACCG CCCATACCCT GGCCACGGGC
TTTGAAATGG TCGTGTTCGA GGCGCTCACG ACCAAGAACA TGACCGCTGG CGTCGAACCC
AGGCCAGACC CTGAGCAGCC GGGCGCGTTT TTGCCGAACG GGGCCGCCGC TAAGACCGGA
CTGAACCGGT CTATCTTGGA CAAGGGCTGG TACCGGATCG AGCTGGCCAC CCGTAGTAGG
GCCCGGTATA CGGGCACCCA CGTGATCACT GTCAACCCGG CGTACACGAG TCAGACGTGC
AACGTGTGCA CGGTGGTGGA CCGGAAGTCC CGCGAGAGCC AAGCGGTCTT CCGGTGCACC
TCGTGCGGAC ACATCGAGCA CGCCGACGTG AACGCCGCCA AGAACGTACT CACCGCCGGG
AGGGCGGAGT TCGCACAGCC CAGACCGGGT GTGCGGGCTG GGGCGCGCAA ACCACGCAAC
CGCGTGGGCC GCAAGGCCAA TCGCCAAGCA ACAGCAGCGC AGAGCACCGC AACAGCGGGG
TCCGGGCTGG CTGGAATCCC CCGGCTTTAG
 
Protein sequence
MLTGRKYRLD FTPEQGEFAE RIGGACRSVW NTALEQRRIY RRRGGWIGYH DQARQVAEAK 
DDFPWLAEVP GHCLQQALID LDQACARHGT WKVRWKSKVA NPPSFRFPEG GKITVERLNR
RWARVKLPKL GWVRFRLTRP LGGKAKNATV SRDGEHWYIS FLVEDAVTPP ERHADPGSAV
GIDRGVVKVV TRSDGRFHHR VFARDREVEH AKKLQRDFVR TAKGSARRKE AAGRVAAMAR
KVRRRREDFA AKTAHTLATG FEMVVFEALT TKNMTAGVEP RPDPEQPGAF LPNGAAAKTG
LNRSILDKGW YRIELATRSR ARYTGTHVIT VNPAYTSQTC NVCTVVDRKS RESQAVFRCT
SCGHIEHADV NAAKNVLTAG RAEFAQPRPG VRAGARKPRN RVGRKANRQA TAAQSTATAG
SGLAGIPRL