Gene Sros_2035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2035 
Symbol 
ID8665317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2183616 
End bp2185595 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content69% 
IMG OID 
Producttransposase Tn3 family protein 
Protein accessionYP_003337763 
Protein GI271963567 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.200024 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.850327 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTCGTG AGTGGGAGCT GGACGAGCTC ATCGAGTGCT GGACGCTGGA TGAGGACGAA 
CTGGGGCTGC TGAGCAACAA GTCCGGGGCG ACGCGGCTGG GGTTTGCGCT GCTGTTGAAG
TTCTTCGAGC TGGAGGGGCG GTTCCCTCGG CGCGAGGATG TGCCCAAGGC CGCGGTGGAG
TACATGGCCG GGCAGGTGGG GGTGGAGGCG GCGCTGTTGT TCGCCGACTA CCAGTGGAGC
GGCAGGACGA TCGAGTACCA CCGCAGCCAG GTCCGCAAGC ACCATGGGTT CCGGCAGCCG
ACAGTGGCCG ATGAGGACAA GCAGATCTTC TGGCTGGCGG GTGAGCTGTG TCCGGAGGAG
TTGTCGCGGG ATCGGCTGCG CGCGGCGCTG CTGACGCGGT TCCGGGCGGA GAAGATCGAG
CCGCCGACTC CGGTCCAGGT GGACCGGCTG CTGGGCGCGG CCGTGGCGAT GTTCGAGCGG
GACTTCACCA CCCGCACCGC TCAGCGGCTG GGATCGGCGG CCGGCGCGCG GCTGGACGAT
TTGATCGCGG CTCCCGAGGT CGAGGCCGAC CCTGTCGAGG TGCCGGTGGC CGCGCCGGGA
CGCAGCTTCT TGCAGGAGCT GAAGGAGGAT CCGGGGCCGA TCCAGTTGGA CACGCTGCTG
GCCGAGATCG TCAAGCTGGA ACGGGTGCGC GCGATCGGCC TGGGCGAGGG CCTGTTCGAG
GGGGTGTCGG AGAAGATCGT GGAGTCCTGG CGGGCGCGGG CGATGCGGAT GTACCCCTCC
GACTTCGCCG CGGCGGCCGA GCCGGTGCGG TGGACCTTGC TGGCGGCGCT GTGCTGGGTG
CGCAAGACCG AGCTGACCGA CGGCCTGGTG GAGCTACTGA TCCAGCTCGT GCACAAGATC
AGCGTGCGGG CCGAACGCAA GGTCGAAAGC GAGATCAGCG CCGAGTTCCG GCGGGTGCAC
GGCAAGAACG GCATTCTGGT TCGGCTCGCC CAGGCCGCCC TGGACCTGCC CGAGGAGGTC
GTGCGCGAGG CGATCTACCC GGTGGTGGGG GCGCGGACGC TGGCAGACAT CGTCGCCGAG
GCCAAGGCGA ACGAGAAGGT GTTCAACTCG CGGGTGCGCA CCAAGCTGCG CGGCTCCTAC
TCGCGCCACT ACCGGCGCGG GCTACCTAAG CTGCTGCGCG CGCTGGAGTT CGGCTGCTCC
AACACCGCCT TCCGGCCGGT GATGGACGCC CTGGCGCTGC TGGACCGGTA CGCCGACTCC
GAGGCGGTGC ACTACGACCG CAGCGAGACC GTCCCGCTGG AGCACGTGGT GCCCGACGAT
TGGAAGGACG CGGTCGTCGA CCCCGACACC GGCCGCGTGG AGCGCATCCC GTACGAGCTG
TGCGTGCTGG TCGCCCTGCA CAAGGCGATC CGCCGCCGGG AGATCTGGAT CACAGGTGCC
AAGACGTGGC GCAACCCCGA CGACGACCTG CCCGCCGACT ATGAGAACAA CCGGGACGTG
CACTACGAGG CGTTGTCCAA GCCCCGTGAC CCGGCGGCGT TCATCGCCGA CCTGCAACGC
CGGCACCTGG CCGCCCTGGA CCGGCTCAAC ACCGCGATGG GCTCCGACAC CACCGGCGGA
GTGAAGCTGA CGCGACGCAA AGGCGAGCCG TGGATCTCGG TCCCGCCCCT GAACCGCCGT
CCCGAGCCGA CGGGCCTGGT GGCGCTGAAG GAGGAGATCT CGCGGCGCTG GGGCGTGATC
GATCTGCTGG ACGTGCTCAA GGACGTCGAC CACGTCACCG GGTTCACCAA GGAGTTCACC
TCGGTGGCCT CCCGCACCAT CACCCATCCT GACGTGCTGC AACGCCGGCT GCTGCTGTGC
CTGTACGGGC TGGGCACCAA CGTGGGCATC AAGCGGGTGG CCGACGGCGC GGTCGCGGCC
GGGCTGGAGG ACAGCGAGGC GGTGCTGCGC CGGATCCGGC GGCTGTTGAT CCGCGGATGA
 
Protein sequence
MRREWELDEL IECWTLDEDE LGLLSNKSGA TRLGFALLLK FFELEGRFPR REDVPKAAVE 
YMAGQVGVEA ALLFADYQWS GRTIEYHRSQ VRKHHGFRQP TVADEDKQIF WLAGELCPEE
LSRDRLRAAL LTRFRAEKIE PPTPVQVDRL LGAAVAMFER DFTTRTAQRL GSAAGARLDD
LIAAPEVEAD PVEVPVAAPG RSFLQELKED PGPIQLDTLL AEIVKLERVR AIGLGEGLFE
GVSEKIVESW RARAMRMYPS DFAAAAEPVR WTLLAALCWV RKTELTDGLV ELLIQLVHKI
SVRAERKVES EISAEFRRVH GKNGILVRLA QAALDLPEEV VREAIYPVVG ARTLADIVAE
AKANEKVFNS RVRTKLRGSY SRHYRRGLPK LLRALEFGCS NTAFRPVMDA LALLDRYADS
EAVHYDRSET VPLEHVVPDD WKDAVVDPDT GRVERIPYEL CVLVALHKAI RRREIWITGA
KTWRNPDDDL PADYENNRDV HYEALSKPRD PAAFIADLQR RHLAALDRLN TAMGSDTTGG
VKLTRRKGEP WISVPPLNRR PEPTGLVALK EEISRRWGVI DLLDVLKDVD HVTGFTKEFT
SVASRTITHP DVLQRRLLLC LYGLGTNVGI KRVADGAVAA GLEDSEAVLR RIRRLLIRG