Gene Sros_3614 details


Gene Information

Locus tag: Sros_3614
Symbol:
ID: 8666902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4008701
End bp: 4011730
Gene Length: 3030 bp
Protein Length: 1009 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: transposase Tn3 family protein
Protein accession: YP_003339289
Protein GI: 271965093
COG category:
COG ID:
TIGRFAM ID:
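
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are made up for this example and are not part of the database.

```python
# Values copied from the Gene Information record.
start_bp = 4008701
end_bp = 4011730
gene_length_bp = 3030
protein_length_aa = 1009

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the final codon is the stop codon and is not counted in the protein length.
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 3030 0 1009
```

Under these assumptions the three fields agree: the coordinate span equals the gene length, the gene length is divisible by 3, and 1010 codons minus the stop codon gives 1009 residues.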


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGTG TGCTGGACGA GGACGAGCTC GTTGGGAACT GGACGCTGGT CGGGGATGAG 
CTGGATCAGC TGTCAGGCCG GCGGGGCGTG ACCAAGCTCG GGTTCGCGCT CCTGCTGCGG
TTCTATGCGC TGAACGGCCG GTTCCCGACG GGCCGCGCCG AGCTTCCCGA TCAGGCCGTC
GCCTACGTCG CCCGGCTGGT CGACGTACCC GCCTCAGAGC TGGGGCTGTA TGAGTGGGAC
GGCCGGACGA TCAAGGATCA CCGCAAGGAC ATCCGGAAGT ACTTCGGGTT CCGCGAGTGC
TCTCTCGCTG ATTCCGACAA GGCCGCGGAC TGGCTGGCGG CCCAGGTCTG CCTGAAGGAG
CGGCAGGTCG ACCGCGTCCG CGCCGAGCTC CTGGCGCATC TGCGCCAGGA GCGGATCGAG
CCGCCGGCGC GGGACCGGAT CCGCCGGATC ATCGGGACAG CGCTGCGGCA GGCCGAGCAG
ACGCAGACCT CCCGGATCTC CAGCCGGATC CCTGCAGAGG CCGTCCCGCA GCTGCTCGCC
CTGATCGCCA AGAGCACCGA TCCCGGTGAC GGACCCGAAG ACCAGGACGA GGACGACGGC
GCGTTGTTCG GCGCCGCCGA GGCCGCCGCG GTGGACGTGT TCGCGGTGAT CCGCGAGGAG
CCGGGCAACG TGAGCGTGAA GACGATCGAG CGGGAGGTGT TCAAGCTGAC CGCGATCGGC
AAGGTCGGCC TGCCGGACAA CCTGTTCGCC GACGTCGCTC CGAAGGTGCT GGCAGCCTGG
CGGGCTCGGG TCGCGGCCGA GGCGCCGTCG CACCTGCGCT CGCACCCGCA CGACGTCAAG
GTGACGTTGC TGGCCGCCTA CTTGTACTGC CGGTCCCGTG AGATCACCGA CACTCTGGTC
GACCTGCTGA TCGCGACGGT GCATCGGATC AACGCCCGCG CCGACACGAA GGTCATCAGT
GACTTCGTGG CGGAACTGAA GCGGGTGTCG GGCAAGGAGA ACATCCTGTT CAAGATGGCC
GAGGCCGCGC TGGAGTCGCC GCAGGCGCGG GTGGAGGACG TCATCTACCC GGCGGTGCCG
GGCGGCTACA AGACGCTGGT CCAGCTGCTG CACGAGTACA GGGCGAAGGG CACCTCCTAC
CGGCAGCACA AGCAGCGGGT GTTCAAGGCC TCCTACACCA ACCACTACCG GATCGGCCTG
ATCCAGATCC TGGAGGTGCT GGAGTTCGGG TCGACGAACA CCGTGCACAC GCCCATGGTG
CAGGCCCTCA CCTTGATCAA GCGGTACAAG GCCGAACAGT CCAACCGGAT CAAGTGCTAC
GCCCTCGGCG AGCACGTCCC CGTCGACGGG ATCGTCCCGG CCGAGCTGGT GGAGCTGATG
TACCGGGCCG ACAGCACCAA GCGGCAGCGG ATCCTGCGCA CCGTGTACGA GTGCGGCGTC
TTCCAGTCCC TGCGCGAGAA GCTGCGCTGC AAGGAGATCT GGGTCCACGG GGCGGACCGG
TGGCGCAACC CCGACGACGA CTTGCCCAAG GACTTCGAGG CCAATCGCAC CGAGAACTAT
GCCAAGCTCC GCAAGCCGCT GGACCCGCAG GTGTTCGTCG ATCAGCTGCG CGAGGAGATG
GACACCGAGC TGTCGGCGCT GAACGACGCC CTAGGTGGCA AGGGCCTCAC GTGGTTGAAG
ATCGCCGAAC GGCGCAACGC CGGGGCGATC CACCTCACCC CGCTGGACGC TGCCCCAGAG
CCGCGCAACC TGCGCCGACT GAAGGCCGCG ATCCGGGACC GGTGGGGCGT CGTCCCGCTG
ATGGACATGC TCACCGAGAC CGCCCTGCGG ACCGGCTGCT TGAACGTCTT CACCCCGGCC
GGAACGCAGA ACCACCTGGA CCCGGCCGTG TTGTTCGAGC GGCTCCTGCT GCTGATCTAC
GCCTACGGGA CGGGGACCGG GATCCGCGCG GTCGCCGCCG GCGACCACCC GCACACCGAG
GACGACCTGC GGTACGCGCG CCGCCGCTAC CTGACCGTCG AGGCCTGCCG CGAGGTCGCC
CGGGTCATCG CCAACGCGAC CTTCGCCGTC CGGCAGGCTG CCCTGTGGGG CGTGGGCACG
ACCGCGGTCG CCTCGGACTC CACCCACTTC TCAGCTTTCG ATCAGAACAT CTTCACCGAG
TGGCACTCCC GCTACCGGCG GGCCAAGCGC GGGGTGCTGA TCTACTGGAC GGTCGAGGTA
GGTGGGTCGA TGGCCGTGCA CAGCCAGCTC ATCAGCTGTT CAGCCTCCGA GGTCCACGCG
ATGGTCGAGG GCGCGATGCG GCACGGCACC GACATGGATA TCGAGCAGAA CTTCGTCGAC
TCCCACGGCG CCAGCTTCGT TGGGTTCGGG ATCACCAGGC TGCTGGACTT CGACCTGGTC
GCCCGGTTCA AGCAGATCAA CAAGATGAAG CTGTACGTGC CAGGCCGCGG CGAGGACTTC
TCCTACCCGC TCCTCAGCCC CGCGCTGACC AGGCCAATCC GGTGGGACAT CATCACCCAG
AACTACGACA TCATGATGAA GTACGCCACC GCGATCCGGC TGCGGACCGC GTCCACCGAG
GCCCTGTTGC GCCGATTCAC CAGCGAGACC ACCCACCCGG CCTACGCCGC GATGCTAGAG
GTCGGCCGCG CCCAGCGCAC GATCTTCCTC ACCCGCTGGC TCCGCGACCG CGATCTCCAG
CGGGAGACCG AATCCGGATT GAACGTGGTG GAGAATTACA ACGGCGTCAA CGACTACATC
AAGTTCGGCA AGCGCGGCGA ACTCGCCTCC AACCGGCGCG AAGAGCAGGA GCTGGGGATG
CTGTGCCTGC ACATCCTCCA GTCGGCCCTC GGCCTGATCA ACACCCTGAT GATCCAGGAC
ACCCTCGCGC TACCCGAGTG GGAGAACGTC CTGACCGACG CCGACCGGCG CGGCCTCACC
CCGCTGTTCC ACACCAACAT GACGCCCTAC GGCGAGATCC AGCTGCGCAC GGACCGGCGC
CTCGACCTCA CCGACCTGCC GACCGCGTAG
 
Protein sequence
MARVLDEDEL VGNWTLVGDE LDQLSGRRGV TKLGFALLLR FYALNGRFPT GRAELPDQAV 
AYVARLVDVP ASELGLYEWD GRTIKDHRKD IRKYFGFREC SLADSDKAAD WLAAQVCLKE
RQVDRVRAEL LAHLRQERIE PPARDRIRRI IGTALRQAEQ TQTSRISSRI PAEAVPQLLA
LIAKSTDPGD GPEDQDEDDG ALFGAAEAAA VDVFAVIREE PGNVSVKTIE REVFKLTAIG
KVGLPDNLFA DVAPKVLAAW RARVAAEAPS HLRSHPHDVK VTLLAAYLYC RSREITDTLV
DLLIATVHRI NARADTKVIS DFVAELKRVS GKENILFKMA EAALESPQAR VEDVIYPAVP
GGYKTLVQLL HEYRAKGTSY RQHKQRVFKA SYTNHYRIGL IQILEVLEFG STNTVHTPMV
QALTLIKRYK AEQSNRIKCY ALGEHVPVDG IVPAELVELM YRADSTKRQR ILRTVYECGV
FQSLREKLRC KEIWVHGADR WRNPDDDLPK DFEANRTENY AKLRKPLDPQ VFVDQLREEM
DTELSALNDA LGGKGLTWLK IAERRNAGAI HLTPLDAAPE PRNLRRLKAA IRDRWGVVPL
MDMLTETALR TGCLNVFTPA GTQNHLDPAV LFERLLLLIY AYGTGTGIRA VAAGDHPHTE
DDLRYARRRY LTVEACREVA RVIANATFAV RQAALWGVGT TAVASDSTHF SAFDQNIFTE
WHSRYRRAKR GVLIYWTVEV GGSMAVHSQL ISCSASEVHA MVEGAMRHGT DMDIEQNFVD
SHGASFVGFG ITRLLDFDLV ARFKQINKMK LYVPGRGEDF SYPLLSPALT RPIRWDIITQ
NYDIMMKYAT AIRLRTASTE ALLRRFTSET THPAYAAMLE VGRAQRTIFL TRWLRDRDLQ
RETESGLNVV ENYNGVNDYI KFGKRGELAS NRREEQELGM LCLHILQSAL GLINTLMIQD
TLALPEWENV LTDADRRGLT PLFHTNMTPY GEIQLRTDRR LDLTDLPTA
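
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few rows of the gene by hand. The sketch below is a minimal example, not the tool used to produce this record. It builds a codon table whose amino acid assignments match the standard code (translation table 11 uses the same assignments and differs in which codons may start translation), and it ignores alternative start codons, which do not matter here because the gene begins with ATG. The names `translate`, `first_row`, and `last_row` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last rows of the gene sequence, copied from the record.
# Every full row holds 60 bases (20 codons), so each row starts in frame.
first_row = "ATGGCGCGTG TGCTGGACGA GGACGAGCTC GTTGGGAACT GGACGCTGGT CGGGGATGAG"
last_row = "CTCGACCTCA CCGACCTGCC GACCGCGTAG"

print(translate(first_row))  # prints: MARVLDEDELVGNWTLVGDE
print(translate(last_row))   # prints: LDLTDLPTA
```

The first row yields the first 20 residues of the protein sequence, and the last row yields its final nine residues followed by the TAG stop codon.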