Gene Sros_1676 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1676 
Symbol 
ID8664953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1789041 
End bp1792196 
Gene Length3156 bp 
Protein Length1051 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003337410 
Protein GI271963214 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.665877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCTG GTTTTTACCT GCGATTCCCT CACATCTCCC GCGAGACGCT GACGTTCGTC 
GCAGACGACG ACGTCTGGGC GGCTCCCGCG GCCGGAGGAC GGGCCTGGCG TCTCTCCTCT
GACCGCGCGC CCGCCGGCCA CCCCCACCTG TCCCCGGACG CCACGAAGGT CGCCTGGACC
AGCGTCCGCG ACGGTGCCCC GGAGGTGTTC CTGGCCGACC TGGAGTCCGG CTCCGCCGAG
CGGCTGACCT ACTGGGGCGA CCGCCGGACC CGCACGCGCG GCTGGACGCC GGACGGCAGG
GTCCTCGCGA TCAGCGCGAC CGGCCAGCCG TTCGCCTCGC GGAGCTGGGC CTACGCCCTG
GGCGACGGCC GGGTCGAGCG GCTGCCGTAC GGCCCCGTCA CCGATCTGTC GGTCACCGGC
GACGCGGTGG CGCTGCTCAC GGCCTCCCTC GCCCGGGATC CGGGCACCTG GAAGCGCTAC
CGGGGCGGTA CGGCGGGGCG CCTCTGGGTC CAGCGGGCGG GTGGCGACTT CACCCGGCTC
CTGGCGGAGG TGAACGGCCA CTTCGCCAGC CCCATGCTCG TCGGCGGGCG CCTGGTCTTC
CTCTCCGACC ACGAGGGGGT CGGCAACCTC TACTCCTGCG CGCTGGACGG CACCGGTCTG
CGCCGGCACA CCGACCACGA TGTCTTCTAC GCCCGGCACG CCACCACCGA CGGGACCCGG
ATCGTCTACC AGAACGCCGG TGACCTCTAC CTGCTGGACG ACCTTGACGC CGAGACCCGC
AAGCTCGACG TCACGCTCGG CTCCCCGCTG CGGGGGCGGC AGCCGTACCA GGTCAACGCG
GGCCGCAGGC TGCACGATCT GGCCGTGGAC GCGACCGGGC GGGCCAGCGC GGTGGAGATC
CAGGGCTCGG TGCACTGGAT CACCCACCGG GACGGCCCGG CGCGGCAGCT CAGCACCGGC
CTGTCGGCGG GGAAACCCCG CATCCTCGGT ACGGACCGGG TGGTGTGGGT CTTCGACGAC
GGGGAGCGGC AGGGCCTGGA GATCGGGCCG GCCAGTGGCG GGGAGAGCCG CAGGATCGCC
GCGGGCAAGC TCGGCGAGGT CGGTGACCTG GCCGTCGCAC CCGACGGGAA CGCGATCGCG
GTGGCCGCCA GGGACGGGCG GCTGCTCCTG GTGGACGTGG CCTCCGGTGA GGTCACCGAG
CTCGCCACCG CCAACGGTGA GATCGAGGGA CTGGCCTGGT CCCCCGACTC CGCCTGGCTG
GCCTGGTCGC ACCCGGAGGC CACGCCGCTG CGCCGGATCC GGCTGGCCAG GGTGATGGAC
CGGGTCGTCA CCGACGTCAC CGACGGCCGG TTCGTGGACG TCTCGCCCGC ATTCGCCGGG
GACTATCTGG CGTTCCTGTC CCGGCGCGGC TTCGACCCGG TCTACGACGC GCACTCCTTC
GACATGTCCT TCCCGTTCGG CTACCGCCCC TACCTGGTAC CGCTGGCCGC GGCGACCCCG
TCGCCGTTCG CGCCCAGCTC CGACGGCCGG GCCGTCGGCG AACCCGATGA CGACGACAAG
AAGAAGCGCA AGGACGACGC CGCGATGACC GTCGACCTGG ACGGTATCGC CGCCCGGGTC
GTGCAGGTGC CGGTGCCCGA GGGGCGCTAC TCGGCGCTCC GCGCGGTCAA GGGCGGGTTC
GTCTGGCTGC GCGAGCCGCT GGCCGGGGAG CTCGGCGAGG ACCGCGACCA GGTGGGGGGC
GAGCCGCCCC AGCCCGCGCT GGACCGTTAC GACCTGGTCA AGCGCAAGTG CGAGGAAGTG
GTCGACAAGC TCGACCGGTA CTGGGTGAGC GGGGACGGCA CCCAGCTCGT CGTCCGTGAC
AAGGGCGCGC TCACCGTCCG CCCGGTGACC GGCAAGAACG GCGGGGACGA CGCGACGGTC
GACCTGTCGC GGATCCGCGT CACCGCCGAC CCGCAGGCCC AGCGGCGGCA CGCCTACAGC
CAGATGGGCC GCCGGATGCG GGCCGACTTC TGGGTCGAGG ACATGGCGGA CGTCGAGTGG
GACGCCGTGC TGGAGGAGTA CCGGCCGCTG GTGGAGCGCG TCGCGACCGC CGACGACTTC
GCCGACCTGC TCTGGGAGGT GCTCGGGGAG CTGGGCAGCT CGCACGCCTA CGTCGACGCC
GCCCCCTGGG GGCACCCCGC CTCCGACTCG GTCGGCCTGC TCGGCGCCGA CCTGTCCTGG
AACGGCGACG GCCGCTGGCA GGTGGACCGG GTGCTGCCCG GCGAGTCCTC CGACGTGCAC
GCCCGCTCAC CGCTGGCCGT GCCCGGAGCG GGGATCCAGC AGGGGGAGAC GCTGCTGGCG
GTCGACGGCC GCCCGGTGCC GCCGCAGGGC CCCGCCTCGC TGCTGGTCGG CGCGGCGGAC
AAGGCGGTCG AGCTGACCCT GGGCGGCGGG CGGAGGGTGG TCGTCACGCC GCTCCGCGAC
GATCGGCGCC TGCGCTACCA GGACTGGGTG GCCGGGCGCC GGGCGCATGT CCGAGGTCTC
GGTGACGGCC GTGTCGGCTA CCTGCACATC CCGGACATGG TCGCCGAGGG GTGGGCGCAG
TTCCACCGCG ACCTGCGCAG GGAGATGACC TTCGAGGCGC TCGTCGTGGA CGTCCGGGGC
AACCGGGGCG GCCACACCTC CGAACTCGTC ATCGAGAAGC TGATCCGCCG GGTCATCGCC
TGGGACCTGC CCCGGGGCAT GACGCCGATC ACCTACCCCG AGGACGCTCC GCGCGGTCCG
CTGGTCGCGG TCACCGACCA GGACGCCGGA TCCGACGGCG ACATCGTCAC CGCCGCTTTC
AAGATCCACA AGCTCGGGCC GGTGGTCGGC ACCCGCACCT GGGGTGGCGT CATCGGCATC
GAGGGGGGTC ACCGCCTGGT CGACGGGTCG CACATCACGG TGCCCCGGTA CTCGTTCTGG
TTCGAGGGGC TCGGCTGGGG CGTGGAGAAC TACGGCGTCG ACCCGGACGT CGAGGTGGAC
ATCACCCCCG ATGACTGGGC GGCCGGCCGG GACCCGCAGC TTGAGGAGGC GGTACGGCTC
GCGCTGGCCG CCCTCGACGA GCGGCCCGCG GCGTCCCCGC CCGACCCCGC CACCCGCCCC
TCCCGCCGCC GCCCGGCGCT GCCGCCCCGC CCGTAG
 
Protein sequence
MASGFYLRFP HISRETLTFV ADDDVWAAPA AGGRAWRLSS DRAPAGHPHL SPDATKVAWT 
SVRDGAPEVF LADLESGSAE RLTYWGDRRT RTRGWTPDGR VLAISATGQP FASRSWAYAL
GDGRVERLPY GPVTDLSVTG DAVALLTASL ARDPGTWKRY RGGTAGRLWV QRAGGDFTRL
LAEVNGHFAS PMLVGGRLVF LSDHEGVGNL YSCALDGTGL RRHTDHDVFY ARHATTDGTR
IVYQNAGDLY LLDDLDAETR KLDVTLGSPL RGRQPYQVNA GRRLHDLAVD ATGRASAVEI
QGSVHWITHR DGPARQLSTG LSAGKPRILG TDRVVWVFDD GERQGLEIGP ASGGESRRIA
AGKLGEVGDL AVAPDGNAIA VAARDGRLLL VDVASGEVTE LATANGEIEG LAWSPDSAWL
AWSHPEATPL RRIRLARVMD RVVTDVTDGR FVDVSPAFAG DYLAFLSRRG FDPVYDAHSF
DMSFPFGYRP YLVPLAAATP SPFAPSSDGR AVGEPDDDDK KKRKDDAAMT VDLDGIAARV
VQVPVPEGRY SALRAVKGGF VWLREPLAGE LGEDRDQVGG EPPQPALDRY DLVKRKCEEV
VDKLDRYWVS GDGTQLVVRD KGALTVRPVT GKNGGDDATV DLSRIRVTAD PQAQRRHAYS
QMGRRMRADF WVEDMADVEW DAVLEEYRPL VERVATADDF ADLLWEVLGE LGSSHAYVDA
APWGHPASDS VGLLGADLSW NGDGRWQVDR VLPGESSDVH ARSPLAVPGA GIQQGETLLA
VDGRPVPPQG PASLLVGAAD KAVELTLGGG RRVVVTPLRD DRRLRYQDWV AGRRAHVRGL
GDGRVGYLHI PDMVAEGWAQ FHRDLRREMT FEALVVDVRG NRGGHTSELV IEKLIRRVIA
WDLPRGMTPI TYPEDAPRGP LVAVTDQDAG SDGDIVTAAF KIHKLGPVVG TRTWGGVIGI
EGGHRLVDGS HITVPRYSFW FEGLGWGVEN YGVDPDVEVD ITPDDWAAGR DPQLEEAVRL
ALAALDERPA ASPPDPATRP SRRRPALPPR P