Gene Sros_1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1471 
Symbol 
ID8664747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1548957 
End bp1552994 
Gene Length4038 bp 
Protein Length1345 aa 
Translation table11 
GC content75% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003337207 
Protein GI271963011 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.984073 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCTGG GCGCGATCGC GTTCAACACC GCGCCCGACA GGATCATCGC CGAGACCAAG 
CTCGACATGG CGCTCAACCC CGTCGGCTTC CTGGCCAGGG CGGCGCACCT GTGGGACGGG
ACGTTCTTCG GTCACCTGCA GAACCAGGCC CACGGCTACC TGTTCCCGAT GGGCCCCTTC
TACAGCCTCT GGCTCGGCCT GGACATGCCC GCGTGGAACG TGCAGCGGCT GTGGATGGCA
TCCGTCCTGA TCGCGGCCTT CCTCGGCGTC GTCCGCGTGG CCCGGGAGCT GGGCATCGGC
TCGCCGAGGG CGGGCATACT CGCCGGGCTG GCCTACGCGC TCGCCCCCCA CGCCCAGGCG
CTGATCGGGG TCAACTCCTC GGAGTTCCTG CCCAGCGCGA TCCTCCCCTG GATCCTGCTG
CCCCTGATCA GGGGCGCCCG CGGGGACACC ACCCCGCGCC GCGCCGCCGC GTGCTCGGCC
GCCGCGCTGC TGCTGTGCGG CGGGGTGAAC GCGGCGGCCG AGCTGGCCGT CCTGGTCGTC
CCGCTCCTCT ACCTGCTCAC CAGGCGGGCC GGTCCCCTCA AGCGCAGGCT GATCGCCTGG
TGGCTCCCGC TGGTCGGGAT GGCCGCCTTC TGGTGGCTGG CCCCGCTGCT CGTCATGGGC
AGCCACATCT TCTCGTTCCT GCCCTTCATC GAGACGGCCG CCGCGACCAG CCAGGTGACC
TCGCTCACCA ACGTCGTCCG GGGGACGTCG AGCTGGCTCA GCTTCCTGCA GACCGACGGC
AGGCCCTGGG TGGAGGCGGC CTTCGCCCAG GCGACCCGGC CATGGCTGAT CGTGGTGACC
TCGCTGGTGG CCGCGCTCGG GCTGGCCGGG CTGGCGCGCC GGGGCACCCC GGAGCGGTTC
TTCCTGCTGC TCACCCTGCT GGTCGGCGTG GCGGTGGTCT CGGCCGGCCA CGCCGGCCCG
ATGGCCGCGC TGACCAGGGA GGCGCTCGAC GGCCCGCTCG CCGCCTTCCG CAACCTGCAC
AAGTTCGACG CGCTGATCCG CCTGCCCGTC GCGCTGGGAC TCGCCGCCCT CGTCGCCGCC
CCCGTACGGG CCCGGCTGCC GTTGCGCGCC GGCGTCGCCG TCCTCGCCGG GCTCAGCTTC
GCCCCGGTCG CGCTGTCGGG GGTGACCACC GCGGGCTCGT TCTGGCAGGT GCCCACCTAC
TGGCGTGAGG CCGTGGGCTG GCTCAACGCC AACGCGACGG ATCAGATGGT GCTGGTCGTG
CCGGGGTCAC GGCGCGGCGA GTACGACTGG GGCCGGCCCA TCGACGAGCC GCTGCAGCCG
CTGCTCGACG GCGTCCGCTG GGCGGCGCAC ACGAACGTCC CGTGGGGCTC GCCGGGCATC
GCCCGGCTCA TGCAGGCCCT GGACGAGCGG TTCGGCGCGG GCAAGGGGTC GCTCGGGCTC
GGCTCGACGC TGCGCCGCAT GGGGATCGGC TACGTCCTGG TCCGCAACGA CCTCGACCGG
GCCACGATCG GCGACGCCTG GCCCTCCAGG GTCCACGAGG CGCTGGAGGA GACGCCGGGC
CTGTCCAGGG TGCGCGGTTT CGGCCCCCAG GTCGGCCAGG AGCAGAGCAC CATCGCGGCG
ACCTGGCTCG ACCAGCCGTA CGACGCCCTG GAGGTCTACC GGGTCGGCGG GACGCCCATG
CGGGCCGGGA CGCTGCCCGC CGGCCGCACG ACGAGGGTCA CCGGTGCGCC GGAGGCCGTG
CTGACCCTCG CCGAGCAGGG GCTGCTGGAC GACGACCGGC CGGTGGTGGT CGGTGACGAC
GCGGCGGCCT ACGAGATACC GGCCGGGGAC ACGATCGTCA CCGACACGCT CCGCCGCCGG
GAGAACGTCT TCAGCGACCT GCGGCGCTCG GCCTCGGCCA CCCTCACGGA GTCGGAGAAC
CCCCGCAGAG CGGCGGCGGC GGCCGACCTG ACCGACCCCG CGTGGGACCG GTACACCTCG
ACGGCCGAAT ACACCGGCAT CGCCGGGCTG AGCGCCTCGT CCGCGGAGAG TTCGGCCGGC
GCGCTGCCGG GGACCCGCGA CCCGGGACGC CAGCCGTTCG CGGCGGTCGA CGGGGACGCC
CGGACGAGCT GGCGCTCCGA CGGCTGGCGG GGCCCCGTCG GCGAGTGGTG GGAGATGCGC
TTCACCGGGC CGCTCACGCT GCCGCACGTC ACCGCGCGGT TCGAGCGCTC CGCCATCGGC
CCGCCGGTCA CCGAGGTCGC GGTGGAGACC GACTCCGGCA CGGTGACCTC GGCGGTCACC
TCGGCCGACA CGCAGCGACT CCCGGTGCCG GAGGGGCCCA CCTCCCGGAT CCGGATCCGG
GTCACGAAGG TCGGCGGCGT GCAGGCGGGG CCGCTCGGCG GCCGCGTGGG ACTGTCCGAA
ATTACGGTGC CGGGGGTACG GCCCGGCCGT ACCATCTCGG TGCCGGGCGT CGGAGACGGC
CGCGCCGTGA CCACGGTGCT GACCGGCACC GCGGACCTCT CGCCCTGCGC GCGCGGATCC
TTCGCCTGGG CCTGCGACGA TCGCCTGGAG GTCCAGGGAG AGGACGGGTA CGGCTTCGAC
CGCAAGGTGG CGACTCCGGA GACCGGCGAA TGGGAGATCA GCGGCCGGGC CCTGCTCACC
GACCCGGCGA CGGCCGAGCG GCTGGTGACG CTGCCCGAGT CCTATCCCAA GGTGAGCGCC
TCGTCCACGG CCGTCGACCA TCCGGCCGTC CTCGGCCGCG CCGCCTTCGA CGGCGACGAC
CGCACGATCT GGTACGCCGA TCCCCTCGAC CGGAGACCGG CGCTCACCGT CGACCTGGGA
CGGGAGAGGG CGATCTCCCG GATCAAGCTC CGCTTCCCCG ACTCCTACCT CGGCCTGCCG
CCGGTCCGTG TGACCGTCAG GACCGGCTCC GGCGCCGTCA GGTCCGGCTG GCCGGGCTCC
GACGGCTGGC TGCGGATCGC CGAGGTCCGG GCCAGGAGCC TCAAGCTCGA ATTCGAGACG
ACGGTCTCGC GGCCGCTCGA ACTCGTCGAG GTCTCCATCC CCGGCGTCCC CGCCGTCGCC
GACCTGGACA CCTTCCCGCT CAAGCTGCCC TGCGGGTCCG GGCCCGCGCT GACCGTCGGC
GGCAACGGCG TGCCCACCGA GATCGTCGAG GGCACGCTCG GTGACGTGCT CAACGGCCGG
GAGGTCTCCT ACCGCGCCTG CGAGCCCGTC GCGGTGGGGC CGGGCGGCGC CCGCGTCACC
GCCGGGCACG AGGACCCGTT CCGCGTGCGG TCGGTGGTGA TCAACGGCGA CCGGGCCGGA
GCGGCGCCGG TCACCATGGC GCCGGTCGAC GTCGCCCGCT GGGAGGCAGG CACCCGGCAG
GTCCGGGTCG CCACCTCCGA GGTCTCCTAC CTCGTGGTGA ACGAGAACTT CAACGACGGC
TGGCGGGCCA CCGCCGGAGG GCGGGAGCTC ATCCCGGTCC GTCTCGACGG TTGGCGGCAG
GCGTGGCTGC TGCCCGCCCG CCTCTCCGGC ACGGTGACCA TGCACTACAC CCCGGACGAC
GCCTATCGCA CCGCGCTGCT GGCCGGCGCG GGTCTCGCCC TGGCCGTGGC GGCCCTGGCG
CTCTGGCCGG CGCGGCGCCG GATCCGCTGG GCCCCCGCCC GTCCGGCCGC TCCGCGTTCG
GCCCTGCTGG TCGCGCCGGT GCTCGGATTC TGGGCCGACG GCCTGACCGG CGTCGCCGTG
GTCCTGCCGG CCTTGCTCCT CATGGTCTGG GCCGAGCGGG TGGCCAGGGC CAGATACATC
GCGGAGGGAG GCGTCAAGGG GACCGTCACC GTCATCCGGT CCCCCTGGGC GCCCGCGGTG
CTCGCCGCCG CCGCCGGTCT CGCGCTGGCC GCCGGCCTTC CGGAGCAGGT CGGCCAGTCG
GCCACGCTCG CGGCGCTCGG CCTGCTGGCC TCGGGCTTGC CCGCGGCGGC ACCCCGGCCC
GACCTGAAGG AGCTCACACC TCCGCGAGCC GCGACCGCAT CCGATTCCGG ATCGACCCCG
GAGTATTCAT GGACCTGA
 
Protein sequence
MLLGAIAFNT APDRIIAETK LDMALNPVGF LARAAHLWDG TFFGHLQNQA HGYLFPMGPF 
YSLWLGLDMP AWNVQRLWMA SVLIAAFLGV VRVARELGIG SPRAGILAGL AYALAPHAQA
LIGVNSSEFL PSAILPWILL PLIRGARGDT TPRRAAACSA AALLLCGGVN AAAELAVLVV
PLLYLLTRRA GPLKRRLIAW WLPLVGMAAF WWLAPLLVMG SHIFSFLPFI ETAAATSQVT
SLTNVVRGTS SWLSFLQTDG RPWVEAAFAQ ATRPWLIVVT SLVAALGLAG LARRGTPERF
FLLLTLLVGV AVVSAGHAGP MAALTREALD GPLAAFRNLH KFDALIRLPV ALGLAALVAA
PVRARLPLRA GVAVLAGLSF APVALSGVTT AGSFWQVPTY WREAVGWLNA NATDQMVLVV
PGSRRGEYDW GRPIDEPLQP LLDGVRWAAH TNVPWGSPGI ARLMQALDER FGAGKGSLGL
GSTLRRMGIG YVLVRNDLDR ATIGDAWPSR VHEALEETPG LSRVRGFGPQ VGQEQSTIAA
TWLDQPYDAL EVYRVGGTPM RAGTLPAGRT TRVTGAPEAV LTLAEQGLLD DDRPVVVGDD
AAAYEIPAGD TIVTDTLRRR ENVFSDLRRS ASATLTESEN PRRAAAAADL TDPAWDRYTS
TAEYTGIAGL SASSAESSAG ALPGTRDPGR QPFAAVDGDA RTSWRSDGWR GPVGEWWEMR
FTGPLTLPHV TARFERSAIG PPVTEVAVET DSGTVTSAVT SADTQRLPVP EGPTSRIRIR
VTKVGGVQAG PLGGRVGLSE ITVPGVRPGR TISVPGVGDG RAVTTVLTGT ADLSPCARGS
FAWACDDRLE VQGEDGYGFD RKVATPETGE WEISGRALLT DPATAERLVT LPESYPKVSA
SSTAVDHPAV LGRAAFDGDD RTIWYADPLD RRPALTVDLG RERAISRIKL RFPDSYLGLP
PVRVTVRTGS GAVRSGWPGS DGWLRIAEVR ARSLKLEFET TVSRPLELVE VSIPGVPAVA
DLDTFPLKLP CGSGPALTVG GNGVPTEIVE GTLGDVLNGR EVSYRACEPV AVGPGGARVT
AGHEDPFRVR SVVINGDRAG AAPVTMAPVD VARWEAGTRQ VRVATSEVSY LVVNENFNDG
WRATAGGREL IPVRLDGWRQ AWLLPARLSG TVTMHYTPDD AYRTALLAGA GLALAVAALA
LWPARRRIRW APARPAAPRS ALLVAPVLGF WADGLTGVAV VLPALLLMVW AERVARARYI
AEGGVKGTVT VIRSPWAPAV LAAAAGLALA AGLPEQVGQS ATLAALGLLA SGLPAAAPRP
DLKELTPPRA ATASDSGSTP EYSWT