Gene Sros_2633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2633 
Symbol 
ID8665919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2868880 
End bp2871297 
Gene Length2418 bp 
Protein Length805 aa 
Translation table11 
GC content75% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003338345 
Protein GI271964149 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.732413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACATCG AGAGCCTGAT CGGGCGACTT GACCTGAGCG CCAAGGTGCG CCTGCTGACC 
GGCGCGGACA TGTGGTCGCT GCCCGCCCTC CCCGAGATCG GGCTCAGGCG GCTGGTGATG
AGCGACGGCC CGATCGGCGT ACGCGGGGAG CAGTGGTCCT CCGCCGATCC CTCGATCGCG
CTGCCCAGTC CCACCGCGCT GGCCGCGACC TGGGACGTGG CGCTGGTCCG CCAGGCCGGG
CGGCTGCTCG CCCAGGAGGC GCGCCGCAAG GGCGTGCACG TGCTGCTCGC CCCGACCCTC
AACCTGCACC GCAGCCCGCT CGGCGGCCGG CACTTCGAGT GCTTCTCCGA GGACCCGTAT
CTGACCGGCG AGCTCGGCGC CGCCTACGTG GAGGGGGTCC AGGAGGGCGG GGTGGGCACC
ACTCCCAAGC ACTTCGTGGC CAACGACTTC GAGACCGACC GGTTCACGGT GAACGTGAGG
GTCGGCGAGA AGGCGCTCCG CGAGGTCTAC CTAGCGCCGT TCGAGCGGGT CGTCCAGGCC
GGCGCCTGGG GCATGATGGC CGCCTACAAC TCGGTCAACG GGACCACGAT GACCGAGCAC
CGGGCACTCC AGCGGGACCT GCTGAAGGAC GAGTGGGGCT TCGACGGCTG CGTCGTCTCC
GACTGGACCG CCGCCCGCTC GACGGCGGCG ACGGCCGGGG GCGGGCTCGA CGTCGCCATG
CCGGGGCCGT CCGGCCCCTG GGGCGAGAAG CTGGAGGCGG CGGTCCGCGA GGGCCGGGTC
GCCGAGGAGG TCGTCGACGA CCAGGTCCGC CGCGTCCTCC GCCTCGCCCG GCGCGTCGGC
GCCCTGGAGG ACTCCCCCGG CCCCGGCCCC GCCCCGGCTG CCCCCTCTCC CACCGCCGGT
TCCGTCCCGG CCGCCGGCGG CACGGAGATC GACGGTACGG CGCTCGCCCG CCAGGTGGCC
GCCCGGTCGT TCACGTTGCT GCGCAACGAG GGCGGCCTGC TGCCTCTCGG CGCTGTCCGG
AGTGTGGCGC TGATCGGGTC GGCGGCCGGT GAGGCCCGGG TCATGGGCGG GGGCAGCGCC
CAGGTGTTCC CCGCCCGGGT CGTCTCGCCC CTGGAGGGCC TGCGCGGGCG CGGGGACGTC
GAGGTCCGCC ACGCGATCGG CACCGACCCC CGGGTACGGC TCGCACCCCT GGCCACCCCC
GCCCGCGCCC TCTTCCTGGA CGCCACGGGC GAGACCCTGG CCGAGCACCC GCTGGCCGGC
ACAGAGGCCC GCTGGGTCGG GTCGCTCCCC CCGGACGTGG ACCCCGCCCG CCTGGCCGCC
GTCGAGATCC GCACCGTCTA CACCCCCGGG GCGAGCGGCC CGCACGAGTT CTCGATCGCC
GGCGTCGGCG CGTTCACCGT GAAAATCGAC GGGGACACCG TCTTCGACGG GACGATCACG
GCCGAGGGCG GCGACCCGGC CGCGGCGTTC CTCTCCCCTC CGGAGCGCCG GATCACCGCC
ACGCTCGCCA CGGGCACTCC CGTCGAGCTG AGCGTCAGGC ACCCCGCGGG AGCGTTCGGC
GGCATGGCCT TCGTGTCGTT CACCCTGGGC CACGCCGACC CCTCCCCCGG CGACGACGCG
CTCATCGCCG AGGCGGTGGG CGTCGCCGCG GGCGCCGACG TGGCGGTCGT GGTCGCCTCG
ACCACCCCGG AAGTGGAGAG CGAGGGCTTC GACCGCATCG GCCTGGCCCT GCCCGGCCGC
CAGGACGAGC TGATCGCCAG GGTCGCCGGA GTCAACCCGC GCACGGTCGT GGTGGTCAAC
GCCGGATCAC CGGTGGAGAT GCCGTGGCTG GAGCAGGTCG CCGCGGTCCT GCTGACCTGG
TTCCCGGGCC AGGAGGCCGG TGACGCGCTG GCCGACGTGC TGTTCGGCGA CGCCGAGCCC
GGCGGCCGGC TGCCCACCAC CTGGCCGGTC CGCCAGTCCG ACGTGCCCGT CCTGAACGTG
ACCCCGGCCG GCGGGGAGGT CGCCTACGAC GAGGGCCTTT TCATCGGATA CCGCGCCTGG
GAGCGCGCCG GCCGCGCGCC GGCGTTCTGG TTCGGCCACG GGCTGGGCTA CACGACCTGG
TCCTACGACA CGATCGCCGT CACCGGCACG ACTGTGACGG TGGCCGTGAC CAACACCGGC
CGGCGTGCCG GACGTGAGGT CGTCCAGTTC TACCTCGCCG CGGTGAACCG GGACCCCGAC
CGGCCCGCCC GCCGGCTGGC CGCCTTCGAC GTCGTCGACG CCGGCCCCGG CCAGACCGTG
GTCACCACGG TCCACCTGCC CGAGCGCGCC TTCCAGATCT GGACGGAGGA GGGCTGGCGG
ACCGTCTCCG GCGACTACAC CGTCGAGGCG GGCCGCAGCG TCGCCGACCG GCGCCTGGCG
GCGACCGTGA CGCTGTGA
 
Protein sequence
MDIESLIGRL DLSAKVRLLT GADMWSLPAL PEIGLRRLVM SDGPIGVRGE QWSSADPSIA 
LPSPTALAAT WDVALVRQAG RLLAQEARRK GVHVLLAPTL NLHRSPLGGR HFECFSEDPY
LTGELGAAYV EGVQEGGVGT TPKHFVANDF ETDRFTVNVR VGEKALREVY LAPFERVVQA
GAWGMMAAYN SVNGTTMTEH RALQRDLLKD EWGFDGCVVS DWTAARSTAA TAGGGLDVAM
PGPSGPWGEK LEAAVREGRV AEEVVDDQVR RVLRLARRVG ALEDSPGPGP APAAPSPTAG
SVPAAGGTEI DGTALARQVA ARSFTLLRNE GGLLPLGAVR SVALIGSAAG EARVMGGGSA
QVFPARVVSP LEGLRGRGDV EVRHAIGTDP RVRLAPLATP ARALFLDATG ETLAEHPLAG
TEARWVGSLP PDVDPARLAA VEIRTVYTPG ASGPHEFSIA GVGAFTVKID GDTVFDGTIT
AEGGDPAAAF LSPPERRITA TLATGTPVEL SVRHPAGAFG GMAFVSFTLG HADPSPGDDA
LIAEAVGVAA GADVAVVVAS TTPEVESEGF DRIGLALPGR QDELIARVAG VNPRTVVVVN
AGSPVEMPWL EQVAAVLLTW FPGQEAGDAL ADVLFGDAEP GGRLPTTWPV RQSDVPVLNV
TPAGGEVAYD EGLFIGYRAW ERAGRAPAFW FGHGLGYTTW SYDTIAVTGT TVTVAVTNTG
RRAGREVVQF YLAAVNRDPD RPARRLAAFD VVDAGPGQTV VTTVHLPERA FQIWTEEGWR
TVSGDYTVEA GRSVADRRLA ATVTL