Gene Sros_1203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1203 
Symbol 
ID8664478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1228479 
End bp1230329 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content70% 
IMG OID 
Productcell wall biogenesis glycosyltransferase-like protein 
Protein accessionYP_003336944 
Protein GI271962748 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.811759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACGAC TCAGCGTGGT TGTTCCGTAC TACAACGTCG AACGGTATTT CGACGCCTGC 
CTGGCCTCGA TCCAGGGCCA GACGCTCGGC GACCTTGAGG TCATCTGCGT GGATGACGGA
TCGATGGACG CGAGCGCGGT CATCGCCAAG GACTACGCGT CACGAGACCC GCGCTTCCGG
GTGGTGGTCC AGGAGAACCA GGGCCTCGGC CCGGCGAGGA ACACCGGTGT GACGCACGCG
TCGGGGCACT ACCTGGCCTT CGCCGACAGC GACGACATCG TCCCCCCGCG CGCCTACGAG
CTCCTGGTGG GCTCCCTGGA GCGGAGCGGG TCCGACATCG CGTCGGGCAA CGTGCAGCGG
CTCACTTCCG AGGGGCTGGT CCAGTCCTGG GCGCACCGCA ACGCCTTCAG GAAGACCCAG
GTGGGCACGC ACATCACCCG GAACGTGCAC CTCCTGTTCG ACCGGTCGGT GTGGAACAAG
GTCTTCCGGC GCTCGTTCTG GGACGAGCTG GGCATGGAGT TCCCCGCCCG GCTGTACGAG
GACATGCCGG TGACCGTCCC CGCCCACGTG CGCGCGCGGG CCGTGGACGT GCTCTCCGAG
GTCGTCTACA TCTGGCGCCT GCGCGAGGGG TCCATCACCG AGCGCCGCTT CCGCGCGGAG
AACGTCACCG ACCTCGTGAT CTCCGTCGCG GAGACCGCGC GCTTCCTCGA AGAGCACGCC
CCCGGGCTGC GCCGTGTCTA CGAGCGGGAC ACGCTCCACA ACGACCTGCG GGTGGCGGTC
GAGGCGCTGG CGGCGGGCGT CGAGCCCGAT CTCCTGCTGG ACGCCGCCTG CTCCTACCTC
GACACGGTGA AGCGGAGGTC GTATCTGGAG CTGCCCGCGA TCCGCAGGCT GCAGCTCCAG
CTGATGCACC GGCGCCTGGT CACGGAGCTG GCCGAGGCGG TCCGGTTCGA GCGGGAGAGC
ATGGACGAGG CCCGGCTGCG GCGGGGCCTG CTGTCGCGGC GCAAGTGGTA TGCCGAATAT
CCCTTCCGGC GCGGCTACGG GATCCCCCGC TGGGTCTTCG ACGTCTCGCG AGAGCTCACG
ATGGTCGGCC GGCTGGAAGG CTGGGAGTGG CGCGCGGGCC GGCTGCACGG CGAGGGCGGG
GTGCGGCTGC CGGGTGTCAT GGTCGGGGCG GCTCCGTCCT GCGCGATCAG CCTGTGGCTA
CGCGAGCGCG CGAGCGGCAC GGTGCTGGAG CTGCCCGTGG AGCGCTACGA CCTGGGGTTC
GGCTTCGTCT GCGATCCCGG CACGCTTCCC GTACGGGCCT CCAAGTGGGA GGTGTGGGTC
AGGCTCACCG TGGACGGCAT CGTCCGGGAG GCGCGGCTGA GGGGGAACAC TCCGGCGCTC
GGTCCCGGTG AGACGGACGA CGGGATGTGG ATCCAGCCGC TCGTGGACGA CAAGGGGTTC
GCCGTCCTCA CCGTCAAGCG CCCGCGGGCG GCGGTGACCG GGTGCGAGGC GTCCGAGGGC
CGGCTGCGGA TCCTCGGGTG GCACGCCGAG CAGGAGAGCG CCGCGGGCGT CACCCTGGTG
ATGGGCGTCC TGGACGGTCC CGAGCGTTCC TACCCCGTCA CGACCGCCCC GGCGCGGGAG
GACCGGCGGG AGTTCGCCGT GGACGTTCCC GTCGAGGACC TGCGGCCCGA GGAGGAGACG
GCCGTCTGGA ACATGTACCT GTCCGGGATG GCCGACGGCA GGCCGCTGCG GATGCCCGTG
GCCCTCCACG AGCTCCCCGA GTGGGAGCTG GACGGGTGGG AGATGCAGAT GGCGCGCACA
TCGCGCGGAA ACATGGCGCT GAGAGTCCAA CGGAGTGAAG ATGATCATTA A
 
Protein sequence
MPRLSVVVPY YNVERYFDAC LASIQGQTLG DLEVICVDDG SMDASAVIAK DYASRDPRFR 
VVVQENQGLG PARNTGVTHA SGHYLAFADS DDIVPPRAYE LLVGSLERSG SDIASGNVQR
LTSEGLVQSW AHRNAFRKTQ VGTHITRNVH LLFDRSVWNK VFRRSFWDEL GMEFPARLYE
DMPVTVPAHV RARAVDVLSE VVYIWRLREG SITERRFRAE NVTDLVISVA ETARFLEEHA
PGLRRVYERD TLHNDLRVAV EALAAGVEPD LLLDAACSYL DTVKRRSYLE LPAIRRLQLQ
LMHRRLVTEL AEAVRFERES MDEARLRRGL LSRRKWYAEY PFRRGYGIPR WVFDVSRELT
MVGRLEGWEW RAGRLHGEGG VRLPGVMVGA APSCAISLWL RERASGTVLE LPVERYDLGF
GFVCDPGTLP VRASKWEVWV RLTVDGIVRE ARLRGNTPAL GPGETDDGMW IQPLVDDKGF
AVLTVKRPRA AVTGCEASEG RLRILGWHAE QESAAGVTLV MGVLDGPERS YPVTTAPARE
DRREFAVDVP VEDLRPEEET AVWNMYLSGM ADGRPLRMPV ALHELPEWEL DGWEMQMART
SRGNMALRVQ RSEDDH