Gene Sros_5179 details


Gene Information

Locus tag: Sros_5179
Symbol:
ID: 8668473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5693055
End bp: 5694296
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: glycosyl transferase
Protein accession: YP_003340698
Protein GI: 271966502
COG category:
COG ID:
TIGRFAM ID:
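
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 5693055
end_bp = 5694296
listed_gene_length = 1242
listed_protein_length = 413

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon that is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length == listed_gene_length)
print(protein_length == listed_protein_length)
```

Under these assumptions both comparisons agree with the listed values of 1242 bp and 413 aa.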


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones: 10
Fosmid unclonability p-value: 0.000882934
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCGTGTGT TGTTGTCGAC GTATGGGTCG CGTGGGGACG TCGAACCGCT GGTGGGACTC 
GCGGTGCAAT TGCGGGCGCT CGGCGCGGAG GTGCGGGTGT GCGCTCCGCC GGACGAGGAC
TTCGCGGAGC GGCTGGCCGG TGTCGGCGTG CCGCTGATGC CGGTCGGCCA GTCGGCGCGC
GCGCTGACGA CCGCGGCGCC GCCGCCGTCG CCGGCGAACC TGCCCCAGCG CGCGGCCGAG
TTGATCGCCG GCCAGTTCGA CGTGGTCACC GCGGCGGCCG AGGGGTGTGA CGTGCTGGTG
GCGACCGGCG CGATGCCGGC CGCGGCCGGC GCGCGGTCGG TGGCCGAGAA ACTGGGCATC
CGCTCCGTGT CCGTGACCTT CCAGCAGCTC ACCCTGCCGT CGCCGCACCA CCCGCCGCTG
GCGTATCCGG GCCGGCCGTT CCCGCCGGAC GTGACCGACA ACCGGGTGCT GTGGGACCTG
GACGCCCAGA GCATCAACGC GCTGTTCGGT GCGGCGCTCA ACACGAACCG GGCGTCGATC
GGCCTGCCCC CGGTGGACAA CGTCCGCGAC TACGTCATCG GCGACCGGCC GTGGCTGGCG
ACGGACCCGA CCCTGGACCC GTGGCAGGAG CCGGCGGACC TCGACGTCGT GCAGACCGGC
GCGTGGATCC TGCCCGACGT TCGCCCACTC CCGGCCGAGC TGACGGCGTT CCTGGACGCC
GGCACACCAC CGGTGTACGT GGGCTTCGGC AGCATGCCCA TGAGCGCCTC GACGGACGCC
GCCCGGGTGG CCATCGAGGC GGTCCGCGCG CAGGGCCGCC GCGCGCTCGT CGGGCGCGGC
TGGGCCGACC TGGCCCTGAT CGACGACCGG GACGACTGCT TCACCGTCGG CGAGGTCAAC
CAGCAGGCGC TGTTCGGCCG GGTGGCCGCC GTCGTGCACC ACGGCGGCGC GGGCACGACG
ACGACGGCCG CCCGGGCCGG CGCTCCTCAG GTGGTGGTAC CCCAGGTGGC GGACCAGCCG
TACTGGGCCG GACGGGTGGC CGGCCTGGGC ATCGGCGCGG CACACGACGG TCCGGCTCCG
ACCTTCGAGT CCCTGTCAGC CGCGCTCAGG ACCTCCCTGG CCCCCGAGAC CCGCGCGCGA
GCGGCCGCCG TGGCCGGCAC GGTCCGCACC GACGGGGCGA CGGTGGCCGC GAAGCTGCTG
CTCGACGCGG TCAGCCGGGA GAGGCCGCCC GGGTCCGCGT GA
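
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as a string in the 10-base blocks shown above; `gc_content` is a hypothetical helper name, not a function of any database tool.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to format the sequence in blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example on the first 10-base block of the gene sequence only.
first_block = "ATGCGTGTGT"
print(gc_content(first_block))
```

Passing the full gene sequence instead of `first_block` gives the value to compare against the GC content field of the record.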
 
Protein sequence
MRVLLSTYGS RGDVEPLVGL AVQLRALGAE VRVCAPPDED FAERLAGVGV PLMPVGQSAR 
ALTTAAPPPS PANLPQRAAE LIAGQFDVVT AAAEGCDVLV ATGAMPAAAG ARSVAEKLGI
RSVSVTFQQL TLPSPHHPPL AYPGRPFPPD VTDNRVLWDL DAQSINALFG AALNTNRASI
GLPPVDNVRD YVIGDRPWLA TDPTLDPWQE PADLDVVQTG AWILPDVRPL PAELTAFLDA
GTPPVYVGFG SMPMSASTDA ARVAIEAVRA QGRRALVGRG WADLALIDDR DDCFTVGEVN
QQALFGRVAA VVHHGGAGTT TTAARAGAPQ VVVPQVADQP YWAGRVAGLG IGAAHDGPAP
TFESLSAALR TSLAPETRAR AAAVAGTVRT DGATVAAKLL LDAVSRERPP GSA
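
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; because this gene begins with ATG, alternative start codons are not handled. `CODON_TABLE` and `translate` are hypothetical names used for illustration.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to format the sequence in blocks.
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGCGTGTGT TGTTGTCGAC GTATGGGTCG"))
```

The first 30 bases translate to MRVLLSTYGS, which matches the first block of the protein sequence.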