Gene Sros_1648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1648 
Symbol 
ID8664925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1761107 
End bp1762723 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content75% 
IMG OID 
Productglycosyltransferase-like protein 
Protein accessionYP_003337382 
Protein GI271963186 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.323978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGATG GAGCGGGGAC CGACCTGCCC CTGATCCGGC ACAACGACTA CTCCCCCCTG 
GTCCCTCCCG CCCTCGGCGA ATGGGATCCG GCGCTCCCGG TAAGCGTGAT CATCCCCGCG
CACGGCGGCC AGCACAGGCT CGACCTGACC CTCGCCGCCC TGGCGGCGCA GACCTACCCC
GGTCACCTCA TGGAAGTGAT CGTGGTGGAC GACGGCAGCG ACCCGCCCCT GCGGCTGCCG
GAGATCGCTC CGCCGGGCAC CAGGATCGTC GCCGCCGACC CCGGGCGCTG GGGCATCGCG
CACGCGGTGA ACACCGGGGC CGCCCTCGCC GAGGGCCGGA TCATCCAGCG CCTGGACGCC
GACATGGTCG TCTGCCGCGA GCACATCGAG GCACTGGCCC GCTGGCACCA CCTGGCCGGC
TACCTGGTCG CCATCGGCGC GAAGAAGTTC GTCGAGGAGC CCGAGCTCTC CCCGGCCCAC
CTGTACGACG GCGTGCGCAC GGGCCCGCTG GAGGCCGTCT TCGACCTGTC GGAGGCGCTG
CCCAGCTCCA CCGAGCAGAC CATCTCCCGG ACCGACGGCC TGCGGACCAG CCGCAACCCC
TACCACGTGT GCACCGGGCC GACGGTGTCG ATGCGGCGGG AGACCTTCCA CGCCGTCGGC
GGGATCGATC CCGACGTGCT CAGGGGCGAG GACACCGAGT TCGCCTACCG GCTGGCCGCG
CACGGGGCGG TCTTCGTCCC CGACATGGCC GCCCAAGCCG TGCACCTGGG ACTTCCCGCG
CAGCGCCGTG ACCGTGACCG GGCGGTCCGC GCGGTCGGCC CCTACCTCGC CCACCGCGTT
CCGCTCCGCC GTGACCTGCG CAAGGACCGG GGCCGGCGGT GGCTGGTGCC GTACGTGGAG
GTGGTGCTCC ACGTCGACGG CGACGAAAGG CAGGTGCGCG ACGCGGTGAG CGCGGCGCTG
GAGGGGTCGG TGACCGACGT GCGGGTCACC CTGGTCGCCC CCTGGTCCCG GCTGTCCCCG
GGCCGCCGCG CGGTGCTCGG CGACCCCTCC TTCGAGCTGC GGCTGCTGCG CGAGCACTTC
GCCCACGACG AGCGGGTACG GCGGGCCGAC GAGGTCTCCC CCACCCCCGC GCCGATCCCC
TTCCGCTACA CCGGCCCGAT CTCGGTCCCG CTGGGGCACG GCTCGCTGGA GCGGATGATC
GCCGCGCTCC AGGACGACCG GTCCGGCATG CTCGTCGTCG ACCTCGGCGA CGACGGTACG
GCGACGCTGG AGCGGACCGA GGCGCTGGGC CGGGCACTCC TGCTGGGCGC GGACGACGTC
CCCGCCTCGA TCAAGGCCAC CCACGGCGTG CGGCACGGCG ACCGGGCGGA GTTCTGGCCG
GTCCCGGCCG CTCCGGCCGC TCCCGCCCGG AAGCCCGCGG GGGCTTCCGC GGAGAAGGCG
GAGAAGTCCG CGCAGGCCGC GCCCGGGAGG CCCGCGCAGG CTCCCCCGGA GAAGCCGGCC
TGGAATCCGC CGGAACAGCC CACGGCGGCT CCGTCCCGCG GGGACCGGCC CCCGGCGCCC
GCGCGGAAGC CGGAGTCCCG GCTCTCCAGG CTCCGTTCGG CGATCCGGAG GGGCTGA
 
Protein sequence
MTDGAGTDLP LIRHNDYSPL VPPALGEWDP ALPVSVIIPA HGGQHRLDLT LAALAAQTYP 
GHLMEVIVVD DGSDPPLRLP EIAPPGTRIV AADPGRWGIA HAVNTGAALA EGRIIQRLDA
DMVVCREHIE ALARWHHLAG YLVAIGAKKF VEEPELSPAH LYDGVRTGPL EAVFDLSEAL
PSSTEQTISR TDGLRTSRNP YHVCTGPTVS MRRETFHAVG GIDPDVLRGE DTEFAYRLAA
HGAVFVPDMA AQAVHLGLPA QRRDRDRAVR AVGPYLAHRV PLRRDLRKDR GRRWLVPYVE
VVLHVDGDER QVRDAVSAAL EGSVTDVRVT LVAPWSRLSP GRRAVLGDPS FELRLLREHF
AHDERVRRAD EVSPTPAPIP FRYTGPISVP LGHGSLERMI AALQDDRSGM LVVDLGDDGT
ATLERTEALG RALLLGADDV PASIKATHGV RHGDRAEFWP VPAAPAAPAR KPAGASAEKA
EKSAQAAPGR PAQAPPEKPA WNPPEQPTAA PSRGDRPPAP ARKPESRLSR LRSAIRRG