Gene Sros_0375 details

Gene Information

Locus tag: Sros_0375
Symbol:
ID: 8663643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 365235
End bp: 366944
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Glycosyltransferase-like protein
Protein accession: YP_003336149
Protein GI: 271961953
COG category:
COG ID:
TIGRFAM ID:
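
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes inclusive, 1-based start and end positions (the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record; inclusive 1-based numbering is assumed.
start_bp = 365235
end_bp = 366944

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1710 569
```

Under these assumptions the computed values agree with the listed gene length (1710 bp) and protein length (569 aa).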


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGAGT GTGGCAAACG TTGCCTATGG TACGGTCACG GCGTGGTATC CGATGCCAGC 
AGGCCAGACT CCCCCAAGGT TTCCGCGAAG TCGGCCCGTG CCGGACTGCG CGGGCTTCTC
AAGGGGTTCG TCCAGCACCC TGTGATCGTC TCCCGAGTCG TGGCGACCAA GGTCAAGTCC
GACCCTGTGC GTGTGGCTCA GGCCGCCGCG GAGACCCTCC CGCCGCGGCT CCGCCCCGTC
GTGGGCCGGG TCGCCTGGCC CGCCGCGCGC CGAGCCAGGA TCGTCGTGCG CAAGCTCGGC
ATGCGCCTGG TCAGGGGGCC GTGGGACGAG GCGAAGGCGC ACTTCGACGC CGGGCGGATG
ACCGAGGCCG CGGCGGTGCT CCAGCCCTAC ACCAAATACC CCTTCATCAA CCGCCGGGCC
ACCTACTACA CCGGCGAGCT GGCCTCCATC CAGCCCAACC CGATCCCGCC CAAGTCCAAG
GTGATCGTGG GCGAGCGGGT GGAGGGCCGG GTGCTGCACT GCGTCACCAA CGCCCTGCCG
TACACGCAGG CCGGATACAC CGTGCGCACG CACCGGATCG TCACCGCGCA GAAGGCCGCG
GGGCTGGATC CGCACGTCGT GACCAGCTGG GGCTGGCCGA TGATGCAGGG CCACGTCGAC
GCGGAGCCGT ACGAGGAGAT CGACGAGATC CCCTACCACC GGCTGCTGCC GAGCGGCGAT
GTGCCGTTCG AGAGCCACGG CCGCATGATC CGGGGCGCCG GCGAGGTGAC CGAGCTGGTC
AGGACGCTCC GGCCGCAGGT GCTGCACGCC GCGACCGACC ACCGCAACGG CTCGGTGGCG
CTGGCGGTGC GCGAGCGGAC CGGCACGCCG ATGGTCTACG AGGTCCGGGG CTTCCTGGAG
GAGACCTGGG CCTCCCGCGA CCCCAAGCGC GTCGGCAGCC AGCGGCACGT GCTCCAGCGT
GACCGCGAGG CGTTCATCAT GCGTTCGGCC GACGCCGTGG TCACCCTCGC GGAGACCATG
GCCACCGAGA TCGTCGAGCG GGGCGTGCCG AGGGAGAAGA TCTTCCTGGC GCCCAACGCG
GTGGACGACT CGCTGCTGAC CGCGGAGTAC GACGGGGCGA CGTTCCGCTC CGCCTACGGC
ATCGAGCCCG GTGAGATCGT CATGGGCTCG GTGTCGAGCA TCGTGGCCTA CGAGGGCTTC
GCGACCATGA TCAACGCGGC CGCCCTCCTG CGGGATCAGG GCGCCCCGAT CAGGCTGCTC
CTCGTCGGCG ACGGCGCCGA GCGCCCGGCA CTGCTGGAGC AGGTCGAGGA GCTGGGGCTC
GGGGACGTCG CGATCCTGCC CGGCCGGGTC GGCCCGGACG AGGCGCTGCA GGCCCAGTCG
GCCATCGACA TCTTCGTCTG CCCCCGCGAG GACCTGCGGG TCTGCCGACT CGTTACGCCA
TTGAAACCCG TCGAGGCGAT GGCCCTCGGC AAGCCGGTCG TGCTGAGCGA TCTGCCCGCC
CTGTCGGAGC TCGTCGGCTC CGAGGGCGCC GGGCTCCTGG TCCCCGCGGG TGACCCGGAG
GCCCTGGCGG AGGCACTCGC CGCACTGCGC GACGACCCGG CGCGGCGGGC GGCGATGGGC
GAGGCCGGAC GCGCGGAGGT GGCCGCGAAG CGCACGTGGA GCCGCGTCGC GGAAACGTAC
CGTGACATTT ACCGATCACT TGCCGGTTGA
 
Protein sequence
MRECGKRCLW YGHGVVSDAS RPDSPKVSAK SARAGLRGLL KGFVQHPVIV SRVVATKVKS 
DPVRVAQAAA ETLPPRLRPV VGRVAWPAAR RARIVVRKLG MRLVRGPWDE AKAHFDAGRM
TEAAAVLQPY TKYPFINRRA TYYTGELASI QPNPIPPKSK VIVGERVEGR VLHCVTNALP
YTQAGYTVRT HRIVTAQKAA GLDPHVVTSW GWPMMQGHVD AEPYEEIDEI PYHRLLPSGD
VPFESHGRMI RGAGEVTELV RTLRPQVLHA ATDHRNGSVA LAVRERTGTP MVYEVRGFLE
ETWASRDPKR VGSQRHVLQR DREAFIMRSA DAVVTLAETM ATEIVERGVP REKIFLAPNA
VDDSLLTAEY DGATFRSAYG IEPGEIVMGS VSSIVAYEGF ATMINAAALL RDQGAPIRLL
LVGDGAERPA LLEQVEELGL GDVAILPGRV GPDEALQAQS AIDIFVCPRE DLRVCRLVTP
LKPVEAMALG KPVVLSDLPA LSELVGSEGA GLLVPAGDPE ALAEALAALR DDPARRAAMG
EAGRAEVAAK RTWSRVAETY RDIYRSLAG
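
The gene and protein sequences above can be compared, and the GC content recomputed, with a short script. This is a minimal sketch, not the database's own method: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table uses the standard amino acid assignments, which translation table 11 shares with the standard code. Alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGCGGGAGT GTGGCAAACG TTGCCTATGG"))  # MRECGKRCLW
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the listed protein sequence and GC content.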