Gene Sros_1645 details


Gene Information

Locus tag: Sros_1645
Symbol:
ID: 8664922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1757101
End bp: 1758822
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: glycosyltransferase-like protein
Protein accession: YP_003337379
Protein GI: 271963183
COG category:
COG ID:
TIGRFAM ID:
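
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1757101
end_bp = 1758822

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the result agrees with the listed Gene Length (1722 bp) and Protein Length (573 aa).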


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.514144
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCAC GGATCACCGG CAACGACTAC CGGAGCCTCA CCCCGCCCGA GCTCGGCGCC 
TGGACGCCGT CGCTGCGGGT CAGCGTGGTC GTGCCCGCCT ACGGCGGCCA GGACAAGCTC
GACCTCGTGC TCGCGGGGCT GGCGGGGCAG ACCTATCCCG CCGGCCTGAC CGAGGTCATC
GTGGTGGACA ACGGCAGCGA GCCGCCGCTG CGCCTGCCCG AGCTGAGGCC CGCGAGCACC
CGGCTCATCG TCTGCCCCAC TCCCGGGCGG GCCCACGCCC GCAACGCGGG GCTCGGCGCC
GCCACCGGTG ACGTGATCCA CTGGCTCGAC TCCGACGTGG TCCTGGACCG CCGGTCCGTC
GAGGCGCACA TGCGCTGGCA CCACGCGGCG CCCTACCTCG TGGTGACCGG CTACCTGCGT
TTCACCCCGG CGGAGCTGCC CGCCCCCCGG GAGGTGGCCG CGGCCGCCGA CCTGGCCGAG
CTCTTCGAAC CGGCCGAGCC GCACGCCTGG CTGGTGGACC TCATCGAGCG GACCGACGGC
CTCACCGACA ACCCGCACCG CGCGTTCAGC CTGCACGTGG GCGGCGCCAC CTCGGTCAAC
GCCGCGCTGC TCGCCCAGGC CGGGCCGATG GACACCGAGC TCATCCTTGG CCAGGACACG
GAGATGGGCT ACCGGCTGGC CCAGGCGGGC GCGGTCTTCG TGCCCGAGCC GCTGGCCCGC
GCCTTCCACC TCGGTCCCAC CATGCGGATG CGCGACAAGG CGCCGATCGA CCGGGTCAGC
CACGCCTTCG TCGCCGACCG GATCCCGAGT TATCGGTGGC TCCGCGCCCA TCCGGCCCGG
CAGTGGAAGG TGCCCTACCT GGAGGTGGTC GTCGGGCCCG CCGGGCGGCC GGACCACGCC
GGGGAGGGCG GGCGCGGGTA CGGCTACGAC GAGGTCCGCG CCACCGTGGA CGCCGTCCTC
GCGGGCACCG TGCCCGACGT GGTGGTCACC GTCACCGGTC CCTGGGACCG CGTCCGGACC
GAGGGCCGCG CGCCGTTGCG GAACCCGGAC CTGGACCTGG AGCTGATCCG CGGCCACTAC
GCCCACGAGG GCCGGGTGCG GTTCGCCCTC GACGCCCCTC GGACGGTGCG GATGGCTCCG
CCGTACCCGG CGGAGGGCGG CGAGGCGCCG GAGGCCGGGG CGGCCCCGCC GTACCGGCTG
AGGCTGCCCG CCGGCTGGGT GCCGGGCGAG GACAGCCTCG CCCGCCTGCT CGACGTGGCC
GGGGACGGGG GATACGGGCT GGTCTCGGCG CTGCTGGCCG AGGGGGCCGG CGAGGGGATC
GTGGCGGCCA GGCTGGAGCG CACCGCCGCG TTCGCCCGTG CCGCGATCGT CCGGCGGGAG
GGCGAGGACC TCGACGACGC GGTGGAGGAC ACCTCCGGGG TGCTCTGGGT GGACGGCGAG
ACCTACGGGT TCCTGCCGGA GGCCCGGCCG ATCATCGGCC GCCGCGGCGC GTACCGGGCC
AGGACGGAGG CGCAGGCCGA GATCGCCCGC CTCGCCAAGG AGAACGAGCG GCTGCGCGCC
CAGGTGACCA GGTGGCGCGA CGAGGCGGGC CGCTGGCGCA AGAGCGCGGT CGAGCTGCGG
CGCGAGGTCG GCGGTCTGCG CAAGGAGCTG GCCGCCGCCA GGAAGATCGT CCAGTACGGC
CTGCTCTCGT CCGTCAAGCG GGCGATCATC CGCCGCCGGT GA
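
The GC content reported for this gene can be recomputed from the sequence text above. The following is a minimal sketch, assuming the sequence is pasted as a string with the same space and line-break grouping used on this page; `gc_fraction` is a hypothetical helper name, not part of any tool referenced here.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First ten bases of the gene sequence: 5 of the 10 bases are G or C.
print(gc_fraction("ATGACAGCAC"))
```

Passing the full 1722 bp sequence to the same function gives a value that can be compared with the GC content listed under Gene Information.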
 
Protein sequence
MTARITGNDY RSLTPPELGA WTPSLRVSVV VPAYGGQDKL DLVLAGLAGQ TYPAGLTEVI 
VVDNGSEPPL RLPELRPAST RLIVCPTPGR AHARNAGLGA ATGDVIHWLD SDVVLDRRSV
EAHMRWHHAA PYLVVTGYLR FTPAELPAPR EVAAAADLAE LFEPAEPHAW LVDLIERTDG
LTDNPHRAFS LHVGGATSVN AALLAQAGPM DTELILGQDT EMGYRLAQAG AVFVPEPLAR
AFHLGPTMRM RDKAPIDRVS HAFVADRIPS YRWLRAHPAR QWKVPYLEVV VGPAGRPDHA
GEGGRGYGYD EVRATVDAVL AGTVPDVVVT VTGPWDRVRT EGRAPLRNPD LDLELIRGHY
AHEGRVRFAL DAPRTVRMAP PYPAEGGEAP EAGAAPPYRL RLPAGWVPGE DSLARLLDVA
GDGGYGLVSA LLAEGAGEGI VAARLERTAA FARAAIVRRE GEDLDDAVED TSGVLWVDGE
TYGFLPEARP IIGRRGAYRA RTEAQAEIAR LAKENERLRA QVTRWRDEAG RWRKSAVELR
REVGGLRKEL AAARKIVQYG LLSSVKRAII RRR
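
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The following is a minimal sketch in plain Python, assuming the standard codon-to-amino-acid assignments (which table 11 shares with the standard code) and an ATG start; it does not handle the alternative start codons that table 11 permits. `translate` is an illustrative helper, not the database's own method.

```python
from itertools import product

# Codons in the conventional TCAG order, paired with their one-letter amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    sequence = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons and last four codons of the gene sequence on this page.
print(translate("ATGACAGCAC GGATCACCGG CAACGACTAC"))
print(translate("CGCCGCCGGT GA"))
```

The first call reproduces the opening residues of the protein sequence (MTARITGNDY), and the second reproduces its final three residues (RRR) followed by the TGA stop.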