Gene Sros_4999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4999 
Symbol 
ID8668293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5522691 
End bp5525195 
Gene Length2505 bp 
Protein Length834 aa 
Translation table11 
GC content72% 
IMG OID 
ProductMannosyl-glycoprotein endo-beta-N-acetylglucosaminidase 
Protein accessionYP_003340541 
Protein GI271966345 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.517089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.335211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGCC GCGTGTTACC CGCCCTGTGC GGGGCCCTGA TCGCTACGGC CCTCGCCGTC 
CAGCCCGCCC TCGCCGCCCC CACGGCCCGG CCCACCTCCT CACCCGCCGA CCCATCCCCC
TCGGCTGCTC CGTCCACCCC GGCGGCGCCG AACCGGTCGG GCCGCACGGC GGTGCGGGCG
GCGGGCGACC AGCCCTACGC CTCCTACTGG CACCCGAACA CGATCCTGAA CTGGGACCCG
GCGACGGACC CGGACGCGCG GTTCAACCGG TCGCGGGTAC CGCTGCGGCC CCGCGCCTCC
GACCCCGCGC TCAAGGCCAA CCCGAACGCG CGCGCCGGCG AGGGCAAGAT CGCCTCGCTG
GTGTCGTTCG CACCCACCTC CGGCAACCCG TCACAGGGCT CCCTCGACCC CAACTACTAC
GCCTTCACCT ACTGGCAGTA CATCGACACC CTGGTCTTCT GGGGCGGTTC GGCGGGCGAG
GGCCTCATCC TCGCCCCCAA CGCGACGGTG ATCGACGCCG CGCACCGCAA CGGGGTCAAG
GTCTACGGCA CGGTGTTCTT CCCGCCCACG GCGTACGGCG GGCAGATCCA GTGGGTACGC
GACTTCGTGC GGAAGTCCGG CCCGAACTAC CCGGTCGCTG ACAAGCTGGT ACAGGTGGCC
CAGCACTACG GGTTCGACGG CTGGTTCATC AACCAGGAGA CCGCGGGCGG CGACGCCGCC
CTGGCGACCG AGATCCGTTC GCTGATGAAG TACGCCCGGG CCAAGAGCCA GGCCGAGTTC
ATGTGGTACG ACGCGATGAC CGAGAACGGC TCGGTGAGCT GGCAGGACGC GCTCACCACG
GCGAACGACG CCTTCCTGAG CGACCCCGCC CGGGTGTCCG ACTCGATGTT CCTGGACTTC
GGCTGGGGCG CGGGCAACCT GCGCTCGTCG CGCGACCTGG CCCGCTCCCT CAGCCGGGAC
GAGCACGAGC TGTATGCGGG CATCGACACC GAGGCCAACG GATACAACAC CGGGGTGAGC
TGGGACGCGG TGTTCCCCAC CGGTCAGCCG CACGTCACCT CGCTGGGCAT CTACCGGCCC
GAGTGGACGT GGAAGTCCTC CTCAGGCCCG GCCGACTTCC GCACCCGCGA CTCCCGGTAC
TGGGTGGGCG CGAACGCCGA CCCGTCCAAC ACCGCGACCT CCTCGTCCTG GAAGGGCCTG
GCGCACCACA TCGCCGAGTC CACGGCCGTG ACCGCGAAGC CGTTCGTCAC CGGGTTCAAC
ACCGGCCACG GCGACTTCTA CAACGTCGGC GGGACCCGCG TGCGCACCGG CGGCTGGAAC
AACCTGTCAG TACAGGACGT GCCCCCCACC TATCAGTGGG TGGTGGACTC CAGCGGGACC
CGGCTCACGC CATCCATCGA CTACGGCGAC GCGTACGAGG GCGGCTCGTC GCTGCGGCTC
ACCGGGAATC TCGACGCAGC CAACACCGTG CGGCTCTACC AGGCGAAGCT GCCGGTGACG
GCCGGCACCA AGCTGTCCGT CGTGGTCAAG ACGCCCGCCG CGGGTCCGAC CCGCCTGAAG
GCGGCCGTCG CGTTCACCGA CGCGCCGGGT ACCTTCACCA CATTCGACCT GGGGTCCACG
AGCGGCACCG GCTGGGAGCG GAAGACGCTG GATCTGTCGG CGCACGCCGG CAAGACCATC
GCCCAGCTCG GCCTGCGCGC CGAGGGCTCC GCCGCCTCCT ACGACATCCG GGTCGGGCAG
CTCGCCGTGT ACGACGGAGC CGTGGACGCG CCCGCCGCGC CCTCCGGCCT CCAGGTGCTC
GGCGCGACCG ACGTGTCCCC CTCGCGCAAG TCGCTCCGCC TGGCCTGGAC CGCCGCCGGC
TCCGGGTCCG GCCAGGTCCA CCACTACGAC GTCTACCGCC GCAACGCCGA CGGCTCCCGG
ACCTACCTGG GCGCGACCCC GAACGACGCG TACTTCGTGC CGCAGCTCGA CCGGGCCGGC
GCGGAGACCA GCACGACGAT CGAGGTCGAG GCCGTCTCCA CCGAGTACGG CCGGTCGGCC
GCCGCGACCG CGACGGTCGC CTGGTCGGGG GAGCCGGGGG AGGACAATCG CGCGCTCGGC
CGCCCGGCAA CCGCCTCCGG GCAGTGCAAC GCGAACGAGG GGCCGGCCAA GGCCGTGAAC
GGCACCGTCA CCGGCGGCAA CTCCGACAAA TGGTGCACGC TCACCGCGAA CAGGTGGCTG
GAGGTGGACC TCGGCGAGGC CCGCTCCCTG ACCCGGTTCG TGGTCCGGCA CGCGCAGGCC
GGCGGGGAGC CCGCCGCGTT CAACACCCGG GACTTCACGA TCCAGGTGCG CTCCGCCGCC
TCGGAGGAGT GGGAGACCGC GGTGACCGTC ACCGGCAACA CGACGGCCAC CACCACTCAC
CCGGTCACCC TCACCGCCCG CCACGTCAGG CTGTCCATCA CCAAACCGGC TCAGAACACC
GACTCCGCCG CCCGGATCTA CGAGTTCGAG GCGTGGGGGA GGTAG
 
Protein sequence
MRRRVLPALC GALIATALAV QPALAAPTAR PTSSPADPSP SAAPSTPAAP NRSGRTAVRA 
AGDQPYASYW HPNTILNWDP ATDPDARFNR SRVPLRPRAS DPALKANPNA RAGEGKIASL
VSFAPTSGNP SQGSLDPNYY AFTYWQYIDT LVFWGGSAGE GLILAPNATV IDAAHRNGVK
VYGTVFFPPT AYGGQIQWVR DFVRKSGPNY PVADKLVQVA QHYGFDGWFI NQETAGGDAA
LATEIRSLMK YARAKSQAEF MWYDAMTENG SVSWQDALTT ANDAFLSDPA RVSDSMFLDF
GWGAGNLRSS RDLARSLSRD EHELYAGIDT EANGYNTGVS WDAVFPTGQP HVTSLGIYRP
EWTWKSSSGP ADFRTRDSRY WVGANADPSN TATSSSWKGL AHHIAESTAV TAKPFVTGFN
TGHGDFYNVG GTRVRTGGWN NLSVQDVPPT YQWVVDSSGT RLTPSIDYGD AYEGGSSLRL
TGNLDAANTV RLYQAKLPVT AGTKLSVVVK TPAAGPTRLK AAVAFTDAPG TFTTFDLGST
SGTGWERKTL DLSAHAGKTI AQLGLRAEGS AASYDIRVGQ LAVYDGAVDA PAAPSGLQVL
GATDVSPSRK SLRLAWTAAG SGSGQVHHYD VYRRNADGSR TYLGATPNDA YFVPQLDRAG
AETSTTIEVE AVSTEYGRSA AATATVAWSG EPGEDNRALG RPATASGQCN ANEGPAKAVN
GTVTGGNSDK WCTLTANRWL EVDLGEARSL TRFVVRHAQA GGEPAAFNTR DFTIQVRSAA
SEEWETAVTV TGNTTATTTH PVTLTARHVR LSITKPAQNT DSAARIYEFE AWGR