Gene Information |
Locus tag | Sros_4999 |
Symbol | |
ID | 8668293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 5522691 |
End bp | 5525195 |
Gene Length | 2505 bp |
Protein Length | 834 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase |
Protein accession | YP_003340541 |
Protein GI | 271966345 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
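The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the unspliced CDS length includes the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length counts the terminal stop codon,
    which does not code for an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(5522691, 5525195)
print(length, protein_length(length))  # prints: 2505 834
```

Under these assumptions the computed values agree with the Gene Length (2505 bp) and Protein Length (834 aa) fields.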
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.517089 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.335211 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGCC GCGTGTTACC CGCCCTGTGC GGGGCCCTGA TCGCTACGGC CCTCGCCGTC CAGCCCGCCC TCGCCGCCCC CACGGCCCGG CCCACCTCCT CACCCGCCGA CCCATCCCCC TCGGCTGCTC CGTCCACCCC GGCGGCGCCG AACCGGTCGG GCCGCACGGC GGTGCGGGCG GCGGGCGACC AGCCCTACGC CTCCTACTGG CACCCGAACA CGATCCTGAA CTGGGACCCG GCGACGGACC CGGACGCGCG GTTCAACCGG TCGCGGGTAC CGCTGCGGCC CCGCGCCTCC GACCCCGCGC TCAAGGCCAA CCCGAACGCG CGCGCCGGCG AGGGCAAGAT CGCCTCGCTG GTGTCGTTCG CACCCACCTC CGGCAACCCG TCACAGGGCT CCCTCGACCC CAACTACTAC GCCTTCACCT ACTGGCAGTA CATCGACACC CTGGTCTTCT GGGGCGGTTC GGCGGGCGAG GGCCTCATCC TCGCCCCCAA CGCGACGGTG ATCGACGCCG CGCACCGCAA CGGGGTCAAG GTCTACGGCA CGGTGTTCTT CCCGCCCACG GCGTACGGCG GGCAGATCCA GTGGGTACGC GACTTCGTGC GGAAGTCCGG CCCGAACTAC CCGGTCGCTG ACAAGCTGGT ACAGGTGGCC CAGCACTACG GGTTCGACGG CTGGTTCATC AACCAGGAGA CCGCGGGCGG CGACGCCGCC CTGGCGACCG AGATCCGTTC GCTGATGAAG TACGCCCGGG CCAAGAGCCA GGCCGAGTTC ATGTGGTACG ACGCGATGAC CGAGAACGGC TCGGTGAGCT GGCAGGACGC GCTCACCACG GCGAACGACG CCTTCCTGAG CGACCCCGCC CGGGTGTCCG ACTCGATGTT CCTGGACTTC GGCTGGGGCG CGGGCAACCT GCGCTCGTCG CGCGACCTGG CCCGCTCCCT CAGCCGGGAC GAGCACGAGC TGTATGCGGG CATCGACACC GAGGCCAACG GATACAACAC CGGGGTGAGC TGGGACGCGG TGTTCCCCAC CGGTCAGCCG CACGTCACCT CGCTGGGCAT CTACCGGCCC GAGTGGACGT GGAAGTCCTC CTCAGGCCCG GCCGACTTCC GCACCCGCGA CTCCCGGTAC TGGGTGGGCG CGAACGCCGA CCCGTCCAAC ACCGCGACCT CCTCGTCCTG GAAGGGCCTG GCGCACCACA TCGCCGAGTC CACGGCCGTG ACCGCGAAGC CGTTCGTCAC CGGGTTCAAC ACCGGCCACG GCGACTTCTA CAACGTCGGC GGGACCCGCG TGCGCACCGG CGGCTGGAAC AACCTGTCAG TACAGGACGT GCCCCCCACC TATCAGTGGG TGGTGGACTC CAGCGGGACC CGGCTCACGC CATCCATCGA CTACGGCGAC GCGTACGAGG GCGGCTCGTC GCTGCGGCTC ACCGGGAATC TCGACGCAGC CAACACCGTG CGGCTCTACC AGGCGAAGCT GCCGGTGACG GCCGGCACCA AGCTGTCCGT CGTGGTCAAG ACGCCCGCCG CGGGTCCGAC CCGCCTGAAG GCGGCCGTCG CGTTCACCGA CGCGCCGGGT ACCTTCACCA CATTCGACCT GGGGTCCACG AGCGGCACCG GCTGGGAGCG GAAGACGCTG GATCTGTCGG CGCACGCCGG CAAGACCATC GCCCAGCTCG GCCTGCGCGC CGAGGGCTCC GCCGCCTCCT ACGACATCCG GGTCGGGCAG CTCGCCGTGT ACGACGGAGC CGTGGACGCG CCCGCCGCGC CCTCCGGCCT CCAGGTGCTC 
GGCGCGACCG ACGTGTCCCC CTCGCGCAAG TCGCTCCGCC TGGCCTGGAC CGCCGCCGGC TCCGGGTCCG GCCAGGTCCA CCACTACGAC GTCTACCGCC GCAACGCCGA CGGCTCCCGG ACCTACCTGG GCGCGACCCC GAACGACGCG TACTTCGTGC CGCAGCTCGA CCGGGCCGGC GCGGAGACCA GCACGACGAT CGAGGTCGAG GCCGTCTCCA CCGAGTACGG CCGGTCGGCC GCCGCGACCG CGACGGTCGC CTGGTCGGGG GAGCCGGGGG AGGACAATCG CGCGCTCGGC CGCCCGGCAA CCGCCTCCGG GCAGTGCAAC GCGAACGAGG GGCCGGCCAA GGCCGTGAAC GGCACCGTCA CCGGCGGCAA CTCCGACAAA TGGTGCACGC TCACCGCGAA CAGGTGGCTG GAGGTGGACC TCGGCGAGGC CCGCTCCCTG ACCCGGTTCG TGGTCCGGCA CGCGCAGGCC GGCGGGGAGC CCGCCGCGTT CAACACCCGG GACTTCACGA TCCAGGTGCG CTCCGCCGCC TCGGAGGAGT GGGAGACCGC GGTGACCGTC ACCGGCAACA CGACGGCCAC CACCACTCAC CCGGTCACCC TCACCGCCCG CCACGTCAGG CTGTCCATCA CCAAACCGGC TCAGAACACC GACTCCGCCG CCCGGATCTA CGAGTTCGAG GCGTGGGGGA GGTAG
|
Protein sequence | MRRRVLPALC GALIATALAV QPALAAPTAR PTSSPADPSP SAAPSTPAAP NRSGRTAVRA AGDQPYASYW HPNTILNWDP ATDPDARFNR SRVPLRPRAS DPALKANPNA RAGEGKIASL VSFAPTSGNP SQGSLDPNYY AFTYWQYIDT LVFWGGSAGE GLILAPNATV IDAAHRNGVK VYGTVFFPPT AYGGQIQWVR DFVRKSGPNY PVADKLVQVA QHYGFDGWFI NQETAGGDAA LATEIRSLMK YARAKSQAEF MWYDAMTENG SVSWQDALTT ANDAFLSDPA RVSDSMFLDF GWGAGNLRSS RDLARSLSRD EHELYAGIDT EANGYNTGVS WDAVFPTGQP HVTSLGIYRP EWTWKSSSGP ADFRTRDSRY WVGANADPSN TATSSSWKGL AHHIAESTAV TAKPFVTGFN TGHGDFYNVG GTRVRTGGWN NLSVQDVPPT YQWVVDSSGT RLTPSIDYGD AYEGGSSLRL TGNLDAANTV RLYQAKLPVT AGTKLSVVVK TPAAGPTRLK AAVAFTDAPG TFTTFDLGST SGTGWERKTL DLSAHAGKTI AQLGLRAEGS AASYDIRVGQ LAVYDGAVDA PAAPSGLQVL GATDVSPSRK SLRLAWTAAG SGSGQVHHYD VYRRNADGSR TYLGATPNDA YFVPQLDRAG AETSTTIEVE AVSTEYGRSA AATATVAWSG EPGEDNRALG RPATASGQCN ANEGPAKAVN GTVTGGNSDK WCTLTANRWL EVDLGEARSL TRFVVRHAQA GGEPAAFNTR DFTIQVRSAA SEEWETAVTV TGNTTATTTH PVTLTARHVR LSITKPAQNT DSAARIYEFE AWGR
|
| |
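The relation between the Gene sequence, Protein sequence, and GC content fields can be sketched in a few lines. This is an illustrative example, not the database's own pipeline. It assumes the gene sequence is listed in coding orientation (it begins with ATG, so no reverse complement is applied for the minus strand) and that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Translation
# table 11 uses the same codon-to-amino-acid assignments as the
# standard code (assumption stated in the text above).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the Gene sequence field of this record.
prefix = "ATGCGACGCC GCGTGTTACC CGCCCTGTGC"
print(translate(prefix))  # prints: MRRRVLPALC
```

The translated prefix matches the first ten residues of the Protein sequence field. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the reported GC content of 72%.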