Gene Information |
Locus tag | Sros_5005 |
Symbol | |
ID | 8668299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5530762 |
End bp | 5533776 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | MAN2C1 protein-like protein |
Protein accession | YP_003340547 |
Protein GI | 271966351 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
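The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 5530762
END_BP = 5533776
GENE_LENGTH_BP = 3015
PROTEIN_LENGTH_AA = 1004


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5533776 - 5530762 + 1 gives 3015 bp, and 3015 / 3 - 1 gives 1004 aa, which agrees with the listed values.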
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.457974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.47837 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGACG ATCGCGAGCT GGTCGAGAAC CGGCTCAAGC GGGTGCTCGA CGAGCGGATC CGCCCCGCGG TGTACCCCGC GTCCGTGCCG CTGGACGTGG CCGTGTGGCA CGCGCCCGGC GAGCCCGTAC CTGTGGCCGA GGGCCTGGCG GCATCCCCCT CACCGATCGC GGTCGGTGAC GCGTGGGGCG CGCCGTGGGG CACGAGCTGG TTCACGGTCA CCGGCACGGT GCCCGCCGAC TGGGCGGGCC GGACCGTGGA GGCCATCCTC GACCTGGGCT TCGACGAGAA CATGCCGGGC TTCCAGTGCG AAGGCCTGGT CTACCGGCCC GATGGCAACC CGGTGAAGGG TCTCAACCCG CGTAACCAGT GGGTGCGCGT CGGCGCGCCG GTCCAGGGTG GCGAGGAGGT GCGGCTGCAC ATCGAGGCGG CCTCCAACCC GGTCATCCTC GACTACCACC CCTTCCTGCC CACGAAGCTC GGAGACAAGG AGACGGCCGG CAGCGAGCCG CAGTACCGGC TGGCCCGCAT GGAACTCGCG ATCTTCGACG AGACCGTCTG GCAGCTCGTG ATCGACCTGG AGGTGCTCGG CGAGCTGATG GCGGAGCTGC CCGTGGACGG CGCCCGCCGC TGGGAGCTGC TGCGCGCCGT CGAGCGGGCC CTCGACGCGA TCGACCTGCA GGACGTCAAC GGCACGGCCG CCGCCGCCCG CGCCGAGCTG GCCGGGGTGC TGGCCACGCC GGCCGCGCCC TCGGCCCACC GGATCAGCGC GGTCGGGCAC GCCCACATCG ACTCGGCCTG GCTGTGGCCG CTGCGCGAGA CCGTGCGCAA GGTCGCCCGC ACCACCTCCA ACATGACCGC GCTGCTGGAG GACGAGCCGG AGTTCGTCTT CGCCATGTCC CAGGCACAGC AGTGGGCGTG GATCAAGGAG CACCGGCCGG AGGTCTGGAC CCGGGTCGTC AAGGCCGTGG CCGATGGGCG GTTCGTCCCG ACCGGCGGCA TGTGGGTGGA GTCGGACACG AACATGCCCG GCTCGGAGGC CATGGCCCGG CAGTTCGTGC ACGGCAAGCG CTTCTTCCTC GACGAGTTCG GCGTCGAGAA CGAGGAGGCG TGGCTGCCCG ACACGTTCGG CTTCGCCGCC GGGCTCCCGC AGATCATCAA GGCGGCCGGA TCCAAGCGGC TGCTCACCCA GAAGATCTCC TGGAGCCAGA TCAACAAGTT CCCCCACCAC ACGTTCCTGT GGGAGGGCAT CGACGGCACC CGGATCTTCA CCCACTTCCC GCCCGTGGAC ACCTACAACT GCTCGATGAA GGGCAGCGAG ATCGCCCACG CGGCGCGCAA CTTCAAGGAC AAGGGCGTGG CCCGGCACTC GCTGGCTCCC ACCGGCTGGG GCGACGGGGG CGGCGGCACC ACCCGTGAGA TGGTCGCCAA GGCGGCCCGC CTGCGGGACC TGGAGGGGTC GGCCACCGTT GTCTGGGAGA CGCCCGCCGA GTTCTTCGCC AAGGCCGAGG TGGAGTACCC CGCCCCGCCG GTCTGGGTCG GCGAGCTGTA CCTGGAGCTG CACCGCGCCA CGCTCACCAG CCAGGCCAAG ACCAAGCAGG GCAACCGCCG CAGCGAGTCG CTGCTGCGCG AGGCCGAGCT GTGGGCGGCC ACCGCCGCCG TGCGCACCGG GTCCGAGTAC CCCCACGAGC GGCTCGACCG GATCTGGAAG ACCGTGCTGC TCCACCAGTT CCACGACATC CTGCCCGGCT CGTCCATCGC CTGGGTGCAC CGCGAGGCCG AGAAGACGTA CGCCGCCGTG 
GCCGCCGAGC TGAACGAGAT CATCGACGGC GCGCAGCGGG CCCTCGCGGG AAACCCCGGC TCCGGCGAGC TGGTCTTCAA CGCGGCGCCG CACACCCGCC ACGGCGTCCC GGCCGGCGGC GCGGCCGCGG TCCCAGCCCC CGCCCTCGCC GCCGAACTCC GCACGCGCGG CGAGGGCGGA TACGTCCTGG ACGACGGCCT GCTCCGGGTG GAGGTCGACG GCCGCGGCCT GGTCGTCTCG GCGTACGACC TGCGGGCGGA GCGGGAGACG GTCGCGCCGG GGCAGGCCGC CAACCTGCTG CAGATCCACC CCGACTTCCC CAACATGTGG GACGCCTGGG ACGTGGACCA GTTCTACCGG AACACGGTCA CCGACCTGGT GGACGCCGAC GAGGTCGCGC CGGGCGAGGA GCCGGGCTCG GTCCGCGTGG TGCGCTCGTT CGGCGCGTCG CGGGTCACAC AGGTCCTCAC CGTGGAGGCC GGGCGGCTCG ACATCCGCAC CGAGGTGAAC TGGCACGAGA CCGAGAAGTT CCTCAAGCTG GCCTTCCCGC TGGACGTGCA CGCCGACCGG TACGCCTCCG AGAGCCAGTA CGGCCACGCC TTCCGCGCCA CCCACACCAA CACCAGCTGG GAGGCCGCCA AGTTCGAGGC GTGCAACCAC AAGTTCGTGC ACGTGGAGGA GCCGGACTGG GGCGTGGCGC TGGTCACCGA CTCCACCTAC GGCCACGACC TGACCCGCAC CGTGCGGGCG TCCGACTCGG GCACGACCAC CACGCTGCGG GTGTCGCTGC TGCGGGCGCC GCGGTTCCCC GACCCCGAAA CCGACCAGGG CCTGCACCTG TTCCGGCACG CACTGGTGCC GGGCGCGGCG ATCGGCGACG CGGTGCGCGA GGGCTGGTTC CTCAACCTGC CCGAGCGGCG GGTCCCGGGC GACGCCGAGG TCGCGCCGCT GGTCACCGTG GACGACGACG CGGTCGTGGT AACCGCCGTG AAGCTGGCCG ACGACGGGAG CGGAGACGTC GTCGTCCGCT TCCACGAGTC CCGGGGAGGC CGGGCCCGCG CCCGCGTCTC CACCGGCTTC GCCGCCACCG GCGTCGCCGT GACCGACCTG CTGGAGCGCC CGCTCACCGA CACGGCCCCA CCGGAGCTGA CCGACGGGTC GGTGGCGGTG GCACTGCGCC CGTTCGAGCT GGTCACGCTA CGGTTCAGCC GCTAA
|
Protein sequence | MHDDRELVEN RLKRVLDERI RPAVYPASVP LDVAVWHAPG EPVPVAEGLA ASPSPIAVGD AWGAPWGTSW FTVTGTVPAD WAGRTVEAIL DLGFDENMPG FQCEGLVYRP DGNPVKGLNP RNQWVRVGAP VQGGEEVRLH IEAASNPVIL DYHPFLPTKL GDKETAGSEP QYRLARMELA IFDETVWQLV IDLEVLGELM AELPVDGARR WELLRAVERA LDAIDLQDVN GTAAAARAEL AGVLATPAAP SAHRISAVGH AHIDSAWLWP LRETVRKVAR TTSNMTALLE DEPEFVFAMS QAQQWAWIKE HRPEVWTRVV KAVADGRFVP TGGMWVESDT NMPGSEAMAR QFVHGKRFFL DEFGVENEEA WLPDTFGFAA GLPQIIKAAG SKRLLTQKIS WSQINKFPHH TFLWEGIDGT RIFTHFPPVD TYNCSMKGSE IAHAARNFKD KGVARHSLAP TGWGDGGGGT TREMVAKAAR LRDLEGSATV VWETPAEFFA KAEVEYPAPP VWVGELYLEL HRATLTSQAK TKQGNRRSES LLREAELWAA TAAVRTGSEY PHERLDRIWK TVLLHQFHDI LPGSSIAWVH REAEKTYAAV AAELNEIIDG AQRALAGNPG SGELVFNAAP HTRHGVPAGG AAAVPAPALA AELRTRGEGG YVLDDGLLRV EVDGRGLVVS AYDLRAERET VAPGQAANLL QIHPDFPNMW DAWDVDQFYR NTVTDLVDAD EVAPGEEPGS VRVVRSFGAS RVTQVLTVEA GRLDIRTEVN WHETEKFLKL AFPLDVHADR YASESQYGHA FRATHTNTSW EAAKFEACNH KFVHVEEPDW GVALVTDSTY GHDLTRTVRA SDSGTTTTLR VSLLRAPRFP DPETDQGLHL FRHALVPGAA IGDAVREGWF LNLPERRVPG DAEVAPLVTV DDDAVVVTAV KLADDGSGDV VVRFHESRGG RARARVSTGF AATGVAVTDL LERPLTDTAP PELTDGSVAV ALRPFELVTL RFSR
|
| |
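The GC content field can in principle be recomputed from the gene sequence. The sketch below is a minimal illustration, assuming the sequence is pasted as shown in this record (blocks of ten bases separated by whitespace) and that GC content is the fraction of G and C among all bases. The function name `gc_content` is hypothetical, and the rounding used by the source database is not stated in this record.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence from this record, for illustration only.
    sample = "ATGCATGACG ATCGCGAGCT"
    print(f"{gc_content(sample):.0%}")
```

To reproduce the reported figure, the full gene sequence from the record would be passed in place of the short sample.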