Gene Sros_5005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5005 
Symbol 
ID8668299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5530762 
End bp5533776 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content72% 
IMG OID 
ProductMAN2C1 protein-like protein 
Protein accessionYP_003340547 
Protein GI271966351 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.457974 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.47837 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGACG ATCGCGAGCT GGTCGAGAAC CGGCTCAAGC GGGTGCTCGA CGAGCGGATC 
CGCCCCGCGG TGTACCCCGC GTCCGTGCCG CTGGACGTGG CCGTGTGGCA CGCGCCCGGC
GAGCCCGTAC CTGTGGCCGA GGGCCTGGCG GCATCCCCCT CACCGATCGC GGTCGGTGAC
GCGTGGGGCG CGCCGTGGGG CACGAGCTGG TTCACGGTCA CCGGCACGGT GCCCGCCGAC
TGGGCGGGCC GGACCGTGGA GGCCATCCTC GACCTGGGCT TCGACGAGAA CATGCCGGGC
TTCCAGTGCG AAGGCCTGGT CTACCGGCCC GATGGCAACC CGGTGAAGGG TCTCAACCCG
CGTAACCAGT GGGTGCGCGT CGGCGCGCCG GTCCAGGGTG GCGAGGAGGT GCGGCTGCAC
ATCGAGGCGG CCTCCAACCC GGTCATCCTC GACTACCACC CCTTCCTGCC CACGAAGCTC
GGAGACAAGG AGACGGCCGG CAGCGAGCCG CAGTACCGGC TGGCCCGCAT GGAACTCGCG
ATCTTCGACG AGACCGTCTG GCAGCTCGTG ATCGACCTGG AGGTGCTCGG CGAGCTGATG
GCGGAGCTGC CCGTGGACGG CGCCCGCCGC TGGGAGCTGC TGCGCGCCGT CGAGCGGGCC
CTCGACGCGA TCGACCTGCA GGACGTCAAC GGCACGGCCG CCGCCGCCCG CGCCGAGCTG
GCCGGGGTGC TGGCCACGCC GGCCGCGCCC TCGGCCCACC GGATCAGCGC GGTCGGGCAC
GCCCACATCG ACTCGGCCTG GCTGTGGCCG CTGCGCGAGA CCGTGCGCAA GGTCGCCCGC
ACCACCTCCA ACATGACCGC GCTGCTGGAG GACGAGCCGG AGTTCGTCTT CGCCATGTCC
CAGGCACAGC AGTGGGCGTG GATCAAGGAG CACCGGCCGG AGGTCTGGAC CCGGGTCGTC
AAGGCCGTGG CCGATGGGCG GTTCGTCCCG ACCGGCGGCA TGTGGGTGGA GTCGGACACG
AACATGCCCG GCTCGGAGGC CATGGCCCGG CAGTTCGTGC ACGGCAAGCG CTTCTTCCTC
GACGAGTTCG GCGTCGAGAA CGAGGAGGCG TGGCTGCCCG ACACGTTCGG CTTCGCCGCC
GGGCTCCCGC AGATCATCAA GGCGGCCGGA TCCAAGCGGC TGCTCACCCA GAAGATCTCC
TGGAGCCAGA TCAACAAGTT CCCCCACCAC ACGTTCCTGT GGGAGGGCAT CGACGGCACC
CGGATCTTCA CCCACTTCCC GCCCGTGGAC ACCTACAACT GCTCGATGAA GGGCAGCGAG
ATCGCCCACG CGGCGCGCAA CTTCAAGGAC AAGGGCGTGG CCCGGCACTC GCTGGCTCCC
ACCGGCTGGG GCGACGGGGG CGGCGGCACC ACCCGTGAGA TGGTCGCCAA GGCGGCCCGC
CTGCGGGACC TGGAGGGGTC GGCCACCGTT GTCTGGGAGA CGCCCGCCGA GTTCTTCGCC
AAGGCCGAGG TGGAGTACCC CGCCCCGCCG GTCTGGGTCG GCGAGCTGTA CCTGGAGCTG
CACCGCGCCA CGCTCACCAG CCAGGCCAAG ACCAAGCAGG GCAACCGCCG CAGCGAGTCG
CTGCTGCGCG AGGCCGAGCT GTGGGCGGCC ACCGCCGCCG TGCGCACCGG GTCCGAGTAC
CCCCACGAGC GGCTCGACCG GATCTGGAAG ACCGTGCTGC TCCACCAGTT CCACGACATC
CTGCCCGGCT CGTCCATCGC CTGGGTGCAC CGCGAGGCCG AGAAGACGTA CGCCGCCGTG
GCCGCCGAGC TGAACGAGAT CATCGACGGC GCGCAGCGGG CCCTCGCGGG AAACCCCGGC
TCCGGCGAGC TGGTCTTCAA CGCGGCGCCG CACACCCGCC ACGGCGTCCC GGCCGGCGGC
GCGGCCGCGG TCCCAGCCCC CGCCCTCGCC GCCGAACTCC GCACGCGCGG CGAGGGCGGA
TACGTCCTGG ACGACGGCCT GCTCCGGGTG GAGGTCGACG GCCGCGGCCT GGTCGTCTCG
GCGTACGACC TGCGGGCGGA GCGGGAGACG GTCGCGCCGG GGCAGGCCGC CAACCTGCTG
CAGATCCACC CCGACTTCCC CAACATGTGG GACGCCTGGG ACGTGGACCA GTTCTACCGG
AACACGGTCA CCGACCTGGT GGACGCCGAC GAGGTCGCGC CGGGCGAGGA GCCGGGCTCG
GTCCGCGTGG TGCGCTCGTT CGGCGCGTCG CGGGTCACAC AGGTCCTCAC CGTGGAGGCC
GGGCGGCTCG ACATCCGCAC CGAGGTGAAC TGGCACGAGA CCGAGAAGTT CCTCAAGCTG
GCCTTCCCGC TGGACGTGCA CGCCGACCGG TACGCCTCCG AGAGCCAGTA CGGCCACGCC
TTCCGCGCCA CCCACACCAA CACCAGCTGG GAGGCCGCCA AGTTCGAGGC GTGCAACCAC
AAGTTCGTGC ACGTGGAGGA GCCGGACTGG GGCGTGGCGC TGGTCACCGA CTCCACCTAC
GGCCACGACC TGACCCGCAC CGTGCGGGCG TCCGACTCGG GCACGACCAC CACGCTGCGG
GTGTCGCTGC TGCGGGCGCC GCGGTTCCCC GACCCCGAAA CCGACCAGGG CCTGCACCTG
TTCCGGCACG CACTGGTGCC GGGCGCGGCG ATCGGCGACG CGGTGCGCGA GGGCTGGTTC
CTCAACCTGC CCGAGCGGCG GGTCCCGGGC GACGCCGAGG TCGCGCCGCT GGTCACCGTG
GACGACGACG CGGTCGTGGT AACCGCCGTG AAGCTGGCCG ACGACGGGAG CGGAGACGTC
GTCGTCCGCT TCCACGAGTC CCGGGGAGGC CGGGCCCGCG CCCGCGTCTC CACCGGCTTC
GCCGCCACCG GCGTCGCCGT GACCGACCTG CTGGAGCGCC CGCTCACCGA CACGGCCCCA
CCGGAGCTGA CCGACGGGTC GGTGGCGGTG GCACTGCGCC CGTTCGAGCT GGTCACGCTA
CGGTTCAGCC GCTAA
 
Protein sequence
MHDDRELVEN RLKRVLDERI RPAVYPASVP LDVAVWHAPG EPVPVAEGLA ASPSPIAVGD 
AWGAPWGTSW FTVTGTVPAD WAGRTVEAIL DLGFDENMPG FQCEGLVYRP DGNPVKGLNP
RNQWVRVGAP VQGGEEVRLH IEAASNPVIL DYHPFLPTKL GDKETAGSEP QYRLARMELA
IFDETVWQLV IDLEVLGELM AELPVDGARR WELLRAVERA LDAIDLQDVN GTAAAARAEL
AGVLATPAAP SAHRISAVGH AHIDSAWLWP LRETVRKVAR TTSNMTALLE DEPEFVFAMS
QAQQWAWIKE HRPEVWTRVV KAVADGRFVP TGGMWVESDT NMPGSEAMAR QFVHGKRFFL
DEFGVENEEA WLPDTFGFAA GLPQIIKAAG SKRLLTQKIS WSQINKFPHH TFLWEGIDGT
RIFTHFPPVD TYNCSMKGSE IAHAARNFKD KGVARHSLAP TGWGDGGGGT TREMVAKAAR
LRDLEGSATV VWETPAEFFA KAEVEYPAPP VWVGELYLEL HRATLTSQAK TKQGNRRSES
LLREAELWAA TAAVRTGSEY PHERLDRIWK TVLLHQFHDI LPGSSIAWVH REAEKTYAAV
AAELNEIIDG AQRALAGNPG SGELVFNAAP HTRHGVPAGG AAAVPAPALA AELRTRGEGG
YVLDDGLLRV EVDGRGLVVS AYDLRAERET VAPGQAANLL QIHPDFPNMW DAWDVDQFYR
NTVTDLVDAD EVAPGEEPGS VRVVRSFGAS RVTQVLTVEA GRLDIRTEVN WHETEKFLKL
AFPLDVHADR YASESQYGHA FRATHTNTSW EAAKFEACNH KFVHVEEPDW GVALVTDSTY
GHDLTRTVRA SDSGTTTTLR VSLLRAPRFP DPETDQGLHL FRHALVPGAA IGDAVREGWF
LNLPERRVPG DAEVAPLVTV DDDAVVVTAV KLADDGSGDV VVRFHESRGG RARARVSTGF
AATGVAVTDL LERPLTDTAP PELTDGSVAV ALRPFELVTL RFSR