Gene Sros_3052 details

Gene Information

Locus tag: Sros_3052
Symbol:
ID: 8666339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3329975
End bp: 3331729
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: Thiamine pyrophosphate-requiring protein-like protein
Protein accession: YP_003338747
Protein GI: 271964551
COG category:
COG ID:
TIGRFAM ID:
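
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length(3329975, 3331729)
print(length, "bp")                  # 1755 bp
print(protein_length(length), "aa")  # 584 aa
```

Under those assumptions both results agree with the Gene Length and Protein Length fields.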


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.912898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.512138
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGTCG CCGAGGCCGT GGGCCGCGTC CTCGCCTCGC TCGGTGTGGA CACCGCCTTC 
GGCGTGGTCG GCAGCGGCAA CTTCCACGTG ACCAACGCGC TGGTCGAGCA CGGCGTGCGG
TTCGTCGCCG CACGCCACGA GGGCGGCGCG GCGACCATGG CCGACGCCTA CGCGCGCACC
GGCGGCCGGG TCGGCGTGCT CAGCGTCCAC CAGGGGCCGG GGCTCACCAA CGCGATGACC
GGCATCACCG AGGCCGCCAA GAGCCGCACG CCGCTCATCG TGCTGGCCGC CGAGGTGACC
GAGCCCCGGT CCAACTTCTT CGTCGACCAG GCCGCCCTCG CCACGGCGGT CGGCGCCGTC
CCCCTGCGGA TCACCTCCGC CGGGACCGCC GTCGCGGAGA CGCTCCAGGC GTTCCACCTG
GCCCGCGACG GGCGGCGCAC GGTCCTGCTG AACCTCCCGC TGGAGGTCCA GGGCCGGCCC
GTCCCCACCC CGCCCGCTCC CGCCTCTCTC GCCTCCGTCC CGGAGCCCTC CGAGCCGGAG
GCGAGGGAGG TCGCGCGGCT GGCGGAGCTT CTCGGGGCCG CGCGGCGGCC GGTGTTCGTG
GCGGGGCGGG GAGCACGCGC GGCCAGGCTG GAGCTGGAGG AGCTCGCTGA GCGGATCGGG
GCGCTGCTCG CCACCTCCGC CGTGGCCAAG GGACTCTTCC GGGGCAGCCC GTGGGATCTG
GACGTGAGCG GTGGCTTCGC CTCGCCTCTC ACCGCCGAAC TCGTCCGCGG CGCCGACGTG
ATCGTCGGCT GGGGCTGCGC GCTCAACATG TGGACCATGC GCCAGGGCAC GCTCATCGGC
CCGGAGGCCA AGGTCGCCCA GGTCGACCTG GACGCCGACG CCCTCGGCGC CCACCGGCCG
ATCGATCTCG GCGTGGTCGG CGACGTCGCC CTCACCGCGC GGTCCGTCAC CACCCTGCTC
GCCGGGGGCG GGGACGACCT CCGCCAGGCG TCCGCCGCGC GGGACGGGGC AGGCCGGGAG
GCTTCTGCCG TACCGGGCGA CGGGCCCGGG GCGCCGGACG CGCCGGGTGG GAGCGGTCCC
GGCGCGCCGG GTGGCATCGG GTACCGGTCG CGGGTGTTGG CCGAGCGGAT CGCCCGCGAG
AACCGCTGGC GGGACGTGCC CTATGCCGAC GAGGGGGGCG AGGGCCGCAT CGACCCCCGC
ACCCTCACGA TCGAGCTGGA CGACCTCCTC CCCGCCGAAC GCGTCGTCTC CGTCGATTCC
GGAAATTTCA TGGGATATCC GTCGATGTTC CTCGACGTCC CAGATGAACG CGGTTTCTGC
TTCACCCAGG CATTTCAGTC CATCGGCCTC GGCCTGGCCA CCGCGATCGG CGCCGCCCTG
GCCCAACCGG CCCGACTCGC GGTGGCGGCG CTGGGCGACG GGGGCGCGCT GATGGGCGTC
GCCGAGTTGG AGACGGTCGT ACGGCTCGGC CTTCCGATGG TGATCGTGGT CTATGACGAC
GAGGGCTACG GGGCCGAGGT CCACCACTTC GGCCCGGACG GGCACAGCCT GGACACCGTC
ACCTTCCCGC CCGTCGACAT CGCCGCCATA GCCCGGGGTT TCGGCTGCGA GGCGGTGACC
GTACGGGGCC GGGAGGACCT CGCCGCGGTG GCCGGATGGC TGGACGGGCC GCGGCACCGG
CCGCTGCTGG TCCACGCCAA GGTCAGTGGC GCGCGGGGGT CGTGGTGGCT GGAGGAGGCC
TTCCGCGGGC ATTGA
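
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The sketch below shows one way to strip the whitespace and compute the GC fraction; the helper names are hypothetical, and only the first two blocks of the sequence are used as a small demonstration rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, as printed in the record.
first_blocks = "GTGAACGTCG CCGAGGCCGT"
print(clean_sequence(first_blocks))
print(round(gc_fraction("GTGAACGTCG"), 2))  # 0.6 for the first block
```

Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field. Note also that the gene starts with GTG while the protein starts with M: under translation table 11, GTG is an accepted alternative start codon and is translated as methionine at the initiation position.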
 
Protein sequence
MNVAEAVGRV LASLGVDTAF GVVGSGNFHV TNALVEHGVR FVAARHEGGA ATMADAYART 
GGRVGVLSVH QGPGLTNAMT GITEAAKSRT PLIVLAAEVT EPRSNFFVDQ AALATAVGAV
PLRITSAGTA VAETLQAFHL ARDGRRTVLL NLPLEVQGRP VPTPPAPASL ASVPEPSEPE
AREVARLAEL LGAARRPVFV AGRGARAARL ELEELAERIG ALLATSAVAK GLFRGSPWDL
DVSGGFASPL TAELVRGADV IVGWGCALNM WTMRQGTLIG PEAKVAQVDL DADALGAHRP
IDLGVVGDVA LTARSVTTLL AGGGDDLRQA SAARDGAGRE ASAVPGDGPG APDAPGGSGP
GAPGGIGYRS RVLAERIARE NRWRDVPYAD EGGEGRIDPR TLTIELDDLL PAERVVSVDS
GNFMGYPSMF LDVPDERGFC FTQAFQSIGL GLATAIGAAL AQPARLAVAA LGDGGALMGV
AELETVVRLG LPMVIVVYDD EGYGAEVHHF GPDGHSLDTV TFPPVDIAAI ARGFGCEAVT
VRGREDLAAV AGWLDGPRHR PLLVHAKVSG ARGSWWLEEA FRGH