Gene Sros_2758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2758 
Symbol 
ID8666044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2995906 
End bp2997108 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content69% 
IMG OID 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_003338459 
Protein GI271964263 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0785401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.423621 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGT CCGCCTTGGG TGAGCCCTGC GTTCTGCTCA AACTGGGCGA GGTCGTCCTC 
AAGGGAAACA ACCGCGAGCT CTTCGAACAG CGGTTGCAGG CCAACATCAA GGCGGCCCTG
AAGGACTTCG ACTTCAAGGT GGACGTGCGC CAGCGCCACG GCGTCATCGC GCTCTTCCTG
CCGGACGGCA CCACCCCGGA GGTCGCCGAC GCGGTCGCCG AGCGGGTCGC CTACGTCCCC
GGTCTGGTCT GGATCCACCG GGCCTGGCGG GTGGAGAAGG ACCCGGGCGC GGTCACCAAG
GCGGCGATCG AACTGCTGGC CGACCGGGAC GACGTCAGGC GCGGCGTCTC CTTCGCCGTC
CGCTCCCGCC GCCGCGACAA GCGCTTCCCG CTGACCTCGA TGGAGATCGA CCGCTCGGTC
GGCGGCGAGC TCAACGACAT CTACGGCCTG CCGGTCGACC TGAAGAACCC CGAGCTGGTC
GTCTCCATCG AGGTGGACCG GGACGAGGTC TTCGTCTTCA CCGGCGGCAC GCCCGGTCAG
GGCGGCCTGC CGGTGGGCAC CAGCGGCCGG GGCCTGGTCC TGATGTCCGG CGGCATCGAC
TCCCCGGTGG CCGCCTACCG GATGATGCGG CGCGGCCTGC GGGTGGACTT CCTGCACTTC
TCCGGCATCC CGTTCACGAC GTCGGAGTCG ATCTACAAGG CGTACGCCCT GGTGCGCGCC
CTGGACAGAT TCCAGGGCAG GTCCCGGCTC TGGGTCGTCC CGTTCGGCAA GGCCCAGCAG
TCCATCAAGG CCTCCGGTCA GGACCGCCTG GCGGTCATCT CCCAGCGCCG TCTGATGCTC
AAGACCGCCG AGGAGGTCGC CCGCCGCCTC CGCGCCGGCG CGCTGATCAC CGGTGACTCC
CTGGGCCAGG TCTCCTCCCA GACCCTGCAG AACATCACCG CCCAGGACGA CGCGGTCGAC
CTGCCGATCC TGCGGCCGCT CATCGGTCTC GACAAGACCG AGATCATGGC GGAGGCCCGC
CGTATCGGCA CCCTGGAGAT CTCCGAGCTC CCCGACGAGG ACTGCTGCAC CCTGCTGGCC
CCCCGTCGCG CCGAGACCGC AGCCAAGATC GCGGACCTGC GGCAGATCGA GAAGCGTCTC
GACGCCGAGG AACTGGCCGT CCAGCTCGCC GGCTCCCTCC AGGAGTACAA GCTGGAAGGC
TGA
 
Protein sequence
MTMSALGEPC VLLKLGEVVL KGNNRELFEQ RLQANIKAAL KDFDFKVDVR QRHGVIALFL 
PDGTTPEVAD AVAERVAYVP GLVWIHRAWR VEKDPGAVTK AAIELLADRD DVRRGVSFAV
RSRRRDKRFP LTSMEIDRSV GGELNDIYGL PVDLKNPELV VSIEVDRDEV FVFTGGTPGQ
GGLPVGTSGR GLVLMSGGID SPVAAYRMMR RGLRVDFLHF SGIPFTTSES IYKAYALVRA
LDRFQGRSRL WVVPFGKAQQ SIKASGQDRL AVISQRRLML KTAEEVARRL RAGALITGDS
LGQVSSQTLQ NITAQDDAVD LPILRPLIGL DKTEIMAEAR RIGTLEISEL PDEDCCTLLA
PRRAETAAKI ADLRQIEKRL DAEELAVQLA GSLQEYKLEG