Gene Sros_8547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8547 
Symbol 
ID8671881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9431575 
End bp9432783 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content74% 
IMG OID 
Productthreonine dehydratase 
Protein accessionYP_003343932 
Protein GI271969736 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGATG CCGAGGTGAC CTACGACGAC GTGGTCGCCG CCCGTGAGCT CCTGGCCGAC 
GTGGCCCTGC GGACGCCGCT CCTGCACTCG CACGTGCTGT CGGCCGCCAT CGGCGGGCCG
GTCCACCTCA AGTGCGAGAA CCTGCAGCGC TCCGGGTCCT TCAAGGTGCG CGGGGCCTAC
GTGAGGATCG CCGGGCTGAC CGAGGAGGAG CGGGGCCGGG GCGTGGTCGC GGCCAGCGCC
GGCAACCACG CGCAGGGCGT GGCGCTCGGC TCGGGCCTGG TCGGGGCCAG GGCGACGGTC
TACATGCCGG AGGGGGCGCC GCTGCCCAAG GTGGCCGCGA CCCGCTCCTA CGGCGCCGAG
GTCGTCTTCG CCGGCCACAC CGTGGACGTC GCGCTGGCCC GCGCCAAGGA GCACGCCGAG
CGGACCGGGG CGGTGTTCAT CCACCCCTTC GACCATCCGG ACGTCATCGC CGGGCAGGGC
ACGATCGGGC TGGAGATCCT GGAGCAGCTC CCCGAGGTCG GCACGATCGT GATGTCGCTC
GGCGGCGGCG GGCTCACCGC CGGGGTCGCG CTCGCGGTCA AGTCGCTCCG GCCGGGCGTG
CGGATCGTCG GCGTGCAGGC CGAGCGGGCC GCCGCCTACC CGGCCTCGCT GGCGGCCGGG
CAGCCGGTCA TGGTCCAGCC CGCCTCCACG ATGGCCGACG GCATCGCGGT GGGCCGTCCG
GGGGAGCTGC CCTTCGAGCT GATCCACTAC CTGGTCGACA GCGTCGTTAC GGTCACCGAG
AGCGACATCT CCCGGGCGCT GGTGCTCTGC CTCGAACGGG CCAAGCAGGT CGTCGAGCCC
GCCGGGGTGG CGGGTGTGGC CGCGATCCTG GCCCAGCCGC AGGTCTTCGA GCCGCCCGTG
GTGGCCGTGC TCTCCGGCGG CAACATCGAC CCGCTGCTGC TGGCCAAGGT CCTCCGGCAC
GGCCTGGCCG CGGCGGGACG CTACCTGACC CTGCGGGTGC CGCTGCCCGA CCGGCCGGGC
GCGCTGGCCG TGCTCCTGAC CGAGCTGGCC GGGCTCGGCG CGAACGTCCT CGACATCGTG
CACGAGCGGC TCGGCGTGCA CCTGGGCGAG GTCGAGGTCC AGCTCCAGCT GGAGACCAAG
GGCCCCGCCC ACTCCGACGA GGTGCTCGCG TCGCTGCGCA AGGAGGGCTA CCGGCTCATC
TTCGGATGA
 
Protein sequence
MSDAEVTYDD VVAARELLAD VALRTPLLHS HVLSAAIGGP VHLKCENLQR SGSFKVRGAY 
VRIAGLTEEE RGRGVVAASA GNHAQGVALG SGLVGARATV YMPEGAPLPK VAATRSYGAE
VVFAGHTVDV ALARAKEHAE RTGAVFIHPF DHPDVIAGQG TIGLEILEQL PEVGTIVMSL
GGGGLTAGVA LAVKSLRPGV RIVGVQAERA AAYPASLAAG QPVMVQPAST MADGIAVGRP
GELPFELIHY LVDSVVTVTE SDISRALVLC LERAKQVVEP AGVAGVAAIL AQPQVFEPPV
VAVLSGGNID PLLLAKVLRH GLAAAGRYLT LRVPLPDRPG ALAVLLTELA GLGANVLDIV
HERLGVHLGE VEVQLQLETK GPAHSDEVLA SLRKEGYRLI FG