Gene Sros_4175 details


Gene Information

Locus tag: Sros_4175
Symbol:
ID: 8667469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4647976
End bp: 4649196
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003339822
Protein GI: 271965626
COG category:
COG ID:
TIGRFAM ID:
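
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the gene length counts the stop codon. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4647976, 4649196)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```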


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.33945
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTGTTCG TCGGTGATGA CTGGGCTGAA GACCATCACG ATGTCGAGGT CCAAGACGAG 
GACGGCAAGG TGGTCAAGCG GGTCCGGCTG CCCGAGGGGA TGGCCGGGAT CACCCGGCTG
CACGACCTGG TCGGCCGGTT CGTGGCCGAG GACGCCGACC CGTCCGACGT GCTCGTCTGC
ATCGAGGTCG ATCGGGGCCC GTGGGTGCGG GCGCTGGTGG CCGCGGGCTA TCGGGTGTTC
GGCGTCGATC CCAAGCAGGC CGCCCGGCAC CGGGAGATCC TCGGCAGCTC GGGGGCCAAG
AGCGACAAGG GCGACGCCCA CGCCCTGGCC GACATGATCC GCACCCGCCG CAACCAGCTG
CGCCAGGTCG CCGGGGACTC GGAGATCGCA GAGGCCGTCA AGGTCGTCAC CCGGGCGCAT
CAGACGTTGC TGTGGGAACG CACCCGGCAC ATGCTGCGCC TTCGGGTGGC GTTGCGGGAC
TACTTCCCCG CCGCCCTTGC CGCCTACAAG CCGCTCGGCC TCACCTCGGA GGCGGTGCTG
AGGCTGCTGG CCAAGGCCCC CACCCCCGAG ACGGCGGCCA AGCTGACGAT CAACCAGATC
AGCGCGACGC TCAAGGGCCG CCGCGACATC GGCGCCAAAG CCGCGGCGAT CCAGGACGTG
CTGCGGGGCG AGCACCTCGG CCAGGCCCCG CTCGTCACAG GTGCCTACGC CTCCACCGTG
AAGGCCCTGG CCGCCGTCAT CACCGTCCTG AACAGCGAGA TCAAGACGCT TGAGGGTGAG
GTCGAGGCTC ATTTTGGCCG GCACCCGGAC GCTGAGGTCA TCCTCAGTCA GCCGGGCATC
GGCGTCGTCC TCGGCGCCCG GGTGCTCGCC GAGTTCGGAG ACGCCGAAGG CCGCTACGTG
AGCGCGAGGG CCCGCAAGAA CTACGCCGGA ACCTCGCCGA TCACCCGGCA GTTCGGCAAG
ACCAAGATTG TCCAGGCCCG GTTCGTCCAC AACGACCGGC TCGTCGACGC TCTCCATCTC
CAAGCCTCCT GCGCCCTCCT TCACGATCCT GAGGTCCGCG CTTACTACGA CCAGCTCAAA
GCCCGTGACG TCAGCCATAA CGCCGCTCTC CGCCAAGTCG GCAACCGCCT GGTGGGCATC
CTCCACGGCT GCCTCAAAAC CCACACCACC TACGACCAGG CAACCGCATG GTCACATCGC
AACCACGACC TCGCCGCTTG A
 
Protein sequence
MLFVGDDWAE DHHDVEVQDE DGKVVKRVRL PEGMAGITRL HDLVGRFVAE DADPSDVLVC 
IEVDRGPWVR ALVAAGYRVF GVDPKQAARH REILGSSGAK SDKGDAHALA DMIRTRRNQL
RQVAGDSEIA EAVKVVTRAH QTLLWERTRH MLRLRVALRD YFPAALAAYK PLGLTSEAVL
RLLAKAPTPE TAAKLTINQI SATLKGRRDI GAKAAAIQDV LRGEHLGQAP LVTGAYASTV
KALAAVITVL NSEIKTLEGE VEAHFGRHPD AEVILSQPGI GVVLGARVLA EFGDAEGRYV
SARARKNYAG TSPITRQFGK TKIVQARFVH NDRLVDALHL QASCALLHDP EVRAYYDQLK
ARDVSHNAAL RQVGNRLVGI LHGCLKTHTT YDQATAWSHR NHDLAA
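
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted start codons and is read as methionine when it initiates translation. The sketch below illustrates this on the first 30 bases of the gene sequence. The helper name `translate_cds` is hypothetical. The codon table is built from the standard NCBI amino-acid string in TCAG order, and the rule of forcing M at a recognized start codon is a simplifying assumption rather than a description of how this record was produced.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 shares these assignments with the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_block = "TTGCTGTTCG TCGGTGATGA CTGGGCTGAA"
print(translate_cds(first_block))  # MLFVGDDWAE
```

The result matches the first ten residues of the protein sequence. The same function can be applied to the full gene sequence by passing all of its lines joined together.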