Gene Sros_8147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8147 
Symbol 
ID8671475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8987644 
End bp8989548 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content68% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003343545 
Protein GI271969349 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.675914 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCCC ATGCTCATGA GGCCTCACGG ATCGCTGTCG AGGACTTGTT CGACTCTCCC 
GTCCGTGCCG GTGCATCGAT CTCGCCCGAC GGCACGAAGA TCGCCTACCT GTCGCCCTGG
CGGGACCGGC TGAACGTATG GGTGGAGAGC CTCGACTCGG ACGAGGAGGC ACGGTGCGTG
ACCGCCGACG ACAACCGCAG CGTGCACATC TTCCACTGGA CGGATGATCC GCGCTGGCTG
CTGTACGAGC AGGACGGTGA CGGCGACGAG ATGTGGCACG TCTACCGGGT CGATCTGGCG
GACCCGGATG CCAAGGCGGT CGATCTCACC CCCTTTCCCG GGGCCCTGGC TGTGGGGTTC
GAGATGGTGG TCTCGCGGCC GGGTAAGGCG TTCCTGCATC TGAATGCGAG GAACCCCGTC
GAGTTCGACC TCTACGAACT CGACATCGCC ACAGGTGAGC TGACCATTCT GGCGCAGAAC
CCCGGCCAGG CCGCCGGCTG GCTGTGCACA CCCGCCGGAG ACCTGTACGC CCAGACCTTG
ACGGCCGGCG GTGACATCGA GCTGGCGCAG TGGGACACCT CAGAGGGGAA GCTGCGCCCG
GTCGCCACGT TCGACGGCGC TGACTACCCG CTGAGTATCC AGCCGTTCGA GCTCACCCCG
GACGGGACCG GCGTGTGGAT CGGCTCCAAT CGCAACAGTG ACCGGACCCG GCTGGTCCGG
CTCGATCTGG CCACCGGTGA GGAGACCGGT GTCGACAGCC ATCCGGTCTT CGACCTCGAC
ACACGGTGCG TCGTCTTCCC GACGATGCCG TCGCCGCTCA TCCGCAACCA GTACACCGGA
GCACTGCTCG GGGCGCGCTA TCTCGGCGAG CGGCAGGTGA TCCACGCCCT CGACCCGCAC
TTCGCCTCGG TGCTGCAGAA CCTGGAGCAG CTGTCCGACG GTGATCTGGC CGCGGTGTCC
TGCGACGTGA GCGGGAGGCG CTGGGTCGTC GGTTTCACCC ATGACCGTGA CCCCGGTGCC
ACTTACTTCT ACGACCACAC CAGCGGCGAG AGCCGACTGC TCTACCGGCC CTATCCGCAT
CTCGACCCCG ATGTCCTGGC CCCGATGACG CCGGTCACGA TCCCCGCACG CGACGGGCTG
CGCCTGCCCG CCTACCTGAC GCTGCCCGTC GGCGTCGATC CGGCCGGGCT GCCCCTGGTG
CTGCTGGTCC ACGGCGGCCC GTGGTTCCGC GACAGCTGGG GCTACCACCC CGTGGTGCAG
CTGCTGGCCA ACCGCGGCTA CGCGGTGCTG CAGGTCAACT TCCGCGGCTC GATGGGCTAC
GGAAAGGCGT TCCTCAAGGC CGGTATCGGG GAGTTGGCCG GGAAGATGCA CGACGACCTC
ATCGACGCCG TCGACTGGGC CGTCAAGCAG GGGTACGCGG ACCCGGACCG CGTCGCCATC
TTCGGCGGCT CGTACGGCGG ATACGCCACG CTGGTCGGCG TCACCTTCAC CCCCGACGTC
TTCGCCGCCG CGATCGACGT CTGCGGCCCC TCGAACCTCG TCACCTACCT GAGGACCCTG
CCGGAGTTCG CGCGGCCCGG CCTGGTCAAC AACTGGTATC TCTACGCCGG TGATCCGAGC
GACCCGGAGC AGGAGGCGGA CCTGCTGGCG CGCTCCCCCA TCAGCCGGGT GGACCAGATC
CGCACCCCGC TGATGGTGGT GCAGGGCGCC AACGACATCC GTGTCGTCAA GGCCGAGTCC
GACCGGATCG TCGACGCGCT GCGTGCCCGG GGCGTCGAGG TCGAGTACAT GGTCAAGGAC
AACGAGGGCC ACGGCTTCGT GAACCCGGAC AACAACATCG ACATGTACCG CGCGGCCGAC
CGCTTCCTCG CCCGTCACCT GAGCGGACGG CCGGACACGG AGTAA
 
Protein sequence
MPAHAHEASR IAVEDLFDSP VRAGASISPD GTKIAYLSPW RDRLNVWVES LDSDEEARCV 
TADDNRSVHI FHWTDDPRWL LYEQDGDGDE MWHVYRVDLA DPDAKAVDLT PFPGALAVGF
EMVVSRPGKA FLHLNARNPV EFDLYELDIA TGELTILAQN PGQAAGWLCT PAGDLYAQTL
TAGGDIELAQ WDTSEGKLRP VATFDGADYP LSIQPFELTP DGTGVWIGSN RNSDRTRLVR
LDLATGEETG VDSHPVFDLD TRCVVFPTMP SPLIRNQYTG ALLGARYLGE RQVIHALDPH
FASVLQNLEQ LSDGDLAAVS CDVSGRRWVV GFTHDRDPGA TYFYDHTSGE SRLLYRPYPH
LDPDVLAPMT PVTIPARDGL RLPAYLTLPV GVDPAGLPLV LLVHGGPWFR DSWGYHPVVQ
LLANRGYAVL QVNFRGSMGY GKAFLKAGIG ELAGKMHDDL IDAVDWAVKQ GYADPDRVAI
FGGSYGGYAT LVGVTFTPDV FAAAIDVCGP SNLVTYLRTL PEFARPGLVN NWYLYAGDPS
DPEQEADLLA RSPISRVDQI RTPLMVVQGA NDIRVVKAES DRIVDALRAR GVEVEYMVKD
NEGHGFVNPD NNIDMYRAAD RFLARHLSGR PDTE