Gene Sros_3197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3197 
Symbol 
ID8666485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3488919 
End bp3490940 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003338885 
Protein GI271964689 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGACG ATCCGGCCAT GCCCCCCGCC GTGGCGCGGG CCCTGGAGGA GTACCGGGCG 
CTGCTCGCCG AACACGGGGT GACCTGGGGC GAGGACCCCA TCCCCTACGT CAGGTCCATG
ACCGCCGACG CCTACCTGAT GGGCACCACG ACCTTCCGGG ACGTCTGCCA GGGCATGCTC
CGGGCCCGTT ACCCGCAGGC GTCCTCCGGC GAGCTGGCGG GGCGCTTCGC CGAGCTGGAC
ATGGACGAGG TCGTGCGCAA CGTCCTGGCG GGCAGGGTGT CGGACAACCT CGCGGCCCTC
CGGCTCACCG GGGAGGGCCT CGCGGTCGAG GCGCACCCGC TGGCCGTACT GGACGGCGGG
CCCCTGCGCA CGACGCTGCT GGTCGACTCC GCCCGCGACG AGCCGGTCAC CGTGCTGGTG
GACGGCCGGG CGCACGAGAT CGGCCCGCGC GGCGCGCGCC TCATCAAGAT CACCAGCGGG
AGCGAGGTGA TCGCCGACGG CGGGCGCGTG GACCTCACGC CCCTGACCCG CCCCGCCGCC
GCCGCGCGCC TGCGGCTGCG CGCCGGGTTC CCGTGCAGGT GGAGCGTGTA CGGCGAGCAG
GGGCAGGGCT GGTACCCCGA GGGCGCCCCG GCCAGGCGCG ACTACCACGT CCTGCCCTAC
TTCCACGGCG ACGACGTGGT GCTGGACGTC CCGGCCGAGC CGCTGACCGT GCGGGTCTCC
AGGGGGATGG AGTACGGCTC CGCCGAGCTG GCGGTCACTC CCGCGGCCGG CGAGGAGACC
CTGGTCGAGC TGGCGCCCGA GCGGATCTAC GACGCGGCGG CGCTGGGCTG GTACGGCGGG
GACATGCACG CCCACCTCAA CTGGGCCGGG GACATGGTCG GCACCCCGGC GCTGGCGGCG
GCCATGCAGC ACGGCGAGGA CCTGCACGTG CTCAACCTGG TGGCCGGGAA CGTCTCCTCC
GAGCGCGTCT ACGACTCCGA GGCGCTGGCA CACTGGGCGG GCCGGGACCT GCCGTGGTCG
GACGGCACCC ACCTGGCCAG GATCGGCGTC GAATACCGCA ACGACCTCCT CGGCCACCTC
TACGCCTTCG GCGTCTCGGC GCCGCCCTCG CGCTTCCACA CCGGTTTCCT GGGCACCGCG
GACTGGCCGC CCAACAGCGT CGCCTGCGAG GAACTGCGCG GCCTCGGCGC GCTCCTGGGC
TACAGCCACC CGTTCCACAA CCCGATCTCC GACACCGACG GCCCCGGCCA CCTGCTGTGG
CAGGGCCGCA ACTGCTCCTC CCGGGAGATC GTCGCCGACG CCGCCCTCGG CCTGGTGGAC
AGCCTCGACG TGCTCAACCA CACCTCGATC GCCGCGACCG CCGCCGTCTA CCGGCACCTG
ATCGGCGCGG GCAACCGGAT CGCGGTCACC GCGGGGACCG ACGCGATGGT CTCCTTCGCC
CGGCGCGGCA ACCAGTCCAA CCCGCCGGGC TGGGCCCGTG TCTACGCCCG CGTCGAGGGG
CCGCTCACCG CCGGGTCGTT CGCCGAGGCC GTCAGGCGGG GCCGTACGTT CGGCACCACC
GGCCCCTGGC TGGAGCTGTC GGCCGGCGGG CACGGACCCG GTGCCACCCT GGACCTCTCG
CCGGGAGAGC GGGTCACGGT CACCGCGAGG TCGGCAGGTC CCGAGGTGGA GAGGCTGGAG
ATCCGCACCG CCGACGGCGT CCTGGCCGAG GGGCCGCCCG CCGAGCTGAC CTGCGAGCTG
GTCGCCGGCG ACCCCACCTA CGTCGTCGCC GTGGCCGTCG GCGGGCCGCA CGAGCGCGCC
CTCACCGGCG GCGCCTACGC CCATACCAGC CCGGTCTACC TCGACGTCGC CGGCCGTCAC
GTGGCCAGGG AGCAGGACGT CCGCTGGTGC CTGGAGTGGC TGGACGGCAT GGAGACGCTG
CTCCGGCGGC AGGGCACGTT CGAGACCGCC GCGCAGCTCG GCGACCACCT GGAGCTGATC
GAGCGGGCCA GGGAGGTCTA CCGCGCCCGC CTGGGCTCAT AG
 
Protein sequence
MCDDPAMPPA VARALEEYRA LLAEHGVTWG EDPIPYVRSM TADAYLMGTT TFRDVCQGML 
RARYPQASSG ELAGRFAELD MDEVVRNVLA GRVSDNLAAL RLTGEGLAVE AHPLAVLDGG
PLRTTLLVDS ARDEPVTVLV DGRAHEIGPR GARLIKITSG SEVIADGGRV DLTPLTRPAA
AARLRLRAGF PCRWSVYGEQ GQGWYPEGAP ARRDYHVLPY FHGDDVVLDV PAEPLTVRVS
RGMEYGSAEL AVTPAAGEET LVELAPERIY DAAALGWYGG DMHAHLNWAG DMVGTPALAA
AMQHGEDLHV LNLVAGNVSS ERVYDSEALA HWAGRDLPWS DGTHLARIGV EYRNDLLGHL
YAFGVSAPPS RFHTGFLGTA DWPPNSVACE ELRGLGALLG YSHPFHNPIS DTDGPGHLLW
QGRNCSSREI VADAALGLVD SLDVLNHTSI AATAAVYRHL IGAGNRIAVT AGTDAMVSFA
RRGNQSNPPG WARVYARVEG PLTAGSFAEA VRRGRTFGTT GPWLELSAGG HGPGATLDLS
PGERVTVTAR SAGPEVERLE IRTADGVLAE GPPAELTCEL VAGDPTYVVA VAVGGPHERA
LTGGAYAHTS PVYLDVAGRH VAREQDVRWC LEWLDGMETL LRRQGTFETA AQLGDHLELI
ERAREVYRAR LGS