Gene Sros_3184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3184 
Symbol 
ID8666472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3471232 
End bp3472413 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content75% 
IMG OID 
Productcytosine deaminase 
Protein accessionYP_003338872 
Protein GI271964676 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.379917 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0692184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCGAGC TGCTGCTGCG CGACGCCCGG ATCTGGGGCC ACGAGGGCGC CGCCGACCTG 
CTGATCAGGG ACGGCCGGAT CGCCGAGATC CGGCGGGCCG GTCAGGACGG CGTCGGGGAC
GGCGCGGAGG ACCTGGGCGG CCGGCTCGTG CTCCCCGGCC TCGTCGACGG CCACGCCCAC
CTGGACAAGA CGCTGTGGGG CGGCCCTTGG GTGCCGCACG ACGCGGCTCC CGGCCTGATG
AGCAAGATCC GCAACGGCCA GGAGCGGCGG CCGGAGTTCG GCGTCCCCAA CGCCGACTTC
GTGACCGCGC TGCTGGAGAA CATGATCGTC TGCGGCACCA CGCACGTCCG GTCGCACGTG
GACGTCGACC CGCTGGTGGG CCTGGGCTCG GTCGAGGCGG TCCGCGAGGC CGTCGAGCGG
CACGACGGCC GGATCTCCGC CGAGCTCGTC GCCTTCCCGC AGGCCGGCAT GCTGATCAGC
CCCGGCACCG AGGCGCTGCT GGAGGAGGCG CTCAAGGCCG GGGTCGAGTC GATCGGCGGG
CTCGACCCCG CGGGGGTGGA CCGGGACGCG GTCAGGCACC TGGACGCGGT GTTCGCCCTG
GCGGAGCGGT ACGGCGCGGG CGTGGACATC CACCTGCACG ACGGCGGCAC GCTCGGCGCC
TGGCAGTTCG AGCTCATCAT CGAGCGGACC AAGGTCCTCG GGCTGGGCGG CAAGGTCACC
GTCAGCCACG CCTTCGCGCT CGCCGACGCC GACCCCGCCA CCCGGGACAG GCTGATCTCC
GGCCTGGCCG AGGCCCGGAT CGCCCTCGCC TCGGTCGCCC CGAACCGGGG CCTGCTGCCG
CTCCGCCTGC TCGGCGAGGC CGGGGTCCCC GTGACGCTGG GCAACGACGG CGTCCGCGAC
CTGTGGAGCC CGTTCGGCAC CGGGGACATG CTGGAGCGCG CCCTGTTCCA GGCCAAGGGC
ACCGGGTCGC GCGACGAGGA CATCGAGCTG GCCCTGGACG CCGCGACCTA CGGCGGAGCC
CGCGCCCTCG GCCTCCGGGA CTACGGCCTG TCCGTCGGCG ACCACGCCGA CCTGGTGGTG
GTCGACGCCC GCAACGCGGC CGAGGCCGTG TGCGTGCGCC CGCGCCGGTC CCTGGTCGTC
AAGTCCGGCC GGATCGTCGC CCGCGACGGC GTTCTCGCCT GA
 
Protein sequence
MTELLLRDAR IWGHEGAADL LIRDGRIAEI RRAGQDGVGD GAEDLGGRLV LPGLVDGHAH 
LDKTLWGGPW VPHDAAPGLM SKIRNGQERR PEFGVPNADF VTALLENMIV CGTTHVRSHV
DVDPLVGLGS VEAVREAVER HDGRISAELV AFPQAGMLIS PGTEALLEEA LKAGVESIGG
LDPAGVDRDA VRHLDAVFAL AERYGAGVDI HLHDGGTLGA WQFELIIERT KVLGLGGKVT
VSHAFALADA DPATRDRLIS GLAEARIALA SVAPNRGLLP LRLLGEAGVP VTLGNDGVRD
LWSPFGTGDM LERALFQAKG TGSRDEDIEL ALDAATYGGA RALGLRDYGL SVGDHADLVV
VDARNAAEAV CVRPRRSLVV KSGRIVARDG VLA