Gene Sros_1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1046 
Symbol 
ID8664320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1062151 
End bp1063278 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content72% 
IMG OID 
Productglycerophosphoryl diester phosphodiesterase 
Protein accessionYP_003336789 
Protein GI271962593 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.413626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0540655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGGC ACCCGAGGGC CGCCCGGCCG ACGACGCCCG CCCAGCCACC GAACGCGGCT 
CAACCCCCGA GTACGGCTCG GCCCACGGGT CCGTCGTCCG CCCCGCTCGG CGGTGGCGCT
CGTCTGCCGA TCGTCGTCGC GCATCGGGGG GCCAGCGCCT TCCGGCCCGA GCACACGCTT
CTCGCCTACG AGGTCGCCAT CCGGCTGGGC GCCGACTACA TCGAGCCGGA CCTGGTCTCC
ACCAAGGACC ACGTGCTGGT CTCCCGGCAC GAGAACGAGC TCTCGGCGAC CACCGACGTC
GCCGGCCACC CGGAGTTCGC CGCCCGCAGG ACGACCAAGA CCATCAACGG CGGCCGGGTG
ACCGGCTGGT TCACCGAGGA CTTCACCCTC GCCGAGCTGC GTACCCTCCG CGCCAGGGAA
CGCTTCCCCC GGCGGCGGCC GGCCAGCACG GCATACGACG GGAAGGCGCA GATCCCGACG
CTGGAGGAGA TCGTCCTGCT CGCGCAGAAG CACGGCGTGG GCATCTATCC CGAGATCAAA
TATCCGAGCT ACTTCGCCTC GATCGGGCTG CCGATCGAGG GACCGCTGCT GGAGACCCTC
CGGCGCCACG GCTGGGACGA CGCCTGCGAT CCGGTGTTCA TCCAGTCCTT CGAGACGGGG
AACCTCAAGC GGCTGCGGTC CGTCACACGG TTGCGGCTCA TCCAGCTCAT CGGGGCCGGA
AGCGGCCCGC CGTACGATCT GCTGAAGAGC GTCAACCCGC CCACCTGCGA CGATCTCGTC
ACCCCGGCCG GTCTGCGGCA GATCGCCGCG TACGCCACCG GCATCGGTGT GACCACCACG
CGGATCGTGC CGGTCGGCTC CGACGGGAGA CTGGGCGCTC CCACCTCGCT CGTCCAGGAC
GCCCACCAGC TGGGTCTCCA GGTTCACGTC GCGACGATCC GCGACGAGAA CATGAGCCTC
CCGGCGGACT ACCGGCGGGG CGATCCCGCC GGACGGGCCT ACTCCCGTGC CGCCGGGGAC
GTGACGGGCT GGCTGGCACG GCTGTACGGG CTCCGGGTGG ACGGGGTGCT CGCCGACAAC
CCGGGTGTCG CCCGTGCCGT ACGGGATCGC CTGCTCACCG GCGGCTGA
 
Protein sequence
MARHPRAARP TTPAQPPNAA QPPSTARPTG PSSAPLGGGA RLPIVVAHRG ASAFRPEHTL 
LAYEVAIRLG ADYIEPDLVS TKDHVLVSRH ENELSATTDV AGHPEFAARR TTKTINGGRV
TGWFTEDFTL AELRTLRARE RFPRRRPAST AYDGKAQIPT LEEIVLLAQK HGVGIYPEIK
YPSYFASIGL PIEGPLLETL RRHGWDDACD PVFIQSFETG NLKRLRSVTR LRLIQLIGAG
SGPPYDLLKS VNPPTCDDLV TPAGLRQIAA YATGIGVTTT RIVPVGSDGR LGAPTSLVQD
AHQLGLQVHV ATIRDENMSL PADYRRGDPA GRAYSRAAGD VTGWLARLYG LRVDGVLADN
PGVARAVRDR LLTGG