Gene Sros_3059 details

Gene Information

Locus tag: Sros_3059
Symbol:
ID: 8666346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3335430
End bp: 3336809
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003338752
Protein GI: 271964556
COG category:
COG ID:
TIGRFAM ID:
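
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length (the gene sequence in this record does end in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 3335430
END_BP = 3336809


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(n_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon (assumed)."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1  # drop the terminal stop codon


n_bp = gene_length(START_BP, END_BP)
print(n_bp)                  # 1380
print(protein_length(n_bp))  # 459
```

Both results match the Gene Length (1380 bp) and Protein Length (459 aa) fields.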


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.253596
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAGT CAATCGCCAT CGTCCTGACC CTGCTGGGCT CGGGCGTGCA AGCACAGCAA 
GCCGACCCCC TCGACTGGCG TCCCTGCCCG AGTGACAAGG CCGGGATGGA ATGCGCCGAC
CTGCAGGTCC CGGTCAACTG GCAGAGAACC GACAGTCGCA AGATCACGCT GAAACTGGGC
CGCCTGAAGG CCACCGGCGC CTCCGAGGGC TCCGTCCTGG TGGCCTACGG CGGCCCGGGC
GGGCCGGGGA TCGCCCTCAC CCAGTCCGGA GCGGGAGGTT GGTGGACCAG GCTCCGCGAG
CGCATGGACA TCGTCACATG GGACACCCGG GGCTACGGCG AGCAGTTCGG CGGCCTCAGC
ACGGGCCTGC CGTGCACCTG GACACGGATC CCGCTGGCCG AGTTCCCCGA GGACGACGCC
GACTTCGGGC GGCTCGCCGA CACCAACCGC GGTTACGCCG AGGCCTGCCG CAACAAGGAC
CCCGAGTTCT TCGCCAACAT GAGCTCGGCC GACAACGCCA GAGACATGGA GGCGATCAGA
AAGGCGCTCG GCGGCGCCAG GCTCAACTTC TACGGCGCCT CCTACGCCGG GTTCTACGGG
CAGGACTACG CCCGCCTCTT CCCCGGCCAG GTGCGCACGA TGGTGCTCGA CGGCACGTGG
AACCACGGCG CGGCCGACTG GTCCGGCGAA CTGGAGGAGA TGGCCAGGAG CAACGAGGAG
GCCATCGGCC GGTTCTTCGA CTGGTGCGCC GCCGGCAAGT GCCGTGACGT GCCCGCGAAG
TGGCGCAGGC TGATCGCCGG GGCCGACCGC ACGCCGATCC CGGCCAAGCG GGCCGGCATC
GCCTACGACG GCCGCGACCT GCGGTCCTTC GTGGTCGGCG CCGCCAAGGA GGGCGTCAAG
GCGTGGCCGG AGCTGGCGCG GGACATCCGC GGGGCCGCCG GCGGTGACGC CTCCGGGTTC
GTCCCCGAGC GCGGCCTCCG CTACCCCGAC CAGTCCACCG GCGTCACCGA GTGCCTCGAC
TGGCCCCGTC CCGCCGGCCG TGCCGAGCTG GAGTCGACGA TCGCCCGGCT GCGCAAGGTC
GCGCCCAACG CGGGCACCGC CGACACGCTG GCGACCGCGA CCCTCGGCTG CGTCGGCTGG
CCGGTACAGG TGACCAACCC GCCGGCCCCG CTGCCGAAGG GCCTGCCGCC GATGCTGGGC
GCGGGCGCCT GGGGCGAGTC CGACGCCGTC CGGCGGGTGC TCAAGCAGGT GCCGGGCAGC
GTCACCGTCC GCCACGAGGG GCCCGGCCAC ACGCTCTACG GCTTCAACCC CTGTGCCCGC
GACCACATCG ACCGCTACTT CACCGACCGC GTGCTGCCCT CGGCGGAGAC GGAGTGCTGA
 
Protein sequence
MLKSIAIVLT LLGSGVQAQQ ADPLDWRPCP SDKAGMECAD LQVPVNWQRT DSRKITLKLG 
RLKATGASEG SVLVAYGGPG GPGIALTQSG AGGWWTRLRE RMDIVTWDTR GYGEQFGGLS
TGLPCTWTRI PLAEFPEDDA DFGRLADTNR GYAEACRNKD PEFFANMSSA DNARDMEAIR
KALGGARLNF YGASYAGFYG QDYARLFPGQ VRTMVLDGTW NHGAADWSGE LEEMARSNEE
AIGRFFDWCA AGKCRDVPAK WRRLIAGADR TPIPAKRAGI AYDGRDLRSF VVGAAKEGVK
AWPELARDIR GAAGGDASGF VPERGLRYPD QSTGVTECLD WPRPAGRAEL ESTIARLRKV
APNAGTADTL ATATLGCVGW PVQVTNPPAP LPKGLPPMLG AGAWGESDAV RRVLKQVPGS
VTVRHEGPGH TLYGFNPCAR DHIDRYFTDR VLPSAETEC
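
The protein sequence follows from the gene sequence by codon translation, and the GC content field is the fraction of G and C bases. A minimal sketch of both calculations is given below. It assumes the sequences are pasted as shown here, in space-separated blocks of ten, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in permitted start codons). The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; amino-acid assignments are
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGCTGAAGT CAATCGCCAT CGTCCTGACC CTGCTGGGCT CGGGCGTGCA AGCACAGCAA"
print(translate(first_line))  # MLKSIAIVLTLLGSGVQAQQ
```

The printed peptide matches the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked the same way.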