Gene Sros_2124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2124 
Symbol 
ID8665406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2279923 
End bp2281623 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content68% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003337852 
Protein GI271963656 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.415891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCA ACCTCCCGAT CGCAGCGGGC GCGGCCCTGC TCGCCCTGTC GGTCTCCGCC 
TGTTCCGGCA CCTCCTCCCC GGCGGAGCCC GCCGGATCCG CCGGGACGGC CGGCTCCTCC
CAGGCGAGCC CCGCGGGATT CCAGGAGCAG CACAAGGGCG GCACGCTCCG CCTGCAGGCC
AAGTCCGGCG ACGGCACGCT CGACCCGCAC ATCAACTACA GCAACGGCAA CTGGCAGATC
TTCCAGGCCA TGTACGACGG CCTGCTGGCC TTCAAGAAGG TCGGCGGCGA GGCCTCCTAC
GACCTGGTCC CGGACCTGGC CGAGGCCATG CCCGAGGTCA GCCCGGACGG TAAGTCCTAC
ACCTTCACCC TCCGCAAGGG CGTGAAGTTC GCCGGCGGCG GCGAGGTCAC CGCCGACGAC
GTGGTGGCCT CCTTCGAGCG GATCTACAAG GTCTCCGGGC CCACCTCCGG CACCTTCTAC
GCCGGGATCG TCGGCGCCGC CGCCTGCGTC AAGAAGCCCA AGGAGTGCAC GCTCGACAAG
GGCGTCGTCG CCGACAAGGC CAAGAACACG GTCACGATCA ACCTGGTCGA GCCGGACTCG
GAGTTCCCGC TCAAGCTCGC CCTGCCGCAC GCGGCCGTCC TGCCGAAGGA CACCCCGGAC
AAGGACCAGG GCACCAAGCC GATCGGCGGC ACCGGACCTT ACATGGCGGT CTCCTACGAC
CCCAACAAGG AGCTCAAGCT CGTCCGCAAC CCCGACTTCA CCGAGTGGTC GCGCGAGGCG
CAGCCGCAGG GCTACCCGGA CGAGATCGTC TACTCCTACG GCCTCACCGC CGAGGCCGCG
GTCACCGCGG TCCAGAACGG CCAGGCTGAC TGGATCTTCG ACCCGCTGCC CGCCGACCGG
CTCAGCGAGA TCGGCACCAA GTACGCCTCC CAGGCGCACG TGAACCAGCT GTCGGCCTTC
TGGTACCTGC CGCTCAACAC CAACCTGGCG CCGTTCGACA AGCCCGAGGC CCGCCAGGCG
CTCAACTGGG CGATCGACCG CCAGGCCGTG GTGAAGATGT TCGGCGGCGC CAACGTGGCC
CAGCCCGCCT GCACGCTCCT GCCGCCCGGC ATCCCCGGCC ACGCCGACTT CTGCGACTTC
CCCCGGCCGG ACCTGGCCAA GGCGAAGGAC CTCGTCCAGC AGTCGGGCAC CGCGGGCCAG
GAGGTCGCCG TCGTCGTCTC CGACGACGAG GTGAGCAAGC AGATCGGCGA GTACGTCCGC
AGCACCCTGG AGCAGATCGG CTACCAGGCC AAGCTCAAGG TCATCTCGAC GAACATCCAC
TTCACCTACA TCCAGAACGA CAAGAACAAG GTCCAGGTCA GCGTCTCCCA GTGGTACGCC
GACTACCCCG CCGCCTCGGA CTTCCTGCAC GTGCTGCTCT CCTGCGCGTC CTTCCGTCCG
GGCAGCGACT CCAGCATCAA CATCTCCGGC TACTGCGACA AGGACATCGA CGCCCGGATG
GCCGAGGCCA TGACGCTGGA CCGTACCGAC AAGGACGCCG CGAACGCCAA GTGGGGCGAG
ATCGACCGCG ACCTGACCAA GGCGAGCCCG ATCATCCCGC TGTTCACCCC CAAGCAGGTG
GACTTCGTCT CCAGCAGGGT CGGCAACTAC CAGTTCCACA AGCAGTTCTT CATGCTCGTC
TCCCAGCTCT GGGTCAAGTA G
 
Protein sequence
MKRNLPIAAG AALLALSVSA CSGTSSPAEP AGSAGTAGSS QASPAGFQEQ HKGGTLRLQA 
KSGDGTLDPH INYSNGNWQI FQAMYDGLLA FKKVGGEASY DLVPDLAEAM PEVSPDGKSY
TFTLRKGVKF AGGGEVTADD VVASFERIYK VSGPTSGTFY AGIVGAAACV KKPKECTLDK
GVVADKAKNT VTINLVEPDS EFPLKLALPH AAVLPKDTPD KDQGTKPIGG TGPYMAVSYD
PNKELKLVRN PDFTEWSREA QPQGYPDEIV YSYGLTAEAA VTAVQNGQAD WIFDPLPADR
LSEIGTKYAS QAHVNQLSAF WYLPLNTNLA PFDKPEARQA LNWAIDRQAV VKMFGGANVA
QPACTLLPPG IPGHADFCDF PRPDLAKAKD LVQQSGTAGQ EVAVVVSDDE VSKQIGEYVR
STLEQIGYQA KLKVISTNIH FTYIQNDKNK VQVSVSQWYA DYPAASDFLH VLLSCASFRP
GSDSSINISG YCDKDIDARM AEAMTLDRTD KDAANAKWGE IDRDLTKASP IIPLFTPKQV
DFVSSRVGNY QFHKQFFMLV SQLWVK