Gene Sros_1991 details


Gene Information

Locus tag: Sros_1991
Symbol:
ID: 8665273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2143113
End bp: 2145248
Gene Length: 2136 bp
Protein Length: 711 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: CoA-binding domain protein
Protein accession: YP_003337722
Protein GI: 271963526
COG category:
COG ID:
TIGRFAM ID:
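
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2143113
END_BP = 2145248
GENE_LENGTH_BP = 2136
PROTEIN_LENGTH_AA = 711


def span_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # prints "2136 711"
```

Both computed values agree with the Gene Length and Protein Length fields of the record.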


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.820209
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGGT CCGATCCCCT GGCGACGGCG GGGCCGCCCC CCGAGCACGA GGTCAAGGCG 
CTGCTGCGGG AGGCCGGCGT GGCGGTGCCG CGCGGCGCCA CGGCCGCCTC CGCCGAGGGC
GTCGCCGCGG CGGCCGGAGG GCTGACCCCG CCGCTGGTGC TCAAGGCCTT CGGCCCCGGG
CTGGTCCACA AGTCGGATTC GGGTGCGGTA CGGCTGGCGC TCGCCGGGCC GGCCGAGGCC
GCCGGAGCCG CGGCCGAGAT GCTCGGCGGG CTCCTCGCCC AGGGCATCAC CCCGGACGGG
TTCCTGGTCG AGGAGCAGGC CGACCCGGGG GTCGAGCTGA TCGTCGGCGT CGTCCGCGAC
CCGGGCTTCG GTCCCGTCCT GCTCGCCGGG CTCGGCGGCG TCTGGACCGA GGCCCTGCGC
GACACGGCGG TGCGCGTGTG CCCGGTCTCC GAGGCCGACG CCCGGCGGAT GCTCGCCTCC
CTGCGTGGCT CCCGGCTGCT GCGGGGCTAC CGCGGCCGTC CGGCCGCGGA CATGGACGCC
CTGGTGAAGC TGCTGCTGAC GATCGGCGGG GCCGGCGGCC TGTGGGAGCG GCTGGAGCTC
GGCGAGTTCG AGCTCAACCC GGTGATCGCC ACCGGCCGGG GCGCCGTCGC GGTCGACGCC
CGCTACATTC CGGGCGGCCC CCGCGAGGAG GCGCCGCGAC CGGCCCCGGC GGCGGAGACC
GACTTCACGG CGCTGTTCGA GCCCCGGGCC GTGGCCGTGG TGGGGGCCTC CGCCAGCCGG
CCGAACTTCG GGAACATGTT CCTGGAGTTC TACCGGGCCA CGGGGGTGCC GCTGGTCGCG
GTCCACCCGG AGGCGGACGA GATCGACGGC GTGCCCGCCG TGCCGACGCT GGCGGACGCG
GACGCCGACT ACGCGCTGGT CGCCGTGCCC GCCGGCCGGT GCGCCGAGGT CGTACGGCAG
GCGGACGGCA TCCCGTTCGT CCAGGTGATG AGCGGCGGGT TCGGGGAGGC CGGCGCGCCC
GAGCTGGAGG CCGGGCTGGT CCGGGCGGCG CGGGAGGCGG GGACGCGGCT GCTCGGACCG
AACTGCATGG GTGTCTACAG CCCGCGCGGC GGCCAGACGT TCATCGGCGG CGAGCCGGGG
CCGCCCGGCC ACGTGGCGCT GATCTCGCAG AGCGGCGGGC TCGCCGGGGA GGTCGTCAGG
GTCGGCGAGC GGCGCGGGCT GGCGTTCAGC CGGGTGGCCA CGGTGGGCAA CTCCGCCGAC
GTGACCCCTG CCGAGCTGCT GCGCTGGCTG GCCACCGACG AGCACACCTC GGTCGTCGGC
ATGTATCTGG AGGATCCGCG CGGCGGCCGC GCCCTGTTCG AGGCCCTGAT GTCGGTGCGC
GGCAGGCTCC CGGTCGTCCT GCTGGTCGGC GGCCGGAGCG CCCAGGGACG GCGGGCCGCC
GCCTCGCACA CCGGCGGCAT GGTCGGCGAC GACCGCGTCT GGCGGGCCCT GGCCGAGCAG
TCCGGCGCCG CCCTGGTCAC CGGTCAGGAC GACCTGATCG GCGTCCTGTC CTTCTTCCAG
TCCCACGCCC GCCGCCTGAG CGCCCTCCGC ACCGACCCCG CGGACGGGGA CCCGGGCGGG
GATCCCGCGC TGCTGGTGAT CGGGCCGAGC GGCGGGGCGA GCGTCCTGGC GGCCGACGTG
TTCGACGCCG CCGGGCTCTC GCTCGACGCG TTGCCGGAGG AGGCGGAGGA CGGGCTGAAG
GCGCTCGGGA TCGGGGTGGG CAGCTCCCTG GCCAACCCGC TGGAGATCCC GGTGGGGCCG
CGTGGCCGCC CCGAGCTGGC CCGGGAGGCC ATCGCGGCGA TCGTGGCGCG GCGGCCGTAC
CCGGACGTCG TCGCCCACGT GAACGTGCAG AGCTTCTTCA CCTACGGCAG TTCGGCCGAG
CCGCTGTACG CCTGCGCCCG CAGCCTCGCC CGCGCCCAGG AGGACCTGCC CGAGGTCAGG
ATCACGCTGG TCACCAGGAA CGGCGAGTGC GCGCCACCCG GGGTCGAGGA CGGGGTCCGC
GCGATCGCGG CCGGTGCCGG GATCCCCGTC TACCGTTCGA TGGAGGCGGC CGCCGTGGCC
GTCGCCGCCG CCAAACGTTT CACGCGAGGA GCTTGA
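
A GC percentage such as the one in the GC content field can be recomputed from the gene sequence. The sketch below is one plausible way to do so, assuming the value is the share of G and C among the A, C, G and T bases after the block spacing is removed. How the database itself computes and rounds the figure is not stated in this record, and the helper names are hypothetical.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(sequence):
    """Percentage of G and C among the unambiguous bases (A, C, G, T)."""
    seq = clean_sequence(sequence)
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("no unambiguous bases in sequence")
    gc_count = sum(1 for base in counted if base in "GC")
    return 100.0 * gc_count / len(counted)


# First line of the gene sequence above; paste the full block to check the record.
first_line = "GTGAGCGGGT CCGATCCCCT GGCGACGGCG GGGCCGCCCC CCGAGCACGA GGTCAAGGCG"
print(round(gc_percent(first_line), 1))
```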
 
Protein sequence
MSGSDPLATA GPPPEHEVKA LLREAGVAVP RGATAASAEG VAAAAGGLTP PLVLKAFGPG 
LVHKSDSGAV RLALAGPAEA AGAAAEMLGG LLAQGITPDG FLVEEQADPG VELIVGVVRD
PGFGPVLLAG LGGVWTEALR DTAVRVCPVS EADARRMLAS LRGSRLLRGY RGRPAADMDA
LVKLLLTIGG AGGLWERLEL GEFELNPVIA TGRGAVAVDA RYIPGGPREE APRPAPAAET
DFTALFEPRA VAVVGASASR PNFGNMFLEF YRATGVPLVA VHPEADEIDG VPAVPTLADA
DADYALVAVP AGRCAEVVRQ ADGIPFVQVM SGGFGEAGAP ELEAGLVRAA REAGTRLLGP
NCMGVYSPRG GQTFIGGEPG PPGHVALISQ SGGLAGEVVR VGERRGLAFS RVATVGNSAD
VTPAELLRWL ATDEHTSVVG MYLEDPRGGR ALFEALMSVR GRLPVVLLVG GRSAQGRRAA
ASHTGGMVGD DRVWRALAEQ SGAALVTGQD DLIGVLSFFQ SHARRLSALR TDPADGDPGG
DPALLVIGPS GGASVLAADV FDAAGLSLDA LPEEAEDGLK ALGIGVGSSL ANPLEIPVGP
RGRPELAREA IAAIVARRPY PDVVAHVNVQ SFFTYGSSAE PLYACARSLA RAQEDLPEVR
ITLVTRNGEC APPGVEDGVR AIAAGAGIPV YRSMEAAAVA VAAAKRFTRG A
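
The protein sequence begins with M although the gene sequence begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons, and an initiating codon is translated as methionine. The sketch below illustrates this with a hand-built codon table. It is a minimal illustration with hypothetical names, not the pipeline used to produce this record.

```python
BASES = "TCAG"
# Amino acid assignments in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence):
    """Translate a CDS, reading the first codon as an initiator (M)."""
    seq = "".join(sequence.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    residues = ["M"]  # the initiating codon is read as methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence in this record.
head = "GTGAGCGGGT CCGATCCCCT GGCGACGGCG GGGCCGCCCC CCGAGCACGA GGTCAAGGCG"
print(translate_cds(head))  # prints "MSGSDPLATAGPPPEHEVKA"
```

The output matches the first twenty residues of the protein sequence above.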