Gene Sros_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2017 
Symbol 
ID8665299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2169308 
End bp2170366 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content69% 
IMG OID 
Productluciferase family protein 
Protein accessionYP_003337748 
Protein GI271963552 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.473194 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCT CGACCTTCCA CCTCTTCCAC CGGTTCGACG GCCAGAGCTT CAAGGACGTC 
TACGACTACC ACCTTGAGCT GGTCGAGCTC GCCGAGGAAC TGGGGTTTCA CGGGGTGCGG
CTGGCCGAGC ACCACTTCCG CGACTACGGC GTGGTGCCGA ACCTGTTCAC GATGCTGTCC
CACATGGCCG CGCGCACCGA GAGACTGCGG CTGGGGACCG GCATCGTGGT CCTCCCCCTG
CACAACCCGG TCCACGTCGC CGAGGAGGCG GCCATGGTGG ACATGCTCTC GGGTGGACGG
CTGGAGCTGG GGATCGGCCG CGGCTACCAG AGCTTCGAGT TCGAGGGCTT CGGCATCGAC
CTGGCCGAGG CCCGCGACCG GTTCAACGAG GCCCTGGAGG TCGTGCTCGG CCTCTGGGGC
AACCCGTCCT TCCGGCACGA GGGCAAGTTC TACCGGACCG GCACCGAGGT GGAGCTCGTC
CCCCGGCCGG TCCAGGACCC GTTCCCGCCG ATCCACGTGG CGGCCGTGTC CCCCGAGACG
GTGACCATGT ACGCCGAGCG CGGCCTGCCG ATCCTGGCCG ACCCGGCGGC GCCCTTCCGC
AAGGTGGTCA AGGCCGCCGA GACCTGGCGG GAGACCGCGG AGCGGGCCGG GCACGACGTG
GCCGCCTCGG AGCTGGTGGT CGCCCGCAGC GTCTACGTCG CCGCGACGGT CGAGCGGGCC
CGCGAGGACC AGGCCAGGTT CGAGACGATG TTCGACCGGT CCCGGATCTT CAACGAGAAG
AGCTCGCCGA TCGACCCCCG GACCGGCAAG GCCGCCCAGG GCTTCGAGTA CTACCAGGAC
CGCTACCTCA AGGGTGGCGC GGTCTCCAAC GATTTCCGCT GGGAGCAGCT GGAGGTCATC
GGCGACCCCG AGCGGGTGAT CGGCCAGATC CGGCTGCTCG AGGACGCCGG CTTCGCCAAC
CTGCTCTGCG ACTTCGGCAG CACGAGGCCG ATGCCGCTGG AGGAGATGAA GAAGGTCATG
CGGTTCTTCG CCGCCGAGGT CATGCCCGCT TTCAAGTGA
 
Protein sequence
MKFSTFHLFH RFDGQSFKDV YDYHLELVEL AEELGFHGVR LAEHHFRDYG VVPNLFTMLS 
HMAARTERLR LGTGIVVLPL HNPVHVAEEA AMVDMLSGGR LELGIGRGYQ SFEFEGFGID
LAEARDRFNE ALEVVLGLWG NPSFRHEGKF YRTGTEVELV PRPVQDPFPP IHVAAVSPET
VTMYAERGLP ILADPAAPFR KVVKAAETWR ETAERAGHDV AASELVVARS VYVAATVERA
REDQARFETM FDRSRIFNEK SSPIDPRTGK AAQGFEYYQD RYLKGGAVSN DFRWEQLEVI
GDPERVIGQI RLLEDAGFAN LLCDFGSTRP MPLEEMKKVM RFFAAEVMPA FK