Gene Sros_4397 details


Gene Information

Locus tag: Sros_4397
Symbol:
ID: 8667691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4906604
End bp: 4907890
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: putative sigma-70 factor, ECF subfamily
Protein accession: YP_003340016
Protein GI: 271965820
COG category:
COG ID:
TIGRFAM ID:
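
The start, end, and length fields in this record agree with one another, which a few lines of arithmetic can confirm. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single untranslated stop codon. Both assumptions are consistent with the listed values, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 4906604
end_bp = 4907890

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1287
print(protein_length)  # 428
```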


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0260182
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0263489
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAGC AGCCGACCGA TACCGGTACG GACAGGGCCG TGGAGTCGGT GTTCCGGGAG 
GAACACGGTC GGCTGCTCGC CTCACTCGTC GGCCGTTTCG GGGACCTCGA CCTGGCGGAG
GAGGTCGCCT CCGAGGCGAT CGAGGCCGCG CTGATGCACT GGCCGGTGCA GGGCGTTCCG
GCCAAGCCGG GTGCCTGGCT GCTGACGACG GCCCGGCGCA AGGCCGTCGA CCGGCTGCGG
CGGGACCAGG CCTACGCCGC CCGGCTCGCC GCCCTGCAGG TGGAGGCGGA CCGGGCCGCC
TCCGCCCCGC CCGCGGACGC GGACGCCGAT CTCCCGGACG AGCGGCTGCA GCTGTTCTTC
ACCTGCGCCC ACCCGGCCCT GCCGGCCGAG GCTCGCGGGG CGCTGACGCT GCGCTGCCTG
GCCGGACTGA CCACACCCGA GGTCGCGCGG GCCTATCTCG TCCCGCCGGC GGCGATGGCC
CAGCGGATCG TGCGGGCGAA GAAGAAGATC CGCGAGGCCC GGATCCCCTT CAGGGTGCCG
GGCGCCGACG AGTTGCCCGC ACGCCTGCCG GGTGTGCTCC AGGTCCTCTA CTCGATCTTC
ACGGAGGGGT ACGCGGCCAG CGCCGGAGCG CAGCTGCAGC GGCTCGACCT CGCCGAGGAG
GCCCTTCGGC TGGCACGGAT CCTGCGCCGG TTGCTGCCCG CCGAGCGGGA GGTCGCCGGC
CTGCTCGGGC TCATGCTGCT GGTCCACGCG CGGCGCGATG CCCGGACCGG CCCGGACGGC
GAGCTCGTGC TGCTGGAGGA CCAGGACCGC GGCCGCTGGG ACCGTACGAT GATCGAGGAG
GGCCTCGCCC TGGTGCCCGC CGCGCTGACC GGCGGCCCGC CTGGACCGTA CGGCGTGCAG
GCCGCGATCG CCGCCCTGCA CGACGAGGCG GCAGACCTCG CGACCACCGA CTGGCCGCAG
ATCGTGGCGC TCTACGGCGT GCTGCTCGCC CTCGCCCCCT CTCCCGTCGT CGCCCTGAAC
CGGGCCGCGG CGGTGGCGAT GTGCGACGGC CCGGAGGCCG GCCTGGCGCT GCTCGACAGC
CTGGCCGGCG AGGAGAGGCT GCGCGGCCAC CACCCCTACC CGGCGGCCCG GGCGGACCTG
CTGCAACGGC TCGGCCGGCT CCCCGAGGCC GCCGCTGCCT ACCGGGAAGC GCTCGCCCTG
GCCGGCACCG AACCCGAACG CGCTCACCTG CGACGCAGGC TGGAGGCGGT CGAGCCATCC
GGCCCGGACG CCGGGGCCGG CACGTGA
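
The GC content reported in the gene information is the percentage of G and C bases in a nucleotide sequence such as the one above. The following is a minimal sketch in Python of that calculation. The helper name gc_content is hypothetical and not part of any database tooling, and the example runs on the first 20 bases only. Passing the full gene sequence would give the figure to compare against the reported value.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence, with the page's 10-base block spacing kept.
fragment = "ATGGCGGAGC AGCCGACCGA"
print(gc_content(fragment))  # 70.0
```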
 
Protein sequence
MAEQPTDTGT DRAVESVFRE EHGRLLASLV GRFGDLDLAE EVASEAIEAA LMHWPVQGVP 
AKPGAWLLTT ARRKAVDRLR RDQAYAARLA ALQVEADRAA SAPPADADAD LPDERLQLFF
TCAHPALPAE ARGALTLRCL AGLTTPEVAR AYLVPPAAMA QRIVRAKKKI REARIPFRVP
GADELPARLP GVLQVLYSIF TEGYAASAGA QLQRLDLAEE ALRLARILRR LLPAEREVAG
LLGLMLLVHA RRDARTGPDG ELVLLEDQDR GRWDRTMIEE GLALVPAALT GGPPGPYGVQ
AAIAALHDEA ADLATTDWPQ IVALYGVLLA LAPSPVVALN RAAAVAMCDG PEAGLALLDS
LAGEERLRGH HPYPAARADL LQRLGRLPEA AAAYREALAL AGTEPERAHL RRRLEAVEPS
GPDAGAGT
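
The protein sequence above corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch in Python of that mapping, using only the standard library. The names CODON_TABLE and translate are hypothetical helpers. The sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon-to-amino-acid map built from the NCBI genetic code string,
# with bases ordered T, C, A, G. Translation table 11 assigns the same
# amino acids to codons as the standard code; it differs only in which
# codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCGGAGC AGCCGACCGA TACCGGTACG"))  # MAEQPTDTGT
```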