Gene Sros_3126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3126 
Symbol 
ID8666414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3402964 
End bp3405381 
Gene Length2418 bp 
Protein Length805 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003338816 
Protein GI271964620 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.51395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCGACG GGGACGGGGT ATCGGGGTCA ACCAGCGGCC GGCCGCGGCG CCTGGCCGGT 
GCCCGCCGGC TGCCCTTGCG TACCGTGCTG ATCATTGTCG TGTCCGTACC GAGCATGACT
TTCACCCCGC TGCTGGGGTC GGGCGTATAC CAGCTGGCTA CTCAGTGGCA GGCGGAGAAG
ATCCAGATGG ATCTGGCCAC CGATACCGTC GGCAGACCTG CGGCCGACCT GTTCTTCAGC
CTTGAGCAGG AGCGACTGCT CACTGCGGAG ACGCAGGCCA GTCCCCGTAG CACCTCCCGC
GACAAGCTCG CCAAGCAGCG CGCGGTGACC GATCAGGCCG TGAGCGCCTT CCGTCCGCTG
GCAGCCCTGG ACACCACCGA CGGACAGGAG GGGCTCGCCG ACGCCATCGC CCGGACCCAA
CGCAACCTCG ACCTCCTCGA CCAGCAGCGG CGGGCGGTGG ACGCCGGTTG GTCCAGCGAG
CAGAAGGCGT TCGACTACTA CAACGGCGTC CTTGAGTCAG ACCTCAGCGT ATTGACCGCC
CTCAGCCACA CCGACCACGC CGAGGTGAGT AGCAAGGCGC AGACGCTGGT CGACCTCTTC
TGGGCCGTCG ACATGATCGG CCGCGAAGAC GCCATCCTGG CCCGCGGCTG GGAGACGGGG
CACCTGACGA GGGACGAGTA CGGCCTTGTC GCCGATGCCA TCGGCACCAG GAAGCACCTC
CTGCAGTCCC GGGTGGCCCC CAGTCTCCTG GGCAACGAGA GCCAGTACGG CGCGCTGACG
GCCAGTAAGG CGTGGGAGAC CATGACCGCG CTGGAGGGGC GGCTGCTCGC CTCCGAGACC
AGCGCCGGGA GCAACCAGGT GACACTCCGC GAGGTCGGCC CGGTGTGGCG ATCCTCGGTG
GACACGGTCA CCCCGCAACT GCTGCAAGTT CTCAATCTCC GGCTCGAAAA CGTCAGCAGG
ATCGGCTACG GCCACGCCGA TCGGCTCTTC GCGACCTTCA TCAGTGTCAC CGCCGTGGGC
CTGCTCGCCC TGGGCCTGGT CATCGTCACC ATCTGGCGCC TCACCGCCAT TCTTCGCCGG
CGCATCCTCC ACCTGCGCGA GGACGCCCAG GAACTCCAGG AGCGGCTGCC CAACGTGGTC
GCCCGGCTGG AGCGCGGCGA GGACGTCGAC GTGGACGCGG AAGTACACAT GGTCGAGCCC
ACCCCCGACG AGCTCGGCGA GCTCGGGAAA GCCCTCAATC TGGCCAGCCG CAGCGCTGTC
CTCACTGCCG TACGGCAGGC CGAACAACAC CGCGGCTTCC AACGCATGCT CCAGCGCATC
GCCCGCCGTA CCCAGATCCT CATCGGCCTT CAGCTGAAGA AGCTGGACGA ACTGGAACGC
AAGCACGAGG ACCCCGAGGT GCTGGAGGGC CTGTTCGACC TGGACCACCT CACCGCTCGG
CTGCGCCGCT ACGAGGAGAA CCTGGTGATC CTCGGCGGCG GTCAGCCGCA GCGCCGCTGG
CGCAAGCCGG TACGCCTGCT CGATGTACTG CGCGCCGCGC AGGGCGAGGT CCAGGACTAC
CGGCGGATCT CGATCGATGT CGAGGGCGAA CCCTGGGTAA CCGAGCGCGC GGTGGGACCG
TTGATCCACG TCCTGGCGGA ACTGATGGAG AACGCCACCG CCTTCTCCAA GCCACTGACC
CCCGTCGAGG TGCGGGCCGC ACCGGTCAGC CGGGGCCTCG CCGTGGAGAT CGAAGACCGC
GGCCTGGGCA TGGAACCCGA GCAGTACGCC GCCGCCAACG CCCTGATGCA GTCGCCGCCG
CAGCTCGATG TGATGACCCA CGCCGACGAT GTACGTCTCG GCCTGTATGT GGTCGCTCGG
CTCTCCGCGG GTCTGGGCCT CCAGGTGGAA CTGCGGCCGT CGGCCTTCGG CGGCACCCGT
GTCATCGTGC TGCTCCCGGA ACCGCTGGTG GTGGACCGCC CCCGCGCGGT ACCCGGGCCG
GCCGCACCCC CCGAGGAGGC CCCCCGGGAC CCCACCGGTC CGCAGCTCCA CCCGCACGAA
GGTACGGCCG GTCCCCGCCC GCCCCACGAG GACGCGCAGT TGCCGACTCG CTCCAGAGGG
CGTGCGATGG CATACGTGAC AGCGCCCACC GCCGGATCAC CGGATCAGGG TGACGCGCCC
CCGTCGTCCG GCCAGGAGCC GCTGCCCCAG CGGGTGCGCC AGGCCAGCCT GGTCACCGAG
CTCAAAGTCC CGGCGGACAG GGATGAACAG ACCGACCAGG AGGACTGGGC CGTACGCGAC
CAGCCCAGCC GATCCAGTGC CACGATCGGC GCGTTCCAGA GGCAGTCCCG CAGGCGCCGG
ACCGGTGACG ACACCTTCCA GCCCCGGCCG GACGGATCCC CCGAGCCCGG TTCCCCTACG
ACGGAAGATC GAAGATGA
 
Protein sequence
MGDGDGVSGS TSGRPRRLAG ARRLPLRTVL IIVVSVPSMT FTPLLGSGVY QLATQWQAEK 
IQMDLATDTV GRPAADLFFS LEQERLLTAE TQASPRSTSR DKLAKQRAVT DQAVSAFRPL
AALDTTDGQE GLADAIARTQ RNLDLLDQQR RAVDAGWSSE QKAFDYYNGV LESDLSVLTA
LSHTDHAEVS SKAQTLVDLF WAVDMIGRED AILARGWETG HLTRDEYGLV ADAIGTRKHL
LQSRVAPSLL GNESQYGALT ASKAWETMTA LEGRLLASET SAGSNQVTLR EVGPVWRSSV
DTVTPQLLQV LNLRLENVSR IGYGHADRLF ATFISVTAVG LLALGLVIVT IWRLTAILRR
RILHLREDAQ ELQERLPNVV ARLERGEDVD VDAEVHMVEP TPDELGELGK ALNLASRSAV
LTAVRQAEQH RGFQRMLQRI ARRTQILIGL QLKKLDELER KHEDPEVLEG LFDLDHLTAR
LRRYEENLVI LGGGQPQRRW RKPVRLLDVL RAAQGEVQDY RRISIDVEGE PWVTERAVGP
LIHVLAELME NATAFSKPLT PVEVRAAPVS RGLAVEIEDR GLGMEPEQYA AANALMQSPP
QLDVMTHADD VRLGLYVVAR LSAGLGLQVE LRPSAFGGTR VIVLLPEPLV VDRPRAVPGP
AAPPEEAPRD PTGPQLHPHE GTAGPRPPHE DAQLPTRSRG RAMAYVTAPT AGSPDQGDAP
PSSGQEPLPQ RVRQASLVTE LKVPADRDEQ TDQEDWAVRD QPSRSSATIG AFQRQSRRRR
TGDDTFQPRP DGSPEPGSPT TEDRR