Gene Sros_1543 details


Gene Information

Locus tag: Sros_1543
Symbol:
ID: 8664819
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1633720
End bp: 1635438
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003337279
Protein GI: 271963083
COG category:
COG ID:
TIGRFAM ID:
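
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1633720
END_BP = 1635438
GENE_LENGTH_BP = 1719
PROTEIN_LENGTH_AA = 572


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one stop codon is included."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span from 1633720 to 1635438 covers 1719 bp, which corresponds to 573 codons: 572 amino acids plus the stop codon.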


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.100753
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.657816
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCACC CAGACGTCCC TCGATCGCAG GGCAGGCACG CCGGCGGGCC CGCCTCCACC 
GGGGCACCGA GCCCGGCGGT GCTGGAACAG GCCTCGCTGG AGCAGGTGAT CGGGCGGCTC
GGGCCCATGG CGCCGCAGCA GGCCGCCACC GTCGGCCTCG CCGTGCTGGA CCAGCTCGTG
GTCGTGCACG GCCAGGGGAT GCTCCACGGT GACGTACGGC CCGGCTCGGT GCTGCTCGGT
CCCTACGACC AGATCATCCT CAGCGCGCCG ACCTTCCGGT CCCCGACCTT CACCGCCCCC
GAGGGCGTGA CGGGCCCGGC GGCCGACCTG TGGTCGCTCG GTGCCACCCT CTACACCGCC
GTCGAGGGGC GGGCGCCGTC ACCCGGGGGA TCCCTCGAGA ACGCGGGCCC GATCGCGCCG
GTCCTGTTCC AGCTGCTCTC CGGCGACCCC GCCCGGCGGC CCGACCCCGG CACCCTGCGC
AACATCCTGC TCGGCATCTC CCAGAGCCGC GGCGAGGCCC CGGCCCCGCT GCCCCCCGCC
CCCGCGGACC TGCTCTCTCC TCCGGATCCG CTGCCCTCCG CGGACCCGCT GTCGGGGCCG
CCTCCCGCGG ACGCGCTGTC GGGTCCCCCG GACGCGAGAT CCGCCGCCCC CTCCTCCCAC
GCGCCGTCCC CCTCGGACGC GCTGTCCGGA GTCCCGCTCC CCTCGGCGCC CTTCGAGACG
CAGTCCACCG TCCCGGTGCT GACCGCGACG GCGCAGCCCT CCACCCCGGA GGCGGCCTCA
CCGCCCCAGC CCGTCTTCGA CTCCGCGGAC ACCATGCCGC CCCGGGCCGC CTCCGATCCG
GCGGCCATCG CGCCGATCCC CGCGGATCCG CGCGGGCCGG GGGTTCCCCC GCAGGCCCCG
CCCCCTTCCG GCGCCCCCGC CTCGCAGGAA CTCGTCCCCG CCACCGGCGG GCCACGCGAG
GTTCTCCCCG CCTCTCCGGC GCAGGGCGAA TCGACGGGCC CGACCAGTCC CGCCGGTCCG
GCCGGACCGG CCGGACCGGC CGGACGCTCC GACCGGCGGG CCGGGGTGCT GGTGCCCCGC
CCGGTCGTGG CGCTGACCGG TGTCCTGGTC CTCGGCATGG CGGTCGCCAT CGGCGTCCTG
CTCGCCTCGC CGGGCGACGG CTCCGGCGAG GGCGACGCCA CCGCCGCACC CGCCGCCGGC
GCCAAGGGCC TGTTCGCCAC CGCGCCTCGC GCCTGCAGCC TGCTCGACGA CAAGCAGGTG
AACGAGCTCG TGCCGGGCTT CAGGAGCTCG GAGGTCGAGC CCGCCGCGTG CGACTGGCTC
AATCAGCATG ACTGGCGCAA GCCCAGCCCG GAGAAGTTCG ACCTCCGCGT ACGGCTGGTC
GCCCAGAAGC CGGACGCCTC CGGGGTCGAG CGGGCGAAGG AGTATCTGTC CGGCAAGAGG
ACGGACCTCG TGGCGAGCGG CAAGTTCGCG ACCCCGAAGC CCGCGCCGCC CCAGAGCCTG
AAGGGGATAG GCGAGGAGGC CTTCACCACG GGCGGCTACA ACTCGATCAA CCTCTACGGC
GGCTCCTACA AGGCGACCGT GCTCTTCCGG GTCGGCAACC TGATCGCCCA GGTCGAGTAC
GAACGGGGCG GCGTCAAGGA GGACCGCGAC GGCGAGATCG CGGCGGGCGC CCAGAAGGCC
GCCCGCTGGC TCACCCAGTC GTTGAAGACC GATGGCTGA
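
The GC content reported in the Gene Information fields can in principle be recomputed from a gene sequence printed in 10-base groups like the one above. This is a minimal sketch; `gc_fraction` is a hypothetical helper, and the short input in the usage line is a toy string, not this gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence.

    Whitespace (spaces and line breaks between 10-base groups) is ignored,
    and the sequence is treated case-insensitively.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Toy input: 6 of the 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))
```

To apply it to this record, pass the full gene sequence block as a single string; the spaces and line breaks do not need to be removed first.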
 
Protein sequence
MNHPDVPRSQ GRHAGGPAST GAPSPAVLEQ ASLEQVIGRL GPMAPQQAAT VGLAVLDQLV 
VVHGQGMLHG DVRPGSVLLG PYDQIILSAP TFRSPTFTAP EGVTGPAADL WSLGATLYTA
VEGRAPSPGG SLENAGPIAP VLFQLLSGDP ARRPDPGTLR NILLGISQSR GEAPAPLPPA
PADLLSPPDP LPSADPLSGP PPADALSGPP DARSAAPSSH APSPSDALSG VPLPSAPFET
QSTVPVLTAT AQPSTPEAAS PPQPVFDSAD TMPPRAASDP AAIAPIPADP RGPGVPPQAP
PPSGAPASQE LVPATGGPRE VLPASPAQGE STGPTSPAGP AGPAGPAGRS DRRAGVLVPR
PVVALTGVLV LGMAVAIGVL LASPGDGSGE GDATAAPAAG AKGLFATAPR ACSLLDDKQV
NELVPGFRSS EVEPAACDWL NQHDWRKPSP EKFDLRVRLV AQKPDASGVE RAKEYLSGKR
TDLVASGKFA TPKPAPPQSL KGIGEEAFTT GGYNSINLYG GSYKATVLFR VGNLIAQVEY
ERGGVKEDRD GEIAAGAQKA ARWLTQSLKT DG