Gene Sros_4920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4920 
Symbol 
ID8668214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5450261 
End bp5451895 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content65% 
IMG OID 
ProductSpore coat assembly protein-like protein 
Protein accessionYP_003340472 
Protein GI271966276 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGC TCAAGCATCG TATTCCGGTC AAACTCCGGC ACAACTGGAA GCTGGTCGCG 
CTGTGCGCGG CGTTCCTGGC CCTGTGCTGG GGTGTGCTCG GCAGCGGGAT GATCCGCCCC
TACGTCACCA TCTCCCAGGC CGCCACGGCC GAGACCGTCA TCACGAACCT GACCGGCACC
AAGGACCTGT TCGACACCTC TGTGGCGCAC GAGGTGAAGC TCACCTTCAC CGACGCGGCC
TATGAGGACA TGCTGAGCGA GTACTTCAAG GACGGCGAGA AGAAGTACCT CGAAGCCGAC
CTGACCATCG ACGGCGTCCG CATCCCCAGC GTGGGCGTCC GGCTCAAGGG CAACTCCACC
CTGTCCGGGC TGACCTGGAA GGGACAGTCC AAGCAGCGGA GCATGCCCGG CGGCGGACAG
CGGCGCGGAG GTATGCCCGA GGGCTTCCAA CCACCCGAAG GTTTCCAACC ACCTGAGGGA
TTCCAGCCGC CCGAGGGCGG CGAGCCATCC GAAGGCTCCC GGTCACCGGG CAATGGGCCG
CCGCCAGTCG GGCAGCAGGG CAATGGCGAG CAGCCCAGAG GTGGCGGGTT CGGCGCGTCT
TTGAAGGGTG AGGAGCCGGA GAACCTGCCG TGGCTGATCA GCTTCGACGA GTTCGTGGAG
GGCCGCCGCT ACCAGGGGCA CAGCCAGGTG GCAGTACGGC CGGCCGCGAT GGGCTCGACG
ACGATGCTGA ACGAGGCACT GGGCATCGCA CTGGTCGGCG CCTCCGGTGA GCCCACCCAG
CGTTCGGCCC ACAGCGCCTT CACTGTCAAC GGCCGCACCT CGACACCGCG TCTGCTCGTG
GAATACCTCG ACGAGGGCTA TGCCGAAGGC CTCGGCGAAG GCGTGCTGTA CAAGTCGCTG
GCCGGCAGCT CCTTCAGCTA CAAGGGCGAG GACCAGACCG GGTACACCAA CGACTTCAAG
CAGATCAACA AGGTCGGCGG CCAGGATCTA CAGCCGGTCA TCGATCTGGT CAAATGGGTG
AACCAAGCCT CCGACCCCGA ATTCGCCGCA GGCCTGGGCG AGCGTTTGGA CGTGGAGTCG
TTCGCCCGTT ATCTGGTCCT GCAGAACCTC ATGGTCAACT TCGACGACAT GGCCGGGCCC
GGACGCAACT ACTACCTGTG GTACGACCTG GACACCAAGA AGTTCAAGGT CATCACCTGG
GACCTCAACC TCGCCTTCAG CGGTAACGCC AAGAGCGGTG TGAACGACAC GGTCACGATG
GGCTTCGGCC GGGGACGCCC CCAGCAGGAC CAGCAGGACC AGCAGGACCA GCCGCCTCAA
GGTTTCACCC CGCCCCAGGA CGGCCCTCAG CAGCCTCCAG AAGGAGGCAT GATGCGCATC
GGCCATCCTC TGAAGGAGAG GTTCCTCAAG AACGCCATCT TCAAGAAGGT CTACCAGGAG
CAGTATCGTG CTCTGTATGC CAAGCTGCTC GGCAACGGCA CCGCATCCGG CCTGCTCAAC
GATCTCGCCA CGTCCTACAA GCTGAACGAG GACGCCGACA CCGCCAAGGC CGACACCGAG
GCGCAGAACC TGCGCACGTT CCTGCAGACC CGCACTCAGA CACTGCGGTC CGACAAGGCG
ATCAGCGGCG GGTAG
 
Protein sequence
MAQLKHRIPV KLRHNWKLVA LCAAFLALCW GVLGSGMIRP YVTISQAATA ETVITNLTGT 
KDLFDTSVAH EVKLTFTDAA YEDMLSEYFK DGEKKYLEAD LTIDGVRIPS VGVRLKGNST
LSGLTWKGQS KQRSMPGGGQ RRGGMPEGFQ PPEGFQPPEG FQPPEGGEPS EGSRSPGNGP
PPVGQQGNGE QPRGGGFGAS LKGEEPENLP WLISFDEFVE GRRYQGHSQV AVRPAAMGST
TMLNEALGIA LVGASGEPTQ RSAHSAFTVN GRTSTPRLLV EYLDEGYAEG LGEGVLYKSL
AGSSFSYKGE DQTGYTNDFK QINKVGGQDL QPVIDLVKWV NQASDPEFAA GLGERLDVES
FARYLVLQNL MVNFDDMAGP GRNYYLWYDL DTKKFKVITW DLNLAFSGNA KSGVNDTVTM
GFGRGRPQQD QQDQQDQPPQ GFTPPQDGPQ QPPEGGMMRI GHPLKERFLK NAIFKKVYQE
QYRALYAKLL GNGTASGLLN DLATSYKLNE DADTAKADTE AQNLRTFLQT RTQTLRSDKA
ISGG