Gene Sros_7001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7001 
Symbol 
ID8670311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp7723536 
End bp7725512 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content70% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003342444 
Protein GI271968248 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.537258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00183275 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCACGA CAGCTGGCTG GCTGCGGAAG CAGCTACCCG GAATCGTCGC CCTCGTGCTG 
ATGACGGGCG CGTTCGTAGT GAGCCGCCCG CCCACGTCGT CCGCCGACGA GAAGGCGGAC
ATCGCGAGCA GGTACGGGTT CACCCCGATG AACATCGCGA TGCCCGGAGG GTTCACCCAG
AAGACGATGC GCGCGGTGAA CAAGAGATAC ACGCACATCG ACGCGTGGAT CTCCTCGGTC
GGCGCCGGCG TCGCGATGAA CGACCTCGAC GGGGACGGCC TGGCCAACGA CCTGTGCATC
ACCGACCCCC GGATCGACCA GGTCGTGGTC ACCCCCACGC CCGGTGAGCG GTCCGGCCGG
TACGCGCCGT TCGCGCTCAA CCCCGGCACC CTCCCCATGA ACGAGGCCAT GGCCCCGATG
GGCTGCCTGC CGGGTGACTT CAACGAGGAC GGCCGCACCG ACCTGCTCGT CTACCTGTGG
GGCCGCACGC CGATCGTCTA CCTGGCCAAG GCCGACGCGC AGAGCCTCGG CGCCGGGGCG
TACCTGCCCA CCGAGCTGGT GCCGGGCGCC GGCGGCGCCG AGTACACGGG GCCGCTGTGG
AACAGCAACA CGGCCACCTT CGCCGACTTC GACGGCGACG GGCACGGTGA CATCTTCATC
GGCAACTACT TCCCGCACAG CCCGGTGCTC AACGACAAGA TCAACGGCGG CGTGGAGATG
AACGACTCCA TGTCCCGGGG CCTCAACGGC GGCGAGAACT ACTTCTTCCG CTGGACCGCG
GGCAGCGCGG GCGCGCAGCC GTCGGCGACC TTCCAGAAGC TCGACCGCGT CCTGCCCAGC
AACGTCTCCA AGGGCTGGGA GCTCGGCGCG GCGGCCGCGG ACCTCGACGG CGACCTGCTG
CCCGAGCTGT ACCTCGCCAA CGACTTCGGT CCCGACCGGC TGCTGTACAA CCGGTCCAAG
CCGGGTGAGA TCAAGTTCTC GGTGGTGGAG GGCGTGCGCA CGCCGCTCAT CCCCAAGTCC
AAGTCGATCG GCCACGACTC CTTCAAGGGC ATGGGCCTCG ACTTCGGCGA CCTGAACGGC
GACGGCATCT ACGACATGTT CGTCAGCAAC ATCACCACCT CGTTCGGCAT CGAGGAGAGC
AACCTCCAGT TCATGTCCAC CGCCAAGGAC CAGGCCGACC TGCGCGCGAA GCTCGCCAAG
GGCGAGGCGC CGTGGGAGGA CCGCAGCGCC CAGCAGTCCA CCGCCTGGAC GGGCTGGGGC
TGGGACGTCA AGATCGACGA CTTCAACAAC AGCGGCGACC TGGCGATCGC CCAGGCCACC
GGCTTCGTCA AGGGTGAGGT CAACCGCTGG CCGGTGCTGC AGGAGCTGGC GACCGCCAAC
GACGGCGTCC TGCGCAATCC GAGCTCCTGG CCCAAGGTCC GGGCCGGTGA CGACGTCGCG
GGGCACCAGA CCCTCGCCTT CTTCGCCAAG AACGAGGACG GCCGGTACGC CAACATCTCC
GAGCAGCTCG GCCTGGCCAT CCCGGTGCCC ACCCGCGGCC TGGCCACCGG CGACTCCAAC
GGCGACGGCC TGCTCGACCT CGCGGTCGCC CGGCAGTGGG ACGAGCCGGT GTTCTACCAG
AACAAGAGCC CGAACCCCGG AGCCTTCCTC GGGCTGAAGC TGACCCACGA GGGCCCCGCC
ACCACCGGCG CCCCGGCCGC GGGCACGCTC CCGGCCCCGG GCTCGGCCGC GATCGGCACG
GAGGTGACGG TCACCACGCC CGACGGGCGC AAGACCATCG CCCGGGTCGA CGGCGGCAGC
GGCCACTCCG GCCGGCGCAG CCACGACGTG CACATCGGCC TGGGGCCGAA CGTGACCGGA
CCGGTGCAGG TCCGTCTGTG CTGGCGCGAC AGGACCGGAC AGATCCACGA CCAGACCCTT
CAACTGACCC CGGGCTGGCA CTCCCTCCAG CTCGGCTCTC AGGCCAAGGA GAAGTGA
 
Protein sequence
MTTTAGWLRK QLPGIVALVL MTGAFVVSRP PTSSADEKAD IASRYGFTPM NIAMPGGFTQ 
KTMRAVNKRY THIDAWISSV GAGVAMNDLD GDGLANDLCI TDPRIDQVVV TPTPGERSGR
YAPFALNPGT LPMNEAMAPM GCLPGDFNED GRTDLLVYLW GRTPIVYLAK ADAQSLGAGA
YLPTELVPGA GGAEYTGPLW NSNTATFADF DGDGHGDIFI GNYFPHSPVL NDKINGGVEM
NDSMSRGLNG GENYFFRWTA GSAGAQPSAT FQKLDRVLPS NVSKGWELGA AAADLDGDLL
PELYLANDFG PDRLLYNRSK PGEIKFSVVE GVRTPLIPKS KSIGHDSFKG MGLDFGDLNG
DGIYDMFVSN ITTSFGIEES NLQFMSTAKD QADLRAKLAK GEAPWEDRSA QQSTAWTGWG
WDVKIDDFNN SGDLAIAQAT GFVKGEVNRW PVLQELATAN DGVLRNPSSW PKVRAGDDVA
GHQTLAFFAK NEDGRYANIS EQLGLAIPVP TRGLATGDSN GDGLLDLAVA RQWDEPVFYQ
NKSPNPGAFL GLKLTHEGPA TTGAPAAGTL PAPGSAAIGT EVTVTTPDGR KTIARVDGGS
GHSGRRSHDV HIGLGPNVTG PVQVRLCWRD RTGQIHDQTL QLTPGWHSLQ LGSQAKEK