Gene Sros_5913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5913 
Symbol 
ID8669207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6481723 
End bp6483504 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content67% 
IMG OID 
ProductABC-type dipeptide transport system periplasmic component-like protein 
Protein accessionYP_003341391 
Protein GI271967195 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0808698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0457082 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGAA AACCCGCAGT CGCGACGTTC GCGGTCACCG CGGCGCTCGC CTTGGGCCTG 
TCGGCCTGCG GTGGAACCAG CACCGACTCG CCCAAGGAAA GCGCGGGTGC CGGCACTGGG
GCGGCCGCTC CGGCCCAGGC CGAGTTCAAC GCCGCGATGA CCAAGGTGTT CAACCCTTCG
ACCAAGAAGG GCGGCACCCT CAAGTTCGTC AACTCCGGTG ACTGGGACTC GCTGGACCCG
GCCGACACCT ACTACGGCTA CTCGTGGAAC TTCATGCGCC TGTACGGCCG GGCGCTGACG
GTGTTCAAGG CCGCGCCGGG CGCCGAGGGC GCGACCGTCG CACCCGACCT GGCCAAGGAC
CTCGGCAAGC CGAGCGCCGA CTTCAAGACC TGGACCTACA CGCTGCGTGA GGGTCTGAAG
TTCGAGGACG GCACCCCGAT CACGTCCAAG GACGTGGCCT ACGCCGTGGC GCGCGCGTTC
GACAAGGAGA CCTTCCCGAA CGGTCCCACC TACCTGAACG AGATGCTCGA CTGGCCCAAG
GACTACAAGG GCGCCTACAA GTCGAAGGAC GCCGACTTCA GCTCGGCCAT CGAGACGCCG
GACGACTCCA CGATCGTCTT CCACCTGAAG CAGCCCTACA GCGGCTTCGA CTACATCACC
CAGATGTCGC CGACCGTGCC GGTGCCGAAG GCCAAGGACA CGGGCGCGAA GTACCGCGAG
CACGTGATCT CCTCCGGCCC GTACATGTTC GAGAAGAACG AGATCGGCAA GGGCTTCTCG
CTCGTCCGCA ACCCGAACTG GGACGCGGCG ACCGACCCCA ACCGCCCGGC GCTGCCGGAC
CGGATCGAGG TCCAGACCAA CGTCAACGCC GACGACCTCG ACAACCGTCT CCTCTCCGGT
GACATCCACG TCGACATCGC GGGCACCGGT GTCCAGCCGG CCGCGATGAG CAAGATCCTG
CCGGACCCCG CCCTCAAGGC GCGGGCCGAC AACCCGACGC TGCAGCGTCT CTGGTACACC
TCGGTCAGCC CGACGGTGAA GCCGCTGGAC AACATCGACT GCCGCAAGGC CGTCCAGTAC
GCGGCCGACA AGACCGGCTA CCAGGCCGCC TACGGCGGTG AGTTCTCCGG TGGCGCGATC
GCGACCAGCC TCATGCCGCC GAGCGTCCCG GGCGCCGCGA AGATCGACCT GTACCCCAGC
GGTGCTGACG GCAAGGGTGA CCTGGCCAAG GCCAAGGAGC ACCTGGCCGC CTGCGGCCAG
CCGAACGGCT TCGAGACCAA CATCTCCTAC CGCGCCGAGC GGCCGAAGGA GAAGGCGACC
GCCGAGGCCC TGCAGGAGTC GCTGGCCCGC GTCGGCATCA AGCTGACCCT GAAGCCGTAC
CCGCAGGCGG ACTACTTCTC GCTGTACGCC GGCAAGCCGC CGTTCGTCGT GGAGAACAAG
CTCGGCCTGG CCGTCAACGG CTGGGGCTCG GACTACCCCG ACGGCTACGG CTTCCTGCAG
CAGATGGTCG ACAGCCGGGT CATCCGGGAG ACCGGTGGCT CCTCCAACGT CAGCGTCCGC
ATCCCCGATG TGGACAAGAT GCTCGACGAG TCCCTGCTCG AGGCCGACGC CAAGAAGCGT
GAGCCGATGT GGGCCGCCAT CGACAAGCGC GTCATGGAAG AGGCGGTCAT CCTGCCGGGT
GTCTGGGCCA AGAGCCTGCT GGTCCGCGGC CAGGGTGTGA CCAACGTCTT CATCAGTGAC
GGCCAGCAGA TGTACGACTA CGTCGCGATG GGCGTCGAGT AA
 
Protein sequence
MKRKPAVATF AVTAALALGL SACGGTSTDS PKESAGAGTG AAAPAQAEFN AAMTKVFNPS 
TKKGGTLKFV NSGDWDSLDP ADTYYGYSWN FMRLYGRALT VFKAAPGAEG ATVAPDLAKD
LGKPSADFKT WTYTLREGLK FEDGTPITSK DVAYAVARAF DKETFPNGPT YLNEMLDWPK
DYKGAYKSKD ADFSSAIETP DDSTIVFHLK QPYSGFDYIT QMSPTVPVPK AKDTGAKYRE
HVISSGPYMF EKNEIGKGFS LVRNPNWDAA TDPNRPALPD RIEVQTNVNA DDLDNRLLSG
DIHVDIAGTG VQPAAMSKIL PDPALKARAD NPTLQRLWYT SVSPTVKPLD NIDCRKAVQY
AADKTGYQAA YGGEFSGGAI ATSLMPPSVP GAAKIDLYPS GADGKGDLAK AKEHLAACGQ
PNGFETNISY RAERPKEKAT AEALQESLAR VGIKLTLKPY PQADYFSLYA GKPPFVVENK
LGLAVNGWGS DYPDGYGFLQ QMVDSRVIRE TGGSSNVSVR IPDVDKMLDE SLLEADAKKR
EPMWAAIDKR VMEEAVILPG VWAKSLLVRG QGVTNVFISD GQQMYDYVAM GVE