Gene Information |
Locus tag | Sros_5913 |
Symbol | |
ID | 8669207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 6481723 |
End bp | 6483504 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | ABC-type dipeptide transport system periplasmic component-like protein |
Protein accession | YP_003341391 |
Protein GI | 271967195 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0808698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0457082 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGAA AACCCGCAGT CGCGACGTTC GCGGTCACCG CGGCGCTCGC CTTGGGCCTG TCGGCCTGCG GTGGAACCAG CACCGACTCG CCCAAGGAAA GCGCGGGTGC CGGCACTGGG GCGGCCGCTC CGGCCCAGGC CGAGTTCAAC GCCGCGATGA CCAAGGTGTT CAACCCTTCG ACCAAGAAGG GCGGCACCCT CAAGTTCGTC AACTCCGGTG ACTGGGACTC GCTGGACCCG GCCGACACCT ACTACGGCTA CTCGTGGAAC TTCATGCGCC TGTACGGCCG GGCGCTGACG GTGTTCAAGG CCGCGCCGGG CGCCGAGGGC GCGACCGTCG CACCCGACCT GGCCAAGGAC CTCGGCAAGC CGAGCGCCGA CTTCAAGACC TGGACCTACA CGCTGCGTGA GGGTCTGAAG TTCGAGGACG GCACCCCGAT CACGTCCAAG GACGTGGCCT ACGCCGTGGC GCGCGCGTTC GACAAGGAGA CCTTCCCGAA CGGTCCCACC TACCTGAACG AGATGCTCGA CTGGCCCAAG GACTACAAGG GCGCCTACAA GTCGAAGGAC GCCGACTTCA GCTCGGCCAT CGAGACGCCG GACGACTCCA CGATCGTCTT CCACCTGAAG CAGCCCTACA GCGGCTTCGA CTACATCACC CAGATGTCGC CGACCGTGCC GGTGCCGAAG GCCAAGGACA CGGGCGCGAA GTACCGCGAG CACGTGATCT CCTCCGGCCC GTACATGTTC GAGAAGAACG AGATCGGCAA GGGCTTCTCG CTCGTCCGCA ACCCGAACTG GGACGCGGCG ACCGACCCCA ACCGCCCGGC GCTGCCGGAC CGGATCGAGG TCCAGACCAA CGTCAACGCC GACGACCTCG ACAACCGTCT CCTCTCCGGT GACATCCACG TCGACATCGC GGGCACCGGT GTCCAGCCGG CCGCGATGAG CAAGATCCTG CCGGACCCCG CCCTCAAGGC GCGGGCCGAC AACCCGACGC TGCAGCGTCT CTGGTACACC TCGGTCAGCC CGACGGTGAA GCCGCTGGAC AACATCGACT GCCGCAAGGC CGTCCAGTAC GCGGCCGACA AGACCGGCTA CCAGGCCGCC TACGGCGGTG AGTTCTCCGG TGGCGCGATC GCGACCAGCC TCATGCCGCC GAGCGTCCCG GGCGCCGCGA AGATCGACCT GTACCCCAGC GGTGCTGACG GCAAGGGTGA CCTGGCCAAG GCCAAGGAGC ACCTGGCCGC CTGCGGCCAG CCGAACGGCT TCGAGACCAA CATCTCCTAC CGCGCCGAGC GGCCGAAGGA GAAGGCGACC GCCGAGGCCC TGCAGGAGTC GCTGGCCCGC GTCGGCATCA AGCTGACCCT GAAGCCGTAC CCGCAGGCGG ACTACTTCTC GCTGTACGCC GGCAAGCCGC CGTTCGTCGT GGAGAACAAG CTCGGCCTGG CCGTCAACGG CTGGGGCTCG GACTACCCCG ACGGCTACGG CTTCCTGCAG CAGATGGTCG ACAGCCGGGT CATCCGGGAG ACCGGTGGCT CCTCCAACGT CAGCGTCCGC ATCCCCGATG TGGACAAGAT GCTCGACGAG TCCCTGCTCG AGGCCGACGC CAAGAAGCGT GAGCCGATGT GGGCCGCCAT CGACAAGCGC GTCATGGAAG AGGCGGTCAT CCTGCCGGGT GTCTGGGCCA AGAGCCTGCT GGTCCGCGGC CAGGGTGTGA CCAACGTCTT CATCAGTGAC GGCCAGCAGA TGTACGACTA CGTCGCGATG GGCGTCGAGT AA
|
Protein sequence | MKRKPAVATF AVTAALALGL SACGGTSTDS PKESAGAGTG AAAPAQAEFN AAMTKVFNPS TKKGGTLKFV NSGDWDSLDP ADTYYGYSWN FMRLYGRALT VFKAAPGAEG ATVAPDLAKD LGKPSADFKT WTYTLREGLK FEDGTPITSK DVAYAVARAF DKETFPNGPT YLNEMLDWPK DYKGAYKSKD ADFSSAIETP DDSTIVFHLK QPYSGFDYIT QMSPTVPVPK AKDTGAKYRE HVISSGPYMF EKNEIGKGFS LVRNPNWDAA TDPNRPALPD RIEVQTNVNA DDLDNRLLSG DIHVDIAGTG VQPAAMSKIL PDPALKARAD NPTLQRLWYT SVSPTVKPLD NIDCRKAVQY AADKTGYQAA YGGEFSGGAI ATSLMPPSVP GAAKIDLYPS GADGKGDLAK AKEHLAACGQ PNGFETNISY RAERPKEKAT AEALQESLAR VGIKLTLKPY PQADYFSLYA GKPPFVVENK LGLAVNGWGS DYPDGYGFLQ QMVDSRVIRE TGGSSNVSVR IPDVDKMLDE SLLEADAKKR EPMWAAIDKR VMEEAVILPG VWAKSLLVRG QGVTNVFISD GQQMYDYVAM GVE
|
| |
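
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database: it assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced and ends in a single stop codon, and that the spaces in the sequence fields are display grouping only. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_groups(seq: str) -> str:
    """Remove the whitespace that separates the display groups."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty string)."""
    s = strip_groups(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


length = gene_length(6481723, 6483504)
print(length)                  # 1782, matching the Gene Length field
print(protein_length(length))  # 593, matching the Protein Length field
```

Passing the full Gene sequence value to `gc_fraction` gives a figure that can be compared with the GC content field; the ungrouped string from `strip_groups` is also the form most downstream tools expect.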