Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_7448 |
Symbol | |
ID | 8670769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 8231990 |
End bp | 8233870 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | ABC-type dipeptide transport system periplasmic component-like protein |
Protein accession | YP_003342874 |
Protein GI | 271968678 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.338891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGACAG GCAACTCGCG GCGCAGGCTG GCCCTTGCCG CTTGCGCGCT GAGCGGTTCA CTGGTGGCCG CTGCCACCAC CTTCGGCGCC GCAGCCGCCG CAGCCGCCGC AGCCGCCGCG CCGGACGCGT CCGTGTCCGC GTCGCAGGAG ACCACGCTGC GGGTGAAGAT GTCGGGCGCG GGCGTCGACA CGCTCAACCC GTTCCTCGCC TTCTTCAACG GCGCCCTGGA CATCTTCGGC TCGATCTACC CCACGCTGAA CTCGCTGGAC GAGAACGGCA AGCCGGGCCC GTACCTGGCC GAGTCGTGGA CGCCGTCGGA GGACAAGCTC ACCTGGACCT TCAAGCTCAA GGACGGTCTG AAGTGGAGCG ACGGCAAGCC GATCACGGCT GAGGACGCCG CCTGGACCCT CAACCTGATC ATGACCGACA CGGTCGCGGG CACCGCCAAC GGCTCGCTGG TCGGCAACTT CGAGTCCGTC ACCGCGACCG ACCCTACGAC GCTGGTCATC AAGACCAAGG CGCCGCAGGC CAACGTCGTC TACGTGAGCA TCCCGATCAG CGGCATCCCG ATCGTGCCGA AGCACATCTG GGAGCCGCGG GCCAAGAACC TCAAGGACGC CAAGAACGAC ACCTTCCCCG TCGTCGGCTA CGGCCCGTGG ACGCTCACGG ACTACAAGCC CGAGCAGTAC GCGAAGTTCG ACGCCAACAA GGACTTCATC CTCGGCAGGC CCGGCTTCGA CCACATGATC CAGCAGAGCT TCAAGTCGAC CGACGCCGCC GTCGCCGCGC TGCGCAGCGG CCAGCTGGAC TACATCAACG CCGTGAACCC GACCCAGTTC AAGGCGCTGC AGGCGGACAA GAGCCTGCTG ACCGCCCAGG AGGTCGGCAA CGGCTGGACC GGTGTCGAGG TCAACCACAA CGCCCGCACC CGCACCGGCA AGAAGATCGG CACCGGCCAT CCCGCGCTCG GCGACCCGGT GCTGCGCCGC GCGATCTCGC TGGCCACCGA CAGGAAGACG CTGGTCACCA AGGTGCTCGA CGGCATGGGC GTGGCCGGCT CCGGCTACCT GCCCCCGGCC TGGCCGCAGT GGAGCTGGAA GCCGGCCGCG GGCCAGGAGA CGCCGTTCGA CCTCGCCCAG GCCGGCAAGA TCCTCGACGA CGCCGGCTAC ACCAAGGGCG CCGACGGCGT CCGCGTCGAC CCCAAGAGCG GCAGGCCGCT GGAGCTGCGC CTCGGCATCC ACTCCGACGA CACGGCCGAC GCGGGCATCT CCACCTACCT CAAGGGCTGG CTGGAGACGA TCGGGATCAA GCTCAAGATC CAGACGCTGA GCATGAGCGC GCTCAACAGC GACCTGGCCA AGGGCGACTG GGACCTGCTG ATGGACGGCT GGACCACCGG CCCCGACCCG ACCTACCTGC TCGGCATCCA GACCTGCGCC ACGCTGCCCA AGGACGACGG CACCGGCGGC AACACCGACG CCTTCTTCTG CGACGAGGCC TACGACGAGC TGTTCAAGAA GCAGCTCACC ACGTTCGACC AGGACGAGCG GGCCAAGGTC GTCGCCGAGA TGCAGGACAT CCTGTACAAG GCCGACGTCG ACCAGATCCA CTTCTACGCC AACACCCTCG ACGTCGCCCG CACCGACACC GTGACCGGCC TGATCACCGG GCAGCCGGAC GCGCAGGGCA TGTACCCGGC GCAGACCGCG TTCTGGAGCT ACCTCAAGGC CGCGCCGCCC GCCGCCAAGC CGGCCGCCGC CTCGGCCGGG GAGGAGAGCG GCGGCGGCCA GCTGTGGGTC GGCGCGGGCG TCCTGCTCCT CGCGCTCGTC GGCGGCGGGA TCGTGCTCAG GCGCCGCTCG GGCGCGGGCG ACCGTGAGTA G
|
Protein sequence | MGTGNSRRRL ALAACALSGS LVAAATTFGA AAAAAAAAAA PDASVSASQE TTLRVKMSGA GVDTLNPFLA FFNGALDIFG SIYPTLNSLD ENGKPGPYLA ESWTPSEDKL TWTFKLKDGL KWSDGKPITA EDAAWTLNLI MTDTVAGTAN GSLVGNFESV TATDPTTLVI KTKAPQANVV YVSIPISGIP IVPKHIWEPR AKNLKDAKND TFPVVGYGPW TLTDYKPEQY AKFDANKDFI LGRPGFDHMI QQSFKSTDAA VAALRSGQLD YINAVNPTQF KALQADKSLL TAQEVGNGWT GVEVNHNART RTGKKIGTGH PALGDPVLRR AISLATDRKT LVTKVLDGMG VAGSGYLPPA WPQWSWKPAA GQETPFDLAQ AGKILDDAGY TKGADGVRVD PKSGRPLELR LGIHSDDTAD AGISTYLKGW LETIGIKLKI QTLSMSALNS DLAKGDWDLL MDGWTTGPDP TYLLGIQTCA TLPKDDGTGG NTDAFFCDEA YDELFKKQLT TFDQDERAKV VAEMQDILYK ADVDQIHFYA NTLDVARTDT VTGLITGQPD AQGMYPAQTA FWSYLKAAPP AAKPAAASAG EESGGGQLWV GAGVLLLALV GGGIVLRRRS GAGDRE
|
| |