Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_5287 |
Symbol | |
ID | 8668581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5803368 |
End bp | 5805419 |
Gene Length | 2052 bp |
Protein Length | 683 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Dipeptidyl-peptidase IV |
Protein accession | YP_003340798 |
Protein GI | 271966602 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.645423 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTCTG ACAAACTCCT CGCTCGTCTC GCCGCCACAG GGCGGTTCGC CTACGGCTCT CCCCGAGCCC TGACCATCTC CCCGGACGGG TCGCTGGTGC TGTTCCTGCG TTCCAGCGGC CCCGAAGACC CCGTCGAGCG GCTCTGGGCG CTCGACGTCG CCACCGGCGT GGAACGGCTG CTGGCCGAAC CCGCAGAGGG CCCGCACGAT CTACCGGCCG AGGAGCGCGC CCTGCGCGAG CGGCTGCGCC TGTGGGCGCC CGGTATCGGC TCCTACGCCA CCGACGGCTC CGTCGTCGTG TACGCGCTGG GCGGACGGCT CTTCCGGGTG GACGTCACCA CCGCCACCGT GGAGGAGATC CCGGCCGCCG GCCCCGTCTT CGACCCCCGG CCGTCCGGGG ACAGGATCGC CTACGTGTCC GCCGGGCGGA TCTACGTCCA CGAGAACGGC GGCGACCGCC TGATCGCCGG CGAATCCCAC GTCACCTGGG GTGTCGCGGA GTTCGCCGCC GCCGAGGAGC TCGGCAGGCA CCGGGGGCAC TGGTGGGCGC CCGACGGTTC GGCGCTCCTG GCCACCCGGG TGGACGAGAC CCGCGTGCGG CGCCGGCGAC TGGCCGACCC CTCCCGGCCC GAGCTGGCGG CGCCGGAGCC GGCCTATCCG CAGGCGGGCG GTCCGAACGC GGCCGTCGAG CTGCACCTGC TCGGGCTGGA CGGCGCGCGC AGGCAGATCC TGTGGGACGA CATCGGGTTC CCCTACCTGT CGGCGGTGGA CTGGAGCGAA CCCGACCGGC CCACGATCAC CGTGCTGGAC CGGCTCCAGC AGAACGGCCG GCTCCTCGCG GTCGACCTCG CCACCGGCAC GACCCTCCCC CTGGCGGAGT TCTCCGACCC GCGCTGGATC GAGGCCGTGC CCGGCACCCC CTCGCTCCTG GCCGGCGGCC GGATCCTGAC CGCGGTCGAG GCAGACGACA CCCAGGCACT GGCGATCGAC GGCACCGTGG TGACCCCCTC GTGGCTGTAC GTGCGGCGTG TCGCGGGCTC CCTGGGCCAC GACCTGCTCA TCGAGGGCAG CGAGGCCGAC CCCGCCGAGC AGCACGTCTA CCGCCTCGGG CCAACCGGTG CGCTGGCCAG GATCACCCAT GAGCCCGGCG TGCACAGTGC CCTGACCGGC GGCCCGACCG TGGTCCTGAC CAGCGGTTCG CTGGAGACTG CCCGCACCCG CAGGGTCGTC TACCCCGGTG GTCTCGTACA GGACGCCGCC GTGGAGCTCG GCGACCTGTC GGCGGCGTCG CCGTACCGGC CGGAGCCGGT TCTGGACCGG GTGACCGAGC ACCGCCTGCC CTCCGCGGTG CTCTACCCGA GCGGGCACGT GCCCGGCGAG AAGCTGCCGG TCCTCCTCGA CGTGTACGGC GGGCCCGGCT ACCAGGCGAT CGCCGCCGAA CCGGGCCGGT GGATCCGCAA GCAGTGGTGG GCCGACCAGG GTTTCGCCGT CGTGACGATC GACAACCGGG GCACGCCGAA CGTATCGGTC TCCTTCAGTC AGGCGATCTT CCGGCGCTTC TCGCAGGTGA CGCTGGACGA CCAGGTCGCG GGGCTGCACG AGCTCGCGGG CAAGCACCCC GACCTCGACC TGTCCAGGGT CGGCGTGCGC GGCTGGTCGT ACGGCGGCTA CTTCGCGGCA CTGGCCGTGC TCCGGCGCCC CGACGTCTTT CACGCCGCCT GCGCGGGCGC CCCTCCCACC GACTTCCGCT GGTACGACAC CGCCTACACC GAGCGATACC TCGGCCTGCC CGAGGAGAAC GCCTCCGGAT ACGACGGCGA CTCGCTGATC GCCGACGCCC CCCGGCTTGA GCGGCCGCTG CTGCTCATCC ACGGCCTGGC CGACGACAAC GTCTATCCGC TGCACACCCT GCGCCTGTCC GAGGCCCTGA CCCGGGCCGG ACGGCCGCAC TCCACGCTCC TGCTTCCCGG TGTCAGCCAC ATGACCCCGG ACGGCGTCGC GGAGAACCTC ATGGCGATCG AACTCGACTT CCTCCGCCGC AATCTCCGCT GA
|
Protein sequence | MLSDKLLARL AATGRFAYGS PRALTISPDG SLVLFLRSSG PEDPVERLWA LDVATGVERL LAEPAEGPHD LPAEERALRE RLRLWAPGIG SYATDGSVVV YALGGRLFRV DVTTATVEEI PAAGPVFDPR PSGDRIAYVS AGRIYVHENG GDRLIAGESH VTWGVAEFAA AEELGRHRGH WWAPDGSALL ATRVDETRVR RRRLADPSRP ELAAPEPAYP QAGGPNAAVE LHLLGLDGAR RQILWDDIGF PYLSAVDWSE PDRPTITVLD RLQQNGRLLA VDLATGTTLP LAEFSDPRWI EAVPGTPSLL AGGRILTAVE ADDTQALAID GTVVTPSWLY VRRVAGSLGH DLLIEGSEAD PAEQHVYRLG PTGALARITH EPGVHSALTG GPTVVLTSGS LETARTRRVV YPGGLVQDAA VELGDLSAAS PYRPEPVLDR VTEHRLPSAV LYPSGHVPGE KLPVLLDVYG GPGYQAIAAE PGRWIRKQWW ADQGFAVVTI DNRGTPNVSV SFSQAIFRRF SQVTLDDQVA GLHELAGKHP DLDLSRVGVR GWSYGGYFAA LAVLRRPDVF HAACAGAPPT DFRWYDTAYT ERYLGLPEEN ASGYDGDSLI ADAPRLERPL LLIHGLADDN VYPLHTLRLS EALTRAGRPH STLLLPGVSH MTPDGVAENL MAIELDFLRR NLR
|
| |