Gene Sros_5287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5287 
Symbol 
ID8668581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5803368 
End bp5805419 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content73% 
IMG OID 
ProductDipeptidyl-peptidase IV 
Protein accessionYP_003340798 
Protein GI271966602 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.645423 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTCTG ACAAACTCCT CGCTCGTCTC GCCGCCACAG GGCGGTTCGC CTACGGCTCT 
CCCCGAGCCC TGACCATCTC CCCGGACGGG TCGCTGGTGC TGTTCCTGCG TTCCAGCGGC
CCCGAAGACC CCGTCGAGCG GCTCTGGGCG CTCGACGTCG CCACCGGCGT GGAACGGCTG
CTGGCCGAAC CCGCAGAGGG CCCGCACGAT CTACCGGCCG AGGAGCGCGC CCTGCGCGAG
CGGCTGCGCC TGTGGGCGCC CGGTATCGGC TCCTACGCCA CCGACGGCTC CGTCGTCGTG
TACGCGCTGG GCGGACGGCT CTTCCGGGTG GACGTCACCA CCGCCACCGT GGAGGAGATC
CCGGCCGCCG GCCCCGTCTT CGACCCCCGG CCGTCCGGGG ACAGGATCGC CTACGTGTCC
GCCGGGCGGA TCTACGTCCA CGAGAACGGC GGCGACCGCC TGATCGCCGG CGAATCCCAC
GTCACCTGGG GTGTCGCGGA GTTCGCCGCC GCCGAGGAGC TCGGCAGGCA CCGGGGGCAC
TGGTGGGCGC CCGACGGTTC GGCGCTCCTG GCCACCCGGG TGGACGAGAC CCGCGTGCGG
CGCCGGCGAC TGGCCGACCC CTCCCGGCCC GAGCTGGCGG CGCCGGAGCC GGCCTATCCG
CAGGCGGGCG GTCCGAACGC GGCCGTCGAG CTGCACCTGC TCGGGCTGGA CGGCGCGCGC
AGGCAGATCC TGTGGGACGA CATCGGGTTC CCCTACCTGT CGGCGGTGGA CTGGAGCGAA
CCCGACCGGC CCACGATCAC CGTGCTGGAC CGGCTCCAGC AGAACGGCCG GCTCCTCGCG
GTCGACCTCG CCACCGGCAC GACCCTCCCC CTGGCGGAGT TCTCCGACCC GCGCTGGATC
GAGGCCGTGC CCGGCACCCC CTCGCTCCTG GCCGGCGGCC GGATCCTGAC CGCGGTCGAG
GCAGACGACA CCCAGGCACT GGCGATCGAC GGCACCGTGG TGACCCCCTC GTGGCTGTAC
GTGCGGCGTG TCGCGGGCTC CCTGGGCCAC GACCTGCTCA TCGAGGGCAG CGAGGCCGAC
CCCGCCGAGC AGCACGTCTA CCGCCTCGGG CCAACCGGTG CGCTGGCCAG GATCACCCAT
GAGCCCGGCG TGCACAGTGC CCTGACCGGC GGCCCGACCG TGGTCCTGAC CAGCGGTTCG
CTGGAGACTG CCCGCACCCG CAGGGTCGTC TACCCCGGTG GTCTCGTACA GGACGCCGCC
GTGGAGCTCG GCGACCTGTC GGCGGCGTCG CCGTACCGGC CGGAGCCGGT TCTGGACCGG
GTGACCGAGC ACCGCCTGCC CTCCGCGGTG CTCTACCCGA GCGGGCACGT GCCCGGCGAG
AAGCTGCCGG TCCTCCTCGA CGTGTACGGC GGGCCCGGCT ACCAGGCGAT CGCCGCCGAA
CCGGGCCGGT GGATCCGCAA GCAGTGGTGG GCCGACCAGG GTTTCGCCGT CGTGACGATC
GACAACCGGG GCACGCCGAA CGTATCGGTC TCCTTCAGTC AGGCGATCTT CCGGCGCTTC
TCGCAGGTGA CGCTGGACGA CCAGGTCGCG GGGCTGCACG AGCTCGCGGG CAAGCACCCC
GACCTCGACC TGTCCAGGGT CGGCGTGCGC GGCTGGTCGT ACGGCGGCTA CTTCGCGGCA
CTGGCCGTGC TCCGGCGCCC CGACGTCTTT CACGCCGCCT GCGCGGGCGC CCCTCCCACC
GACTTCCGCT GGTACGACAC CGCCTACACC GAGCGATACC TCGGCCTGCC CGAGGAGAAC
GCCTCCGGAT ACGACGGCGA CTCGCTGATC GCCGACGCCC CCCGGCTTGA GCGGCCGCTG
CTGCTCATCC ACGGCCTGGC CGACGACAAC GTCTATCCGC TGCACACCCT GCGCCTGTCC
GAGGCCCTGA CCCGGGCCGG ACGGCCGCAC TCCACGCTCC TGCTTCCCGG TGTCAGCCAC
ATGACCCCGG ACGGCGTCGC GGAGAACCTC ATGGCGATCG AACTCGACTT CCTCCGCCGC
AATCTCCGCT GA
 
Protein sequence
MLSDKLLARL AATGRFAYGS PRALTISPDG SLVLFLRSSG PEDPVERLWA LDVATGVERL 
LAEPAEGPHD LPAEERALRE RLRLWAPGIG SYATDGSVVV YALGGRLFRV DVTTATVEEI
PAAGPVFDPR PSGDRIAYVS AGRIYVHENG GDRLIAGESH VTWGVAEFAA AEELGRHRGH
WWAPDGSALL ATRVDETRVR RRRLADPSRP ELAAPEPAYP QAGGPNAAVE LHLLGLDGAR
RQILWDDIGF PYLSAVDWSE PDRPTITVLD RLQQNGRLLA VDLATGTTLP LAEFSDPRWI
EAVPGTPSLL AGGRILTAVE ADDTQALAID GTVVTPSWLY VRRVAGSLGH DLLIEGSEAD
PAEQHVYRLG PTGALARITH EPGVHSALTG GPTVVLTSGS LETARTRRVV YPGGLVQDAA
VELGDLSAAS PYRPEPVLDR VTEHRLPSAV LYPSGHVPGE KLPVLLDVYG GPGYQAIAAE
PGRWIRKQWW ADQGFAVVTI DNRGTPNVSV SFSQAIFRRF SQVTLDDQVA GLHELAGKHP
DLDLSRVGVR GWSYGGYFAA LAVLRRPDVF HAACAGAPPT DFRWYDTAYT ERYLGLPEEN
ASGYDGDSLI ADAPRLERPL LLIHGLADDN VYPLHTLRLS EALTRAGRPH STLLLPGVSH
MTPDGVAENL MAIELDFLRR NLR