Gene Sros_3167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3167 
Symbol 
ID8666455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3448556 
End bp3450184 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content66% 
IMG OID 
ProductABC-type dipeptide transport system periplasmic component-like protein 
Protein accessionYP_003338855 
Protein GI271964659 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACATGC GTACCCTCGG ACGCTGGTCC GGCGCCGTCA CGGCCGTCGT GACCGCGCTG 
ACGCTCACGT CGTGCGGCCT CAACGAGGGC CAGGACGCCT CGAAGCCGGC CGGTGGCGAC
AGCTCCACCA CGCTGCGCAT CGGCACCACC ACCGACGTCG CGAACTTCAA CCCGCTGCAG
TCGCTGAGCA AGACCGACAA CTGGATTCTC AACGCGATGT ACCCACACCT GCTGCGGATC
GACGGGGACG CCAAGAAGGT CCCCGAGCTC GCCAGCAAGT ACACCCACGA AGACGGTGGC
AAGTCGGTCG TCTTCACCCT GCGCGACGAT TTCGTGTGGA GTGACGGCAC CCCGGTGACG
TCGGCCGACG TGAAGTACTC GGCCGAGACG ATCATGAAGA ACAAGCTCGG GAACGTGGCC
GCGAAGCTCA CCTGGGTCGA GGGCATCGAG GCGCCCGACG CCACCACGGT GGTCTTCAAG
CTCAGCCAGC CCTACGCGCC GTTCGCCGAA GGCGTGGGCT TCTGGATGTC CATCGTCCCC
AAGCACATCT TCGAGAAGGC CGGCGACATC ACCAAGTTCG CCAACAACTC CGATTGGGTC
GGCGCGGGCG CCTACATCCT GAAGAGCGCC ACGCCCGGTC AGCGCTACGT GATGACGCGC
AACGACAAGT ACCCCTACGC GCCCGCGGGC GGCGCGGCCG TCGACACCGT CGAGTACCGG
CTGTACCCGG ACGTCAACAC CATGCAGCTC GCGTTGCGCA ACGGCGACAT CGACCTGATG
GGCACGCCGG TGCCCGCCTC GGCCATCGCC TCGTTCAGCG GCGACGAGAA GATCAAGCTG
CAGGAGGTCG GCTCGCTGGG CTTCGCCCAC ATCACCTACA ACGTCACCAA CGAGCACCTG
GCCCGGCCGA AGGTCCGGCA GGCGCTGTCG ATGGTCGTCG ACACCAAGTC GATCATCTCT
ACGGTGCTGC AGGGTGAGGG CTCACCGATG ACCGGCCCGA TCTCACCGAT CTTCGCGGAG
TACGACAACA CCGAGCTCCA GCCGTACCCG TTCGACCCGG CCGCCGCGCG CAAGCTGCTC
GAAGAGGACG GATACGCGGA CAAGAACGGC GACGGGAAGC TCGACGGCCT GTCCTTCGAG
ATGGTCTGCG ACCAGAGCAA CCCGAACCTG ACCCGGGTCG CCCAGGTGGT GCGGGAGGAC
GCCGCCAAGG CCGGCGTCGA ACTCGTCGCG TCGTGTGTCG AGCGGAACAC GTTCCTCAGC
AGGACCAAGA GCGGCGACTA CGACCTCGAC GTCTCGCAGT GGGCCGTCTT CGACAACCCG
ATGGACCAGC TCCGCAGCAC CTACCTGTCG AGCAACCCGG GCGGCATCAA CTACAACCTG
GTCAAGGACC CGAAGCTCGA CAAGCTCATC GACGAGGCCG CCGTCACGAC CGACCACGAC
AAGTTCGCGG GGAAGATCAA GGACCTTGAC GCCTACGTGC ACGAGCAGGC GCTGCTGACG
CCGCTCTACG TCGAGAAGAT CCAGTTCGCC TACAACGCCG GCAAGTTCAC CGGCTTCCAG
CCCTCGCCCA GCGATCTGCT CGGCATGGTG ACCGGCTACT CGCTGTCGCA GGTCCGACCC
GTCGGCTGA
 
Protein sequence
MNMRTLGRWS GAVTAVVTAL TLTSCGLNEG QDASKPAGGD SSTTLRIGTT TDVANFNPLQ 
SLSKTDNWIL NAMYPHLLRI DGDAKKVPEL ASKYTHEDGG KSVVFTLRDD FVWSDGTPVT
SADVKYSAET IMKNKLGNVA AKLTWVEGIE APDATTVVFK LSQPYAPFAE GVGFWMSIVP
KHIFEKAGDI TKFANNSDWV GAGAYILKSA TPGQRYVMTR NDKYPYAPAG GAAVDTVEYR
LYPDVNTMQL ALRNGDIDLM GTPVPASAIA SFSGDEKIKL QEVGSLGFAH ITYNVTNEHL
ARPKVRQALS MVVDTKSIIS TVLQGEGSPM TGPISPIFAE YDNTELQPYP FDPAAARKLL
EEDGYADKNG DGKLDGLSFE MVCDQSNPNL TRVAQVVRED AAKAGVELVA SCVERNTFLS
RTKSGDYDLD VSQWAVFDNP MDQLRSTYLS SNPGGINYNL VKDPKLDKLI DEAAVTTDHD
KFAGKIKDLD AYVHEQALLT PLYVEKIQFA YNAGKFTGFQ PSPSDLLGMV TGYSLSQVRP
VG