Gene Sros_7203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7203 
Symbol 
ID8670514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp7950541 
End bp7952286 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content66% 
IMG OID 
ProductABC-type dipeptide transport system periplasmic component-like protein 
Protein accessionYP_003342636 
Protein GI271968440 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.120174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.128293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCCAAT CGCGGACATC TGGCAAGTTT CTAGCTGCTG TGGCAGGAGG TGCGCTCCTG 
CTGACCGCCT GTGCGGGTAA CGACACGGGG GCGGACAAGC CGGCCGCGTC GGGCTCGGCG
TCAGCCTCGG CTCCGGCTGC ACAGAGCATC ACGTACGCCT ACGAGCAGGA GTTCCACTCC
TACAACGGCA ACACCGCTGC GGAGAACGCC ACCCGTAACA ACGGCCCCCT CCAGCGCGTG
CTGACGGGCT TCTGGTTCTA CGGCGACAAG GGCGCCATCA CGCCCGACAA GGACTTCGGC
ACCTACGAGA AGACCTCCGA CGACCCGCTG ACCGTCGAAT ACACGATCAG CGACAAGGCC
GTCTGGTCGG ACGGCACGCC GATCGACTGT GACGACGCCC TGCTCTGGTG GGCCTACAAG
TCCGGCAAGA TCAAGGGCTT CTCCGCCTCC GGCACCGACG GCGTCCAGGA CACCAAGGTG
CTCGACTGCG ACAAGGGCGG CAAGAAGTTC ACCCTCGTCT ACGACAAGCC CTTCGCCGAC
TGGGTGGCCA ACGGCCCCGG CGCGGCCGAG ATCATGCCGG CCCACGTGGT CGAGAAGCAG
GGGGGCCTGT CCGAGGACGA GTTCATCGCC GCGGTCAAGG CCACCGACGC GAAGAAGCTC
GAGAAGGCCA TCAAGTTCTT CAACGACGGC TGGATCTCCG AGGGCACGCT GCCCGCGGCC
GACCTCATCC CCGCCTCCGG CCCGTACAAG CTCTCCAAGA TGGACGCCGG CCAGTCGCTG
ACCTTCGTCG CCAACGACAA GTGGTGGGGC ACCCCCGCCG CGACCCCGAC CATCGTGGAG
CGTTTCATCG CCCTCGACGA GCAGGCCCAG GCCCTGCAGA ACCGCGAGGT CCAGATCGTC
GAGCCGCAGC CCGGCCCCGA CGTGCTCAAC CAGCTCAAGG CCCTGGAGGG CGTCCAGGTC
AACCTGGGCG ACGCCTACAC CTACGAGCAC CTGGACTTCA ACTTCGACTC CAGCCCGTTC
AAGGACAAGG CGCTGCGTGA GGCCTTCGCC AAGTGCGTGC CCCGGCAGCT CATCGTCGAC
AACCTGATCA AGCCCGTCGC GCCGGAGTCC AAGCCTCTGG AAGTCCGCAA CGTGGCCCCG
TTCCAGAACA ACAGCGCCGC GGTCGTCGCG GCCAGCGGCG GCGCCGGCGT CTACGCCCAG
CAGGACATCG AGGGCGCCAA GGCTCTCGTG GAGAAGGCGG GCAAGACCGG CCTGGAAGTC
AAGATCGGCT ACCAGACCCC GAACCCGCGC CGTACCGCTG CCGTGCAGCT CATCATCGAC
TCCTGCAACA AGGCCGGCTT CAAGGTCGTC GACAAGGGCT CCGAGGACTT CTTCGGCACC
GTGATGCCCG CCAACAACTA TGACGTGGCG CTGTACGCCT GGGCGGGCTC CTCCCTGGTC
AGCGGCTGGG CCTCCACCTT CACCACCCCG AAGAAGTGCG ACGGCGAGAA CAAGGGCAAC
AACAACGGCT GCTACTCCAG CAAGAAGGTC GACGAGCTCA TCAAGAAGCT GAACTCGACC
GTGGACCTGG CCGCGCAGGA CCCGATCATC GCTGAGATCG AGAAGAACCT CTGGGCCGAC
CTGGCCACCA TCCCGCTGTT CCAGCACCCG GGCCTCAGCG CGTGGGACGA GACGGTCAAG
AACGTCGTCC CGAACCCCGC GCAGTCGACC ATCACCTGGA ACATGGACAA GTGGAGCCTG
TCGTAA
 
Protein sequence
MFQSRTSGKF LAAVAGGALL LTACAGNDTG ADKPAASGSA SASAPAAQSI TYAYEQEFHS 
YNGNTAAENA TRNNGPLQRV LTGFWFYGDK GAITPDKDFG TYEKTSDDPL TVEYTISDKA
VWSDGTPIDC DDALLWWAYK SGKIKGFSAS GTDGVQDTKV LDCDKGGKKF TLVYDKPFAD
WVANGPGAAE IMPAHVVEKQ GGLSEDEFIA AVKATDAKKL EKAIKFFNDG WISEGTLPAA
DLIPASGPYK LSKMDAGQSL TFVANDKWWG TPAATPTIVE RFIALDEQAQ ALQNREVQIV
EPQPGPDVLN QLKALEGVQV NLGDAYTYEH LDFNFDSSPF KDKALREAFA KCVPRQLIVD
NLIKPVAPES KPLEVRNVAP FQNNSAAVVA ASGGAGVYAQ QDIEGAKALV EKAGKTGLEV
KIGYQTPNPR RTAAVQLIID SCNKAGFKVV DKGSEDFFGT VMPANNYDVA LYAWAGSSLV
SGWASTFTTP KKCDGENKGN NNGCYSSKKV DELIKKLNST VDLAAQDPII AEIEKNLWAD
LATIPLFQHP GLSAWDETVK NVVPNPAQST ITWNMDKWSL S