Gene Sros_7456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7456 
Symbol 
ID8670777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8243771 
End bp8245462 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content66% 
IMG OID 
ProductABC-type dipeptide transport system, periplasmic component 
Protein accessionYP_003342882 
Protein GI271968686 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.388989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGG TTAGATTCGC AGCCACAGCC GGCGCGCTGG GGCTGGCGTT GCTGCTGACC 
GCGTGCTCGC CGGGCACCTC GGGCGGTGCC CGGGAGACCA CCACCAACGC CGCCGGCGGC
ACGTCGGACT CGATCAGCCT GCGCATCAAC AACGCCACCA CGTACTCCCG CAACTTCAAC
ATCTACTCCC CCTCCACCGA CATCGCCCCG CAGATCAGCC TCATCTATGA GCCGCTGGTG
CGCCGCAACG TCCTCAAAGG CGGCCGGCTG GAACCGTGGC TGGCCGAGTC GTGGGAGTGG
AGCGACGGCG ACAAGACCGT CACCCTCAAG CTCCGCACCG ACGTGAAGTT CTCCGACGGC
ACGCCGATGA CCAGCAAGGA CGTCGCCTTC ACGCTGAACA TCTCGCTGGA GCACCCCGAG
CTGAACACCG GCGGCCAGAC CTACGTCTCG GCCGAGGCGA CCGACGACCA CACCGTGGTC
GTCAAGTGGA AGAAGCCCGC GCAGCTCGAC TTCTACCGCT TCGCGCTCGG CGTCACCGGC
TTCGCGCCCC GGATCGTTCC CGAGCACATC TGGAAGGACA AGGACCTCAA GACCTGGACG
AACCCCGACC CGATCGGCAC AGGCGTGGGC AAGCTCACCC AGTTCACCCC GCAGCAGTTC
ACCCTGGAGA CGCGCGCGGA CTACTGGGGC GGCCAGTTCC CGATGAAGTC GATCAAGATC
GTCGCGACCG GCGGTGACGA CCAGACCAAG GCGCGCCTGC TCAAGGGTGA CATCGACTAC
GCCACCATTT CCTGGCCGAA CGCCGAGCAG GAATACATGG CCCGGAACCC GAAGGCCAAC
GTCTACAAGA CGTTCCACAC CGGCGGGGAG GAGTCGCTGC TGTTCAACCT GGCCAAGGAG
CCCTTCTCCG ACGTCAACGT GCGCAAGGCG CTGGCCATGA GCGTGGAACG TGCCAGCGTG
CTCAAGCTCG CCCCCACCGG CCAGGAGCCG GCCAACGCCT GTGGCCTGGA GCCGCAGGTG
TACGCCGAGT TCATGGCGCC GGAGTGCAAG CCGCAGCCCC TCGACGTCGA GGGCGCCAAG
AAGGCCCTGG CCGACGGCGG CTGGACCGTC GAGGGCGGCC GGCTCGCCAA GGACGGCAAG
ACCTACCCGC TGAGCATCAA GGTCGTGCAG GAGTACGCCA ACTGGATGGC CTACGGCAAG
GGCATGCAGG ACCAGTGGAA GTCCAACCTG GGCCTGGACG TGAAGGTCAT GGCCATCCCC
GAGGAGAACT ACGACCCGCA GCTCAACGAG GGCGACTACG ACATGGCCCT CTACTGGACC
GGCAACTCCA ACGGCCTGTA CTCCGTCTTC GCCGACCAGC TCGACTCCGA CAAGTACAAG
CCCATCGGCA AGGACGCCCA GTACCAGAAC CAGAGCCGCT GGAAGGACAC GTCCACCACG
CCGCTGCTCG ACAGGCTGCG CGACACCGTC GGCGACCCGG CCGCCCAGAC GGAGGCCGGC
TACCAGCTCC AGAAGGTCGT GCTGGACCAG GTGCCGTTCT CCCCGATGTT CACCGCCGAC
TGGTTCGTCG AGATGAACCA GTCCCGGTGG GTGGGCTGGC CCGAGACGGG CGAGACCGAC
CACGTCCCGC ACAGCGCCCT CGGCCCTGAC ATCGTCATGA CCCTCAAGGG TCTCAAGCCC
GCGGGCAAGT AG
 
Protein sequence
MKKVRFAATA GALGLALLLT ACSPGTSGGA RETTTNAAGG TSDSISLRIN NATTYSRNFN 
IYSPSTDIAP QISLIYEPLV RRNVLKGGRL EPWLAESWEW SDGDKTVTLK LRTDVKFSDG
TPMTSKDVAF TLNISLEHPE LNTGGQTYVS AEATDDHTVV VKWKKPAQLD FYRFALGVTG
FAPRIVPEHI WKDKDLKTWT NPDPIGTGVG KLTQFTPQQF TLETRADYWG GQFPMKSIKI
VATGGDDQTK ARLLKGDIDY ATISWPNAEQ EYMARNPKAN VYKTFHTGGE ESLLFNLAKE
PFSDVNVRKA LAMSVERASV LKLAPTGQEP ANACGLEPQV YAEFMAPECK PQPLDVEGAK
KALADGGWTV EGGRLAKDGK TYPLSIKVVQ EYANWMAYGK GMQDQWKSNL GLDVKVMAIP
EENYDPQLNE GDYDMALYWT GNSNGLYSVF ADQLDSDKYK PIGKDAQYQN QSRWKDTSTT
PLLDRLRDTV GDPAAQTEAG YQLQKVVLDQ VPFSPMFTAD WFVEMNQSRW VGWPETGETD
HVPHSALGPD IVMTLKGLKP AGK