Gene Strop_1830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1830 
Symbol 
ID5058289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2095724 
End bp2096929 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content66% 
IMG OID640474100 
Productaminodeoxychorismate lyase 
Protein accessionYP_001158670 
Protein GI145594373 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.194696 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.869905 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACG ATCTGGACCC TGATTTCGAT ACCGATCGGG GAGAGAAGGG CCGGCATCGG 
CGTGGCTACG TGCGAAAGCG TCAGCGCGAG CGGACGCGCA GCTCGGGTGG AGGGCGCGGC
AAGACCGTGC TGGCCCTGCT GCTGACCCTG ACTCTGCTCG GCGGCCTCGG CGGTGGTGCC
TTCTACGGCT TCGAACGGAT CCAGAGCCTT CTCGGTACCC CAGACTACGA CGGCTCCGGG
ACCGAGGCGG TGACGGTCGA GATCCCGGAG GGCGCCCCGA TCGCCGCCAT CGCGGTCACG
CTCTACGAGG CCGGGGTTGT CAAGAGCACC AAGGCATTTG TTGAGGCGGC GGAGGAGAAC
GACCAGAGCA AGAGCATCCA GCCCGGCCAG TACGAGTTGC GGAGGCAGAT GAGTGGCGAG
TATGCCGTGG CCGCGATGCT GGACCCGAAG AACCGGATCG TGAACGGGAT CACTGTTCCC
GAGGGGCGGA CCGCGAAGAG CATCTACAAG CTGCTCGCCG AAGAGACCGA GATCCCGGTG
GAGGAGTTCG AGGCGGCGGC GAAGGATCCG CTCGCGCTCG GTGTCCCGGA GTGGTGGTTC
ACGCGCACGG ACGACCGGAC GGCCGAGCCG TCGATCGAGG GCTTCCTCTT CCCCGACACC
TACGAGCTTC CCGCGGAACC CACGGCTGAG TCGATTCTCG CGCTGATGGT GGAGCGATTC
CTCACCGTCG CCGAGGAGTT GGAGTTCGTC GACCGGGTGC AGAACGAACG CCAGATCGCG
CCCTACGAGG CGCTGATCGT CGCGTCGCTC GCCCAGGCCG AGGCGGGCAC TCCGGAGGAC
CTCGGCAAGG TCGCCCGGGT CGCCTACAAC CGGGTCTACG GCGACTTCCA GTGCAACTGC
CTGGAGATGG ACGTCACGGT CAACTACTAC CTCGAGTTGA CCGGCCAGGA GACCAAGACC
TCGGCCGAGA TGACCCAGGA CGAGTTGCTT GACACGGAGA GCCCGTACCG CCGCAAGCTC
GAGGGCCTGA TCCCCACGCC GATCAACAAC CCGGGTCAGT TGGCGATGGA AGGCGCCATG
GACCCGCCGC CGGGAAAGTG GCTCTTCTTC GTCGCGATCA ACAAAGAGGG GGAGTCCGCC
TTCGCGGAAA CCTACGACGA GCATCTGCGC AACGAGGCCA AGGCGAGGGA GGCGGGTGTC
ATCTGA
 
Protein sequence
MIDDLDPDFD TDRGEKGRHR RGYVRKRQRE RTRSSGGGRG KTVLALLLTL TLLGGLGGGA 
FYGFERIQSL LGTPDYDGSG TEAVTVEIPE GAPIAAIAVT LYEAGVVKST KAFVEAAEEN
DQSKSIQPGQ YELRRQMSGE YAVAAMLDPK NRIVNGITVP EGRTAKSIYK LLAEETEIPV
EEFEAAAKDP LALGVPEWWF TRTDDRTAEP SIEGFLFPDT YELPAEPTAE SILALMVERF
LTVAEELEFV DRVQNERQIA PYEALIVASL AQAEAGTPED LGKVARVAYN RVYGDFQCNC
LEMDVTVNYY LELTGQETKT SAEMTQDELL DTESPYRRKL EGLIPTPINN PGQLAMEGAM
DPPPGKWLFF VAINKEGESA FAETYDEHLR NEAKAREAGV I