Gene Strop_0197 details


Gene Information

Locus tag: Strop_0197
Symbol:
ID: 5056633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 228612
End bp: 229682
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 70%
IMG OID: 640472467
Product: hypothetical protein
Protein accession: YP_001157060
Protein GI: 145592763
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
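
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 228612
END_BP = 229682
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 229682 - 228612 + 1 = 1071 bp, and 1071 / 3 - 1 = 356 aa, which agrees with the listed gene and protein lengths.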


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCCGGGT ATCTTGCTTT GGCCGCGATC TGGGGTTCCA GCTTTCTCTT CATCAAGATC 
GGGGTGCGGG AGCTACATCC CCTGCACCTG ACCCTCTACC GGGTCGGGGC CGGCGCGTTA
ACGCTGCTGA TACTGCTCGT GGCGCTGCGC GACCGACTGC CCCGCGAGCC GCGGGTCTGG
GCCCATCTGG TCGTCACCGG TGGGATCGGC GTGGCGCTTC CGTTCACCCT GTTCGGCTAC
GGCGGGGAGC GGGTCGAGTC CATGCTCTCC GGGATCTGGA ACGCCACCAC ACCGCTGGTC
GTGCTGCCCA TGGCGGTGCT GGTCTTCCGT ACCGAACGGA TTACCGCCGC CCGGGCGGTC
GGGCTCGGGC TGGGCTTTCT CGGCGTACTG GTGGTGCTCG GGGTGTGGCA GGGCGCTGGT
GGTTCGCACT TCGTCGGCCA GCTCATGTGC TTCGGCGCCG CGGCCTGCTA CGGGGTGGTC
ATCCCGTACC AGAAGAAGTT CGTCGCGGGC CGCTCCTACT CCGGGCTGGC CCTGTCGGCG
GCGCAGTTGC TGATGGCGCT GGCGCTGCTC ACCATCGTCA CTCCGTTCGT GGCGGGCGTA
CCGCCGATGC CGACCGCCCT CTCCGGCTCG GTCCTGGCCA GCATGGTCGC GCTCGGCGCG
CTCGGCACCG GGTTGGCCTT CCTGATTCAC TTTCGCAATA TCCGGGTCGC TGGCGCCAGT
ACCGCAGCGA CGGTGACCTA CGTGATCCCG GTCTTCGCGG TGCTGGCCGG TGCGCTGGTG
CTCGACGAGC GGCTGACCTG GCACCAACCG GTTGGCGCGG TGGTGGTCCT GCTCGGTGTC
GCGGTCACCC AGGGGCTGAT CGGTCCCCGC CGCCGACCGC GGGCCGTCGC GCTACCGACC
TCGGCAGCCG GCACCTCCGC CTCGGCGGCG GGAGTGCCGG CCACCGCCGA TCAGGAGCTG
CTCCCAGCCC ACGCCACCAG CCGCTCCGCC GGCCAGGCAT TGACCACCCG ATCGGCAGGG
ACGCCGCAGC GGGCGGCGCG TTCGCAGCCG AACCGTTGCC AGTCGAGCTG A
 
Protein sequence
MPGYLALAAI WGSSFLFIKI GVRELHPLHL TLYRVGAGAL TLLILLVALR DRLPREPRVW 
AHLVVTGGIG VALPFTLFGY GGERVESMLS GIWNATTPLV VLPMAVLVFR TERITAARAV
GLGLGFLGVL VVLGVWQGAG GSHFVGQLMC FGAAACYGVV IPYQKKFVAG RSYSGLALSA
AQLLMALALL TIVTPFVAGV PPMPTALSGS VLASMVALGA LGTGLAFLIH FRNIRVAGAS
TAATVTYVIP VFAVLAGALV LDERLTWHQP VGAVVVLLGV AVTQGLIGPR RRPRAVALPT
SAAGTSASAA GVPATADQEL LPAHATSRSA GQALTTRSAG TPQRAARSQP NRCQSS
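
The gene and protein sequences above are printed in blocks of ten characters, so the spacing has to be stripped before any computation. The following is a minimal sketch, not the database's own method, of how the GC content could be computed and the gene translated. It assumes the codon assignments of translation table 11 (the same as the standard code) and that the first codon is a valid start codon, read as M even when it is TTG, as in this gene. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is a start codon and is read as M.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the gene sequence from this record.
    print(translate("TTGCCCGGGT ATCTTGCTTT"))  # prints MPGYLA
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and the 70% GC content.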