Gene Strop_3587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3587 
Symbol 
ID5060062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4101130 
End bp4102473 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content67% 
IMG OID640475842 
Productextracellular solute-binding protein 
Protein accessionYP_001160396 
Protein GI145596099 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.69287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTCT TCGCCAGACC ACACCAAGCC TTCGTAGTAG CTGGCGTGCT CGGCCTGGCC 
GTCGGCGCCA CCGCCTGCGG TAGTGACGAC GACGGCAGCA GCAGGGCCGA TTCCCCAGAG
TGCGCGGTAT ACGAGCAGTA CCAGGGCAAC GGCGGCACCG AGGTCTCCAT CTACGCGTCC
ATCCGCGACG CGGAGGCGGA CCTGCTCGAA CAGTCGTGGG AGCAGTTCGC AGACTGCACC
GGCATTGAGA TCGACTACGA GGGCAGCGGC GAATTCGAGG CACAGCTCCA GGTCCGGGTA
GACGGTGGCA ATGCGCCCGA CATCGCCTTC ATCCCACAGC CGGGCCTGCT GAAGCGCTTC
GCGCAGGCCG GCAAGCTCAC ACCGGCCTCG GCCGAGACCA CGGCGATGGC CGAGGAGAAC
TACGCCGCCG ACTGGCTGCG GTACAGCACC ATCGGGGGAG AGTTCTACGG CGCTCCGCTG
GGCTCGAACG TCAAGTCATT CGTCTGGTAC TCGCCGACGA TGTTCCAGGA GCAGGGCTGG
TCGGTGCCGA CCAGCTGGGA CGATCTGATC GAACTCAGTG ACCGGGCCGC CGCCGACGGC
ATCAAGCCGT GGTGCGTCGG CATCGAGTCC GGTGACGCCA CCGGCTGGCC AGCCACCGAC
TGGATCGAGG ACGTACTGCT GCGGACGCAG ACCCCCGAGG TCTACGACCA GTGGACCACA
CACGCCATCC CCTTCAACGA CCAGCGTGTC GTGGACGCGG TCGAACGGGC CGGCACCATT
CTGCGAAACG ACCGGTACGT CAACGGCGGC TACGGCGGCG TCAAGAGCAT CGCCACCACC
TCGTTCCAGG AGGGCGGTCT GCCGATCCTC CAAGACGAGT GCGCCCTGCA CCGGCAGGCG
TCGTTCTACG CCAACCAGTG GCCCGAGGAG AGCCGGGTGG CCGAGGACGG CGACATATTC
GCGTTCTACT TCCCGCCCAT CGACCCGGCG AAGGGCAAGC CGGTGTTGGG AGGCGGCGAG
TTCACCGTCG CCTTCGACGA CCGCCCGGAG GTCCAGGCGG TGCAGACCTA CCTCGCCTCC
GGTGAGTACG CCAACGGCCG GGCCAAGCTG GGCAACTGGG TGTCGGCGAA CACGAAGCTC
GACCTCTCGA ACGTGACCAA CCCGATCGAC CGGCTCTCGG TCGAAATCCT TCAGGACGAG
CAGACAGTCT TCCGCTTCGA CGGCTCCGAC CTGATGCCCG CCGCCGTCGG CGCCGGAACG
TTCTGGAAGG AGATGGTGTC CTGGATCAGC GGCAAGGACA CCGTGGCAGC CCTGGACGCC
ATCGAGAGTT CCTGGCCCAG CTGA
 
Protein sequence
MAVFARPHQA FVVAGVLGLA VGATACGSDD DGSSRADSPE CAVYEQYQGN GGTEVSIYAS 
IRDAEADLLE QSWEQFADCT GIEIDYEGSG EFEAQLQVRV DGGNAPDIAF IPQPGLLKRF
AQAGKLTPAS AETTAMAEEN YAADWLRYST IGGEFYGAPL GSNVKSFVWY SPTMFQEQGW
SVPTSWDDLI ELSDRAAADG IKPWCVGIES GDATGWPATD WIEDVLLRTQ TPEVYDQWTT
HAIPFNDQRV VDAVERAGTI LRNDRYVNGG YGGVKSIATT SFQEGGLPIL QDECALHRQA
SFYANQWPEE SRVAEDGDIF AFYFPPIDPA KGKPVLGGGE FTVAFDDRPE VQAVQTYLAS
GEYANGRAKL GNWVSANTKL DLSNVTNPID RLSVEILQDE QTVFRFDGSD LMPAAVGAGT
FWKEMVSWIS GKDTVAALDA IESSWPS