Gene Strop_0803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_0803 
Symbol 
ID5057246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp891824 
End bp893125 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content66% 
IMG OID640473072 
Productextracellular solute-binding protein 
Protein accessionYP_001157658 
Protein GI145593361 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.437157 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGCA CAGCCAAGGG AGTCGCCGTA CTCGCCTCCA CCGCCCTTGC TCTGACCCTC 
GTCGCCTGTG GCGGCGAGGA GCAGGGTGCG GGGGAAGGGT CAACGACCGA CCCCGCAACC
ATGAAGGCCG AACTGACCTG GTGGGACACC TCGGACCCGA AGAACGAGGG CCCGGTTTTC
CAGGAGCTGA TCGCCAGGTT CAACCAGACC TATCCGAACG TGAAGATCAA CTATCAGTCG
GTTCCGTTCG GTGAGGCCCA GAACCAGTTC AAGACGGCAG CGCAGGCCGA GACGGGTGCG
CCGGACATCC TGCGGGCGGA GGTGGCCTGG GTTCCGGAGT TCGCCTCGCT GGGCTACCTC
TACGCGCTGG ACGGCTCCGA ACTACTCGCC GACGAATCGG ACTTCCTGGC GACTCCGCTC
GCCTCGAACA AGTACGACGG CAAGACCTAC GGCGTCCCGC AGGTGACCGA CACGCTGTCG
CTCATGTACA ACAAGAAGCT GCTGGCCGAG GCGGGTGTCG CCGCGGCGCC GACGACCTGG
GCCGAGCTGA AGACCGCCGC CCAGGCCGTC AAGCAGAAGA CCGGCGCGGA CGGCCTCTAC
CTCAATCCGG CGGGTTACTT CCTGCTGCCG TTCCTCTACG GGGAGGGCGG GGACCTGGTT
GATGTCCCGG CCAAGAAGAT CGTCATCGGC TCGGACCAGA ACGTGGCCGG GCTGAAGATC
GCCAAGGACC TGATCGACAG TGGCGCGGCC GTCCCGCCGC CCGCGACGGA CTCCTACGGA
ACCATGATGA CCCTCTTCAA GGAAGAGAAG GTCGCCATGA TCATTAATGG TCCCTGGGAG
GTCAACAACG TTGCGCAGGC GCCGACCTTC GGCGGCGTGG AGAACCTTGG CATCGCCCCG
GTTCCCAGCG GCTCGGCCCG GGCCGGTGGT CCGGTCGGTG GGCACAACTA CACCATTTGG
TCCGGTATGT CGGAGGAGAA GGTCGAGGCC GCCGTCGCCT TCGTGGCCTT CATGAGCTCC
ACCGAATCAC AGGCCTTCCT CGCGGAGAAG CTCGGGCTGC TGCCGACGCG TAAGTCGGCC
TACGAAATCG ACGCGGTGCT GAGCAATCCG ATCGTGACCG CGTACCAGCC GGCCGTCGAG
GCCGCGGTTG GCCGCCCCTG GATCCCCGAG GCCGGCCAGT TCTTCGACCC GCTGGACCAG
ATGGCCACCG AGGTCCTGAT CCAGAACCGG GACCCGAAGG CCGCGCTCGA CGAGGTCGCG
AAGAGGTACC AGGCTGAGGT CGTCACCGCG TACGGGTTCT GA
 
Protein sequence
MRRTAKGVAV LASTALALTL VACGGEEQGA GEGSTTDPAT MKAELTWWDT SDPKNEGPVF 
QELIARFNQT YPNVKINYQS VPFGEAQNQF KTAAQAETGA PDILRAEVAW VPEFASLGYL
YALDGSELLA DESDFLATPL ASNKYDGKTY GVPQVTDTLS LMYNKKLLAE AGVAAAPTTW
AELKTAAQAV KQKTGADGLY LNPAGYFLLP FLYGEGGDLV DVPAKKIVIG SDQNVAGLKI
AKDLIDSGAA VPPPATDSYG TMMTLFKEEK VAMIINGPWE VNNVAQAPTF GGVENLGIAP
VPSGSARAGG PVGGHNYTIW SGMSEEKVEA AVAFVAFMSS TESQAFLAEK LGLLPTRKSA
YEIDAVLSNP IVTAYQPAVE AAVGRPWIPE AGQFFDPLDQ MATEVLIQNR DPKAALDEVA
KRYQAEVVTA YGF